Chapter 3. The Reader Faced with a Digital Library: the Experience of the
3.4. Current problems
42 Digital Libraries
The number of profiles is impressive, but it would be useful to establish how many of those profiles are actually used on a regular basis. From the very beginning a large number of registered profiles of researchers that never actually use the system could be observed. These profiles were created by users who were experimenting and exploring the system. It is still too early to judge how large this group of profiles is and what might become of it.
3.3.2.3. The most frequently used sources
Databases that are part of the multimedia library as well as PubMed are frequently used. Structured information is therefore the most frequently used source.
Amongst the websites used, a collection of the latest publications in electronic journals are the most consulted. Internal web pages are the least consulted. This phenomenon was, however, anticipated during Biolib’s development process.
Even if it is still too early to scientifically analyze the usage of Biolib, it is still important to underline that Biolib disposes of very efficient statistical tools. These were conceived to enable the multimedia library to adapt sources and services to the needs of users. All data traffic and transactions that are carried out on the portal are stored in a database.
The Reader Faced with a Digital Library 43 quite satisfactory; however, this may not remain the case if the number of sources increases.
3.4.1.2. Selecting data based on categories or clusters?
Separating results according to their topic is another option. The idea of introducing a mediator when it comes to selecting information was quickly abandoned. The experience with the training of the MeSH thesaurus of the National Library of Medicine has convinced the multimedia library to make a similar investment. However, a classification taxonomy of Biolib’s search results happens before the user actually gets to see them. After a long study of tools suggested by programs such as Verity K2 and linguistic corpuses which cover similar fields to those researched at the Pasteur Institute, the multimedia library abandoned this project. Classification of the sources relies heavily on human resources. Employing people, of course, represents an expense that must be covered. This amount of funding has to be justified. Furthermore, researchers continually use new terminology in their search entries. These terms and concepts are often not found in taxonomies and, even if they are, they are not necessarily linked to the same concept.
Another interesting alternative might therefore be separating the results with the help of tools that are linked to clusterization. However, tests on programs that have appeared on the market have convinced neither the information officers nor the IT consultants working on this project. Those programs that were tested in 2003 were difficult to integrate into a complex technical architecture. Some of them were entirely incompatible with the existing system. Others were approved by the information officers who believe that they would add value to the system. The disadvantage of these programs and the reason why, in the end, they were not installed is the fact that their navigation within the search results was rather complex. When carrying out the same search twice, different results appeared.
Another disadvantage consisted of the fact that it was impossible to click on an icon that would take the user back to the previous page. Last, but not least, the logic followed by this program was not very transparent and was not always easy to follow.
The multimedia library is currently carrying out tests on technically advanced versions of clusterization programs. The market for this type of programs has stabilized and the programs that are offered can be adapted much more easily than their predecessors. Their integration into Biolib is therefore much more straightforward. However, we already know that clusterization will not be entirely satisfactory. A mixed solution of a semi-automatic analysis of the user’s search based on the calculation of clusters at the same moment as the hits are displayed on screen may be the ideal solution. The question of whether there are enough human and financial resources to opt for this mixed solution, however, still remains
44 Digital Libraries
unanswered. If the answer is no, the option of using a clusterization tool on its own still remains. This option, however, would put an end to all attempts to introduce a fine taxonomy into Biolib.
3.4.2. Can the costs be controlled?
Introducing a portal such as Biolib brings about the challenge of overcoming financial difficulties. In order to control the financial situation, the budget needs to be analyzed in advance. The integration of new resources that often need to be paid for still creates difficulties for libraries and documentation centers. These problems become increasingly difficult to deal with. Open access-based projects will certainly force the producers and distributors of scientific information to review their prices.
In the short term, however, it is still very difficult to predict the costs for access to these sites from one year to another.
Furthermore, the creation of a common interface that enables the user to search all those sources at once might oblige the library to buy new programs. Accessing a new dynamic web server might, for example, force the library to buy a linking tool.
It might be necessary to extend licenses. As libraries do not have an impact on the development of what is offered on the market, it is not always possible to plan what developments will be required in the course of following year. Buying new programs is not only a strain on the budget when the program is bought, but licenses need to be renewed and therefore represent a constant strain on the budget.
Moreover, maintenance of these programs is another burden on the budget. There is no guarantee that a chosen solution will be the right one. Technical developments, economic changes, notably in the policies of publishers, and also financial restrictions have an impact on possible solutions.
In the medium term, the budget of the multimedia library will no longer be sufficient to cover maintenance and the costs for the development of Biolib. As is already the case for subscriptions to electronic journals, all other costs incurred by the multimedia library will have to be shared with its users. This trend is inevitable.
3.4.3. How to create a new dialog with the user
The scientific multimedia library initiated the creation of Biolib. However, it remained difficult to make all members of staff, especially those working at reception, engage with this project. The idea that the user becomes entirely independent within his/her research meant some librarians came to question their profession and ethos.
The Reader Faced with a Digital Library 45 Since the introduction of Biolib and the continuous increase in its usage, it has become much easier to engage with members of staff who were not part of the initial project. The creation of teams that ensure the promotion of the portal created a new form of dialog with the user. Information officers now go to the researchers’
laboratories and know about the researchers’ work environment and their specific needs. Qualities such as being able to listen and to give understandable explanations are highly appreciated by the user.
To finalize the portal, an online helpdesk has been installed on the library’s server. Questions are answered according to the specializations of the staff. This enables the user to benefit from the individual qualities and skills of each member of staff. This service was installed in 2004 and its usage has been continually increasing ever since. The questions asked are very diverse and all answers given are of the same quality. A traditional helpdesk within the library could not have provided this level of service as the quality of the answer would strongly depend on the qualifications of the person working at the helpdesk at the time the user asked his/her question.
In conclusion, the introduction of Biolib has given an overview of the user’s behavior regarding digital information. This overview is not at all linked to the idea of whether the user is specialized in a particular field or not. The fact that all resources are found in a single place underlines the virtues as well as the shortfalls of digital libraries. The difficulties professional librarians were faced with became obvious. They needed to change their behavior to adapt themselves to the needs of users. They also needed to familiarize themselves with new software and work hand in hand with IT consultants. Last, but not least, it still remains difficult to keep an overview of the budget as the cost of acquisition is constantly changing. The creation of digital libraries and documentary portals shows to what extent the digital revolution has provoked changes in the behavior of users. These changes are irreversible and have led to the creation of new services within libraries.
46 Digital Libraries
3.4.4. Appendix: Biolib’s search interface
Figure 3.1. Multisource search