Library Cataloging
Melvyl Recommender Project
Steve Toub of the California Digital Library has posted this to a couple of e-mail lists.
The Melvyl Recommender Project, which explored next-generation services for library catalogs, has reached its conclusion. This project was funded by the Andrew W. Mellon Foundation.
Popular commercial services such as Google, eBay, Amazon, and Netflix have evolved quickly over the last decade to help people find what they want, developing information retrieval strategies such as usefully ranked results, spelling correction, and recommendations. Library catalogs, in contrast, have changed little and are not well equipped to meet changing needs and expectations.
The Melvyl Recommender Project explored methods and feasibility of closing this gap. A subsequent extension project carried out deeper explorations into the most interesting and promising questions raised during the original project and added obvious missing pieces of functionality. The principal area of investigation was the impact of adding full-text objects to what had previously been a metadata-only index.
Overall findings from both portions of the project include:
The text-based discovery application, the eXtensible Text Framework (XTF), which was the backbone of the project's system (known as "Relvyl"), proved capable of scaling to millions of records and hundreds of concurrent users, indicating that this is an approach worth pursuing for providing ranking, recommendation and other types of functionality with an online catalog.
Use of an index-based, single-word spelling correction algorithm addressed 90 percent of misspelled single words.
Initial examination of faceted browsing and FRBR-like document groups indicated that each of these features could substantially improve the patron's experience of working with large result sets.
User assessment confirmed that users prefer relevance-ranked results over unranked results, although more investigation is required to determine whether content-based ranking with or without different types of weights (based on circulation or holdings) is more effective.
Two types of recommendation strategies were explored: circulation-based ("patrons who checked this out also checked out...") and text-similarity ("More like this..."). User assessment was conducted against the first type and showed that users like getting recommendations, which are useful for performing academic tasks and can also serve a unique query expansion function.
Adjustments to keyword searching strategies, document scoring and the index-based spelling correction dictionary allowed for an effective combination of full-text and metadata-only records into one system, in which neither type of record was privileged.
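To make the circulation-based strategy ("patrons who checked this out also checked out...") concrete, here is a minimal sketch in Python. It is not the project's implementation, which the announcement does not describe. It assumes the simplest possible approach, counting how often two items appear in the same patron's loan history, and the patron ids, item ids and function names are all hypothetical.

```python
from collections import Counter, defaultdict
from itertools import combinations


def build_co_checkouts(loans):
    """Count how often each pair of items was borrowed by the same patron.

    `loans` maps a (hypothetical) patron id to the set of item ids
    that patron checked out.
    """
    co_counts = defaultdict(Counter)
    for items in loans.values():
        # Every unordered pair in one patron's history counts once.
        for a, b in combinations(sorted(set(items)), 2):
            co_counts[a][b] += 1
            co_counts[b][a] += 1
    return co_counts


def recommend(co_counts, item, limit=3):
    """Return up to `limit` items most often checked out alongside `item`."""
    # Sort by descending count, breaking ties by item id for stable output.
    ranked = sorted(co_counts.get(item, {}).items(),
                    key=lambda pair: (-pair[1], pair[0]))
    return [other for other, _ in ranked[:limit]]


# Made-up loan histories, for illustration only.
loans = {
    "patron1": {"A", "B", "C"},
    "patron2": {"A", "B"},
    "patron3": {"A", "C", "D"},
    "patron4": {"B", "D"},
}
co_counts = build_co_checkouts(loans)
print(recommend(co_counts, "A"))  # ['B', 'C', 'D']
```

A production system would likely need to normalize these raw counts so that very popular items do not dominate every list, and to anonymize or aggregate loan data to protect patron privacy.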
Much of the functionality explored in both phases of the project can be found in the Relvyl prototype.
More information about the entire project can be found on the CDL website.
OPAC
-
Extensible Text Framework (xtf)
The California Digital Library (CDL) is pleased to announce a new release of its search and display technology, the eXtensible Text Framework (XTF) version 2.1. XTF is an open source, highly flexible software application that supports the search, browse...
-
Marc Records Under The Microscope
The University of North Texas (UNT)-Texas Center for Digital Knowledge (TxCDK) announces a project investigating the coding of information in MARC records from the OCLC WorldCat database. The Institute of Museum and Library Services, an independent Federal...
-
Digital Repository Management System
The Fedora project was established under the auspices of the Andrew W. Mellon Foundation to build a digital object repository management system based on the Flexible Extensible Digital Object and Repository Architecture (Fedora). The new system, designed...
-
New Melvyl Catalog
Seen on What's New at the CDL. California Digital Library (CDL) rolls out the new Melvyl-T catalog allowing library patrons - faculty, students, staff, and other researchers as well as the public at large - to search a state-of-the-art catalog of over...
-
Scout Portal Toolkit
Internet Scout Project is pleased to announce the 1.0 release of the Scout Portal Toolkit (SPT)! This open source software package, funded by the Andrew W. Mellon Foundation, allows groups or organizations to develop a portal online without making a big...