Library Cataloging
Acronym Definitions
Adding semantic meaning to text can only help our users. High-recall extraction of acronym-definition pairs with relevance feedback by Anna Yarygina and Natalia Vassilieva has been publshed by HP Laboratories as HPL-2012-46.
This paper addresses the problem of extracting acronyms and their definitions from large documents in a setting, when high recall is required and user feedback is available. We propose a three step approach to deal with the problem. First, acronym candidates are extracted using a weak regular expression. This step results in a list of acronyms with high recall but low precision rates. Second, definitions are constructed for every acronym candidate from its surrounding text. And last, a classifier is used to select genuine acronym- definition pairs. At the last step we use relevance feedback mechanism to tune the classifier model for every particular document. This allows achieving reasonable precision without losing recall. As opposed to existing approaches, either created to be generic and domain independent or tuned to one particular domain, our method is adaptive to an input document. We evaluate the proposed approach using three datasets from different domains. The experiments prove the validity of the presented ideas.
-
Frar: Extending Frbr Concepts To Authority Data
FRAR: Extending FRBR Concepts to Authority Data by Glenn E. Patton is now available.The IFLA FRANAR Working Group is charged with extending the concepts of the IFLA Functional Requirements for Bibliographic Records to authority data. The paper reports...
-
Marbi
The minutes for the MARBI meeting held during the ALA Annual Conference in June 2003 are now available. ...
-
Opacs
"An XML document repository: A new home for University at Buffalo, library systems" by Mark Ludwig appears in Library Hi Tech News vol. 20 no. 6 (2003) I first heard Mark describe this at a LITA Forum a few years back and it blew my mind. The millions...
-
Open Source
An interesting project, Bibliographic.This project intends to provide a comprehensive and high quality bibliographic function within OpenOffice. The planned bibliographic function will utilize the latest open standards and will make the fullest use of...
-
Cites & Insights
Cites & Insights 3:4 (April 2003) is available as a PDF download. The issue includes:Perspective: A Zine is Not a WeblogThe Library Stuff (three items)Bibs & BlatherThe Filtering/Censorware Follies: CIPA and the SupremesThe Good Stuff (nine items)Trends...
Library Cataloging