Library Cataloging
Metadata Extraction
Effective Metadata Extraction from Irregularly Structured Web Content by Baoyao Zhou, Wei Liu, Yu Yang, Weichun Wang Ming Zhang, (HPL-2008-203)
Metadata extraction is one crucial module for domain specific Web content discovery and management, because the accuracy and completeness of the extracted metadata would directly affect the quality of subsequent domain information services. Our Online Course Organization project aims to build an online course portal to serve the course information obtained from the Web. Since most course pages are irregularly structured, most existing approaches are not effective for extracting course metadata. In this paper, we proposed a novel hierarchical clustering approach to generate a web page semantic structure model from the DOM tree, called Logical Structure Model, such that the hidden patterns and knowledge can be revealed and used to facilitate identifying course metadata. The experimental results have shown that our solution can achieve effective metadata extraction
-
Free Your Metadata
Free Your Metadata is a site that describes using Google Refine and some extensions to clean and reconcile metadata, and automate the creation of personal, corporate and geographic names. Clean up Clean up your metadata and discover how to handle those...
-
Acronym Definitions
Adding semantic meaning to text can only help our users. High-recall extraction of acronym-definition pairs with relevance feedback by Anna Yarygina and Natalia Vassilieva has been publshed by HP Laboratories as HPL-2012-46. This paper addresses the problem...
-
Learning Object Metadata
The latest issue of Interdisciplinary Journal of Knowledge and Learning Objects has some interesting articles.Tree View Editing Learning Object Metadata by Zeynel Cebeci and Yoldas Erdogan.This paper introduces and examines an authoring tool called as...
-
Metadata Tools And Learning Objects
A Framework for Metadata Creation Tools by Valentina Malaxa and Ian Douglas appears in the initial issue of Interdisciplinary Journal of Knowledge and Learning ObjectsMetadata is an increasingly important aspect of resource discovery. Good metadata has...
-
Metadata Software
Describethis is a metadata extraction and generation tool. It can convert most kinds of metadata found in documents (GeoTags, bibTex, EXIF, RDF, Creative Commons, etc) to valid Dublin Core elements and also generate keywords from the parsed content. It...
Library Cataloging