Library Cataloging
UNICODE
Unicode is coming to MARC. For a gentle introduction, see "The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!)" by Joel Spolsky.
Before I get started, I should warn you that if you are one of those rare people who knows about internationalization, you are going to find my entire discussion a little bit oversimplified. I'm really just trying to set a minimum bar here so that everyone can understand what's going on and can write code that has a hope of working with text in any language other than the subset of English that doesn't include words with accents. And I should warn you that character handling is only a tiny portion of what it takes to create software that works internationally, but I can only write about one thing at a time so today it's character sets.
-
Unicode and MARC
News from LC. The revised character set specifications are now posted on the MARC site. They take into account the use of the full Unicode repertoire, as opposed to only the MARC-8 subset of Unicode, and also include the lossless and lossy techniques...
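To make the two techniques concrete, here is a minimal Python sketch. It is not LC's specification text. It assumes the lossless technique writes each character that falls outside the target repertoire as a hexadecimal numeric character reference of the form &#xXXXX;, and that the lossy technique substitutes a single vertical bar. The function names are hypothetical, and plain ASCII stands in for the MARC-8 repertoire, which in reality is much larger.

```python
import re


def in_toy_repertoire(ch):
    # Assumption: ASCII stands in for the MARC-8 repertoire in this sketch.
    return ord(ch) < 0x80


def to_limited(text, lossless=True):
    """Convert Unicode text to the toy repertoire.

    lossless=True  -> unmappable characters become &#xXXXX; references
    lossless=False -> unmappable characters become a single "|"
    """
    out = []
    for ch in text:
        if in_toy_repertoire(ch):
            out.append(ch)
        elif lossless:
            # At least four uppercase hex digits, e.g. U+0457 -> &#x0457;
            out.append("&#x{:04X};".format(ord(ch)))
        else:
            out.append("|")
    return "".join(out)


def from_lossless(text):
    """Reverse the lossless form by decoding &#xXXXX; references."""
    return re.sub(
        r"&#x([0-9A-Fa-f]{4,6});",
        lambda m: chr(int(m.group(1), 16)),
        text,
    )
```

Only the lossless form can be reversed: once a character has been replaced by a vertical bar, nothing records which character it was. Note that this sketch would also decode a reference that was already present as literal text in the source data.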
-
Unicode
Michael Doran collected some resources concerning MARC and Unicode. Coded Character Sets items include: A Technical Primer for Librarians; MARC Unicode; Resources on the Web: With an emphasis on library automation and the internet
...
-
Articles
A few interesting articles in the current issue of RLG DigiNews: Computational Linguistics Meets Metadata, or the Automatic Extraction of Key Words from Full Text Content; Character Sets and Character Encoding: A Brief Introduction; LEADERS: Linking EAD to...
-
MARC
MARC::Charset is a package that allows you to easily convert between the MARC-8 character encodings and Unicode (UTF-8). The Library of Congress maintains some essential mapping tables and information about the MARC-8 and Unicode environments. MARC::Charset...
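For a feel of what such a conversion involves, here is a toy Python sketch. It is not MARC::Charset's code or API. It illustrates one well-known difference between the two environments: MARC-8 stores a combining diacritic before its base letter, while Unicode stores it after. The function name is hypothetical, the three byte values are taken from the ANSEL table as I recall it and should be checked against LC's mapping tables, and escape sequences and all other character sets are ignored.

```python
import unicodedata

# Assumption: a tiny subset of the ANSEL combining marks used by MARC-8.
ANSEL_COMBINING = {
    0xE1: "\u0300",  # grave accent
    0xE2: "\u0301",  # acute accent
    0xE8: "\u0308",  # umlaut / diaeresis
}


def toy_marc8_to_unicode(data: bytes) -> str:
    """Convert ASCII plus a few ANSEL diacritics to a Unicode string."""
    out = []
    pending = []  # combining marks seen before their base letter
    for b in data:
        if b in ANSEL_COMBINING:
            pending.append(ANSEL_COMBINING[b])
        elif b < 0x80:
            # Emit the base character first, then the marks that preceded it.
            out.append(chr(b))
            out.extend(pending)
            pending.clear()
        else:
            raise ValueError("byte 0x{:02X} is outside this toy table".format(b))
    if pending:
        raise ValueError("combining mark with no base character")
    # Composing to NFC is a choice made for this sketch.
    return unicodedata.normalize("NFC", "".join(out))
```

For example, the bytes `b"caf\xe2e"` come out as "café", with the acute accent moved from before the final "e" to after it and then composed into a single character.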
-
Cataloging In Other Languages
The other day I mentioned the problem of identifying just what the language is on some materials. Ukrainian and Russian or the Scandinavian languages can appear very similar to someone who does not speak them. Owen Massey brought language guessers to...