Library Cataloging
MARC
MARC::Charset is a package that allows you to easily convert between the MARC-8 character encodings and Unicode (UTF-8). The Library of Congress maintains some essential mapping tables and information about the MARC-8 and Unicode environments. MARC::Charset is essentially a Perl implementation of the specifications found at LC, and supports the following character sets:
- Latin (Basic/Extended + Greek Symbols, Subscripts and Superscripts)
- Hebrew
- Cyrillic (Basic + Extended)
- Arabic (Basic + Extended)
- Greek
- East Asian characters: 13,478 'han' characters, Japanese Hiragana and Katakana (172 characters), Korean Hangul (2,028 characters), East Asian punctuation marks (25 characters), and 'Component Input Method' characters (35 characters)
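To give a feel for what such a conversion involves, here is a minimal Python sketch. It is not MARC::Charset and not the Library of Congress tables: the function name `marc8_subset_to_unicode` and the five-entry mapping are assumptions made for illustration, covering only ASCII plus a handful of Latin combining diacritics. It shows one well-known wrinkle: in MARC-8 a combining diacritic comes before its base letter, while in Unicode the combining mark follows it.

```python
import unicodedata

# Illustrative subset only: a few MARC-8 (ANSEL) combining diacritics
# mapped to their Unicode combining marks. The real tables are far larger.
ANSEL_COMBINING = {
    0xE1: "\u0300",  # grave
    0xE2: "\u0301",  # acute
    0xE3: "\u0302",  # circumflex
    0xE4: "\u0303",  # tilde
    0xE8: "\u0308",  # umlaut / diaeresis
}

ESC = 0x1B  # introduces a character-set switch, which this sketch does not handle


def marc8_subset_to_unicode(data: bytes) -> str:
    """Convert ASCII plus the diacritics above from MARC-8 order to Unicode."""
    out = []
    pending = []  # diacritics seen but not yet attached to a base character
    for byte in data:
        if byte in ANSEL_COMBINING:
            # MARC-8 puts the diacritic first, so hold it until the base arrives.
            pending.append(ANSEL_COMBINING[byte])
        elif byte == ESC:
            raise ValueError("escape sequences are outside this sketch")
        elif byte < 0x80:
            out.append(chr(byte))
            out.extend(pending)  # Unicode order: base first, then marks
            pending.clear()
        else:
            raise ValueError(f"byte 0x{byte:02X} is outside this sketch's subset")
    if pending:
        raise ValueError("diacritic with no base character")
    # Compose base + mark pairs into precomposed characters where they exist.
    return unicodedata.normalize("NFC", "".join(out))


print(marc8_subset_to_unicode(b"caf\xe2e"))
```

A real converter also has to track the escape sequences that switch between the Hebrew, Cyrillic, Arabic, Greek, and East Asian sets listed above, which is the bookkeeping a package like MARC::Charset takes care of.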
-
Unicode And Marc
There is a new e-mail discussion list, UNICODE-MARC. There don't seem to be any posts yet. The Library of Congress has set up a special listserv for the MARC 21 user community to discuss and arrive at consensus on various issues concerning the implementation...
-
Marc & Unicode
Changes to the MARC-8 character set and Unicode mappings have been made affecting these characters: CJK characters from the EACC repertoire; Eszett, Euro sign; Greek symbols; Alif...
-
Marc
The following documents are available for review by the MARC 21 community: Proposal 2004-07: Applying Field 752 (Added Entry - Hierarchical Place Name) for Different Purposes in the MARC 21 Bibliographic Format; Proposal 2004-08: Changing the MARC-8 to...
-
Marc Character Set
After many years and several thwarted attempts at getting the American National Standard for Extended Latin (ANSI/NISO Z39.47) character set assigned an escape sequence for use according to ISO 2022 in MARC 21 records, the international Registration Authority...
-
Web Standards
Library Techlog pointed out this useful tool for standards-compliant Web pages. The Demoroniser cleans up MS-only codes. Western language HTML documents are written in the ISO 8859-1 Latin-1 character set, with a specified set of escapes for special characters....