Library Cataloging
MARC Character Set
After many years and several thwarted attempts to get the American National Standard for Extended Latin (ANSI/NISO Z39.47) character set assigned an escape sequence for use according to ISO 2022 in MARC 21 records, the international Registration Authority has finally made an assignment that allows users to designate and invoke the ANSEL extended Latin set as a graphic character set. Those of you who understand the workings of character sets in MARC records will know that this is good news.
The hexadecimal value assigned as the final character in ISO 2022 escape sequences that designate and invoke ANSEL is:
hex 45 [represented graphically by the Latin letter capital E]
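To make the byte layout concrete, here is a minimal Python sketch of how such an escape sequence might be assembled. Only the final byte (hex 45) comes from the announcement. The intermediate bytes are assumptions based on my reading of the MARC 21 character set specifications and the ISO registration: hex 28 "(" or hex 29 ")" to pick G0 or G1, followed by a second intermediate hex 21 "!". The function names are made up for illustration. Check the published specifications before relying on any of this.

```python
ESC = 0x1B          # ISO 2022 escape character
ANSEL_FINAL = 0x45  # final byte "E", as stated in the announcement

# Assumptions, not stated in the announcement:
G0_INTERMEDIATE = 0x28      # "(" designates a set as G0
G1_INTERMEDIATE = 0x29      # ")" designates a set as G1
SECOND_INTERMEDIATE = 0x21  # "!" assumed to precede the final byte for ANSEL


def ansel_escape(target="G1", second_intermediate=SECOND_INTERMEDIATE):
    """Build a hypothetical escape sequence designating ANSEL as G0 or G1.

    Pass second_intermediate=None to leave out the assumed "!" byte.
    """
    if target == "G0":
        first = G0_INTERMEDIATE
    elif target == "G1":
        first = G1_INTERMEDIATE
    else:
        raise ValueError("target must be 'G0' or 'G1'")
    parts = [ESC, first]
    if second_intermediate is not None:
        parts.append(second_intermediate)
    parts.append(ANSEL_FINAL)
    return bytes(parts)


def has_ansel_final(sequence):
    """Return True if the bytes start with ESC and end with the ANSEL final byte."""
    return len(sequence) >= 3 and sequence[0] == ESC and sequence[-1] == ANSEL_FINAL


if __name__ == "__main__":
    # Show the assumed G1 sequence as space-separated hex bytes.
    print(ansel_escape("G1").hex(" "))
```

Keeping the intermediates as parameters means the sketch still works if the second intermediate turns out to differ from what I have assumed here.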
For many years the MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media have indicated that the final character for escape sequences for ANSEL was not assigned. The first registration request forwarded by NISO to the Registration Authority was delayed by other American bodies who objected to the inclusion of mappings of ANSEL characters to ISO/IEC 10646 characters and the call to revise the registration process itself. The most recent submission of ANSEL for assignment of an escape sequence made its way through the registration process and the final byte of the escape sequence was registered on September 26, 2002.
The Network Development and MARC Standards Office would like to thank the people at NISO, RLG, and Everson Typography who helped get this assignment through the registration process. Although a small detail in the larger context of MARC character encodings, it was a void that we have long wanted to fill.
Information about this new escape sequence will be added to the online version of the MARC 21 Specifications as well as the next printed edition of that document.
From the MARC mail list.
-
Unicode And Marc
News from LC. The revised Character set specifications are now posted on the MARC site. They take into account the use of the full Unicode repertoire, as opposed to only the MARC-8 subset of Unicode, and also include the loss-less and lossy techniques...
-
Unicode And Marc
There is a new e-mail discussion list, UNICODE-MARC. There don't seem to be any posts yet. The Library of Congress has set up a special listserv for the MARC 21 user community to discuss and arrive at consensus on various issues concerning the implementation...
-
Sorting
Guidelines for the Non-Sorting Control Character Technique has been posted by the Network Development and MARC Standards Office, Library of Congress. With Proposal No 98-16R (Nonfiling characters in all MARC formats), the MARC 21 community approved the...
-
Marc
MARC::Charset is a package that allows you to easily convert between the MARC-8 character encodings and Unicode (UTF-8). The Library of Congress maintains some essential mapping tables and information about the MARC-8 and Unicode environments. MARC::Charset...
-
Web Standards
Library Techlog pointed out this useful tool for standards-compliant Web pages. The Demoroniser cleans up MS-only codes. Western language HTML documents are written in the ISO 8859-1 Latin-1 character set, with a specified set of escapes for special characters....