Encoding Information
ALA-LC (Library of Congress) Romanization with Diacritics
Example: Russkīĭ i͡azyk
This encoding implements the Library of Congress's
transliteration system [pdf] for vernacular
Russian. The diacritical marks are rendered with Unicode. Note that they will
usually not show up in a browser, but should appear when the text is
copied and pasted to a full-featured word processor.
ALA-LC (Library of Congress) Romanization without Diacritics
Example: Russkii iazyk
This output-only encoding converts Cyrillic to a readable form of Library of
Congress transliteration that can be used on any ASCII-compatible system.
CP1251 (Windows)
Example: Ðóññêèé ÿçûê
This encoding is also known as Win1251 and is used in many Microsoft products.
The encoding dedicates the upper 128 characters of the ASCII set to Cyrillic,
leaving the lower 128 for Latin letters. This allows both alphabets to be mixed
within the same font.
ISO 9 Romanization
Example: Russkij âzyk
ISO 9 is an international standard for transliterating Cyrillic which has, in
contrast to the Library of Congress system, a one-to-one Latin to Cyrillic
character mapping (so a Cyrillic text that is 100 characters long will be
100 characters long when converted to ISO 9). Some ISO 9 diacritics may
not show up in a browser, but should appear when pasted to a word processor.
More information on ISO 9 is avialable on
Wikipedia.
KOI8
Example: òÕÓÓËÉÊ ÑÚÙË
This encoding refers to the KOI8-ru variant of the KOI8 family, which is used
on many Russian Web pages. Like CP1251, it dedicates half of the ASCII set
to Cyrillic, but with a different letter arrangement. Some fonts are available
through
AATSEEL.