|
 |
|
ISO 639 and IETF 1766 Standardised Language Codes
A listing of abbreviation codes to represent natural language names, as standardised by ISO (ISO 639) and IETF (RFC 1766) and registered with IANA. With alphabetical tags to enable fast search for a particular language or language code. Maintained by Jacob Palme. [eng]
|
 |
|
UNIMARC: Language Codes
[]
|
 |
|
Coverage of European languages by ISO Latin alphabets
A reference resource compiled by Jukka Korpela. [eng]
|
 |
|
Language Code Lists
[]
|
 |
|
Internationalization / Localization
Links to both onsite and offsite resources from the W3C. [eng]
|
 |
|
Reminder about 4 medieval English Latin characters
An article by Michael Everson, discussing the encoding standards for "hwair", "wynn", "ezh" and "yogh". [eng]
|
 |
|
On the status of the Latin letter þorn and of its sorting order
An article (by Michael Everson and Baldur Sigurðsson) reviewing the history of alphabet from Phoenician times to argue that "thorn" should be counted as a separate independent letter (rather than a derived one) and sorted accordingly. [eng]
|
 |
|
A tutorial on character code issues
"This document tries to clarify the concepts of
character repertoire, character code, and
character encoding especially in the Internet
context... ASCII, ISO 646,
ISO 8859 (ISO Latin, especially ISO Latin 1),
Windows character set, ISO 10646 (UCS),
Unicode, UTF-8, UTF-7, MIME, and QP are
used as examples." Part of the IT
and communication free
information site. [eng]
|
 |
|
HTML authoring in different languages - a link list
Links colected by Jukka Korpela. [eng]
|
 |
|
Techniques for multilingual Web sites
What does a multilingual Web site mean. Linking to versions in different languages. Language negotiation mechanism: language negotiation as a protocol issue, language negotiation and settings in a server (with Apache as example), language settings in browsers. [eng]
|
 |
|
Sample Pages for Various Character Sets
A demonstration of the multilingual capability of Web pages, including content negotiation and use of charset in MIME types. [eng]
|
 |
|
W3C Internationalization and Localization
World-Wide Character Sets, Languages, and Writing Systems [eng]
|
 |
|
Internationalization/localization: HTTP
HTTP protocol issues related to use of various national languages. Part of W3C site. [eng]
|
 |
|
HTML Document Representation
The Document Character Set. Character encodings. Choosing an encoding, and notes on specific encodings. Specifying the character encoding. Character references: numeric, character entity references. Undisplayable characters. A chapter in W3C Recommendation of 24 December 1999. [eng]
|
 |
|
Language information and text direction
The W3C Recommendation of 24 December 1999 (HTML 4.01 Specification) regarding two important
issues that affect the internationalization of HTML:
specifying the language (the lang attribute) and direction
(the dir attribute) of text in a document. [eng]
|
 |
|
ISO 8859 Alphabet Soup
A commented graphical overview of the ISO 8859 character sets. [eng]
|
 |
|