ISO 15924: Difference between revisions

Browse history interactively ← Previous edit Next edit →Content deleted Content addedVisual WikitextInline

Revision as of 15:36, 11 November 2006 editEvertype (talk \| contribs)Autopatrolled, Extended confirmed users, File movers13,006 edits Deleting sentence which is incorrect as it stands. Greek letters are "Grek", even if they are used in the IPA orthography which is "Latn" apart from these letters.← Previous edit		Revision as of 15:37, 11 November 2006 edit undoEvertype (talk \| contribs)Autopatrolled, Extended confirmed users, File movers13,006 edits Deleting POV analysis. "Downside" is an opinion.Next edit →
Line 33:		Line 33:
	Strictly speaking every writing system has a code, if no other can be applied it is: Zzzz/999 - Code for uncoded script.		Strictly speaking every writing system has a code, if no other can be applied it is: Zzzz/999 - Code for uncoded script.

	Some of these scripts may get codes in the future. If in need of codes the private range 900-949 (Qaaa - Qabx), offers space, but only allows coding 50 scripts. The downside is that if someone wants to use the numeric ranges to determine the group a script belongs to, this is not possible anymore. E.g. one can have an alphabetic writing system privately assigned, which then not falls in the alphabet range. There are no private use areas in the group ranges.		Some of these scripts may get codes in the future. If in need of codes the private range 900-949 (Qaaa - Qabx), offers space, but only allows coding 50 scripts. There are no private use areas in the group ranges.

	==See also==		==See also==

Revision as of 15:37, 11 November 2006

ISO 15924, Codes for the representation of names of scripts, defines two sets of codes for a number of writing systems (scripts). Each script is given both a four-letter code and a numeric one.

Script is defined as "set of graphic characters used for the written form of one or more languages". So scripts that are not (yet) "used for the written form of one language" are explicitly excluded.

Where possible the codes are derived from ISO 639-2 where the name of a script and the name of a language using the script are identical (example: Gujarātī ISO 639 guj, ISO 15924 Gujr). Preference is given to the 639-2 Bibliographical codes, which is different to the otherwise often favored use of the Terminological codes.

ISO 15924 could be used, similarly to ISO 3166, in addition to ISO 639, which defines codes for natural languages, for instance in markup languages like HTML and XML to help Web browsers determine which typeface to use for foreign text. This way one could differentiate, for example, between Serbian written in the Cyrillic (srp-Cyrl) or Latin (srp-Latn) script, or mark romanized text as such.

Maintenance

ISO has appointed the Unicode Consortium as the Registration Authority for the standard. In 2004, the Registration Authority appointed Michael Everson to act as Registrar.

Script codes

Numeric ranges

000-099 Hieroglyphic and cuneiform scripts
100-199 Right-to-left alphabetic scripts
200-299 Left-to-right alphabetic scripts
300-399 Alphasyllabic scripts
400-499 Syllabic scripts
500-599 Ideographic scripts
600-699 Undeciphered scripts
700-799 (unassigned)
800-899 (unassigned)
900-999 Private use, aliases, special codes

Special codes

Zxxx || 997 || Code for unwritten languages 
Zyyy || 998 || Code for undetermined script 
Zzzz || 999 || Code for uncoded script

Qaaa-Qabx are 50 codes for private use. The Qaaa-Qabx range is not mentioned in the standard itself, but in the codelist.

Scripts without code

Strictly speaking every writing system has a code, if no other can be applied it is: Zzzz/999 - Code for uncoded script.

Some of these scripts may get codes in the future. If in need of codes the private range 900-949 (Qaaa - Qabx), offers space, but only allows coding 50 scripts. There are no private use areas in the group ranges.

References

Unicode - ISO 15924 Registration Authority

External links

Template:ISO 15924/footer

Categories: