Misplaced Pages

ISO 15924: Difference between revisions

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Browse history interactively← Previous editNext edit →Content deleted Content addedVisualWikitext
Revision as of 04:13, 9 November 2006 editTobias Conradi (talk | contribs)37,615 edits Script is defined as "set of graphic characters used for the written form of one or more languages". So scripts that are not (yet) "used for the written form of one language" are explicitly excluded.← Previous edit Revision as of 10:20, 9 November 2006 edit undoEvertype (talk | contribs)Autopatrolled, Extended confirmed users, File movers13,006 edits Deleting material which does not seem appropriate. "Claims"?Next edit →
(One intermediate revision by the same user not shown)
Line 7: Line 7:
ISO 15924 could be used, similar to ], in addition to ], which defines codes for natural languages, for instance in markup languages like ] and ] to help ]s determine which ] to use for foreign text. This way one could differentiate, for example, between ] written in the Cyrillic (srp-Cyrl) or Latin (srp-Latn) script, or mark ] text as such. ISO 15924 could be used, similar to ], in addition to ], which defines codes for natural languages, for instance in markup languages like ] and ] to help ]s determine which ] to use for foreign text. This way one could differentiate, for example, between ] written in the Cyrillic (srp-Cyrl) or Latin (srp-Latn) script, or mark ] text as such.


ISO has appointed the ] as the Registration Authority for the standard. The Registration Authority page in turn claims ] of Evertype has been appointed Registrar by the Registration Authority. ISO has appointed the ] as the Registration Authority for the standard.

The role of the Registration Authority is defined in the standard, but the existence of a Registrar is not even mentioned.


==Script codes== ==Script codes==
Line 30: Line 28:


Qaaa - Qabx , 50 codes for private use. The Qaaa - Qabx - range is not mentioned in the standard itself, but in the codelist. There is no statement in the standard that prohibits this range to be changed by the RA. Qaaa - Qabx , 50 codes for private use. The Qaaa - Qabx - range is not mentioned in the standard itself, but in the codelist. There is no statement in the standard that prohibits this range to be changed by the RA.

==Scripts without code==
There are also scripts that have no code. Some of them may get codes in the future. If in need of codes the private range 900-949 (Qaaa - Qabx), offers space, but only allows coding 50 scripts. The downside is that then one has e.g. alphabetic writing system not in the alphabet range.

''in brackets, the range the script falls in is indicated''

* ]
* ]
* ] (200-299)
* ]
* ] (400-499)
* ] (400-499)
* ]
* ]
* ] (400-499)
* ] (500-599)
* ]
* ]
* ] (200-299, )
* ] (600-699)
* ]
* ] (200-299)
* ]
* ] (000-099)
* ] (400-499)
* ] (200-299)
* ]
* ]
* ]
* ]
* ]
* ]
* ]
* ]
* ] (200-299)
* ]


see also: ]


==See also== ==See also==

Revision as of 10:20, 9 November 2006

ISO 15924, Codes for the representation of names of scripts, defines two sets of codes for a number of writing systems (scripts). Each script is given both a four-letter code and a numeric one.

Script is defined as "set of graphic characters used for the written form of one or more languages". So scripts that are not (yet) "used for the written form of one language" are explicitly excluded.

Where possible the codes are derived from ISO 639-2 where the name of a script and the name of a language using the script are identical (example: Gujarātī ISO 639 guj, ISO 15924 Gujr). Preference is given to the 639-2 B-codes, which is different to the otherwise often favored use of the T codes.

ISO 15924 could be used, similar to ISO 3166, in addition to ISO 639, which defines codes for natural languages, for instance in markup languages like HTML and XML to help Web browsers determine which typeface to use for foreign text. This way one could differentiate, for example, between Serbian written in the Cyrillic (srp-Cyrl) or Latin (srp-Latn) script, or mark romanized text as such.

ISO has appointed the Unicode Consortium as the Registration Authority for the standard.

Script codes

Numeric ranges

000-099 Hieroglyphic and cuneiform scripts
100-199 Right-to-left alphabetic scripts
200-299 Left-to-right alphabetic scripts
300-399 Alphasyllabic scripts
400-499 Syllabic scripts
500-599 Ideographic scripts
600-699 Undeciphered scripts
700-799 (unassigned)
800-899 (unassigned)
900-999 Private use, aliases, special codes

Special codes

Zxxx || 997 || Code for unwritten languages 
Zyyy || 998 || Code for undetermined script 
Zzzz || 999 || Code for uncoded script

Qaaa - Qabx , 50 codes for private use. The Qaaa - Qabx - range is not mentioned in the standard itself, but in the codelist. There is no statement in the standard that prohibits this range to be changed by the RA.

See also

External links

Categories: