SBCS, or single-byte character set, is used to refer to character encodings that use exactly one byte for each graphic character. An SBCS can accommodate a maximum of 256 symbols, and is useful for scripts that do not have many symbols or accented letters such as the Latin, Greek and Cyrillic scripts used mainly for European languages. Examples of SBCS encodings include ISO/IEC 646, the various ISO 8859 encodings, and the various Microsoft/IBM code pages.
The term SBCS is commonly contrasted against the terms DBCS (double-byte character set) and TBCS (triple-byte character set), as well as MBCS (multi-byte character set). The multi-byte character sets are used to accommodate languages with scripts that have large numbers of characters and symbols, predominantly Asian languages such as Chinese, Japanese, and Korean. These are sometimes referred to by the acronym CJK. In these computing systems, SBCSs are traditionally associated with half-width characters, so-called because such SBCS characters would traditionally occupy half the width of a DBCS character on a fixed-width computer terminal or text screen.
In modern times
Though single-byte character sets have largely been supplanted by UTF-8 and its variants on modern systems, they have found a niche in code golfing, where the smaller byte size of characters allows participants to gain an edge if they use SBCSs with specially-designed programming languages such as Vyxal and GolfScript.
See also
References
- "Single-byte Character Sets". Microsoft. Retrieved 2017-04-17.
- "SBCS, DBCS, and MBCS data conversion considerations". IBM. 14 September 2015. Retrieved 2017-04-17.
- "Usage statistics of UTF-8 for websites". w3techs.com. Retrieved 2024-09-09.
- "Vyxal language README". Github. 4 September 2024. Retrieved 2024-09-09.
Character encodings | |
---|---|
Early telecommunications | |
ISO/IEC 8859 |
|
Bibliographic use | |
National standards | |
ISO/IEC 2022 | |
Mac OS Code pages ("scripts") | |
DOS code pages | |
IBM AIX code pages | |
Windows code pages | |
EBCDIC code pages | |
DEC terminals (VTx) | |
Platform specific |
|
Unicode / ISO/IEC 10646 | |
TeX typesetting system | |
Miscellaneous code pages | |
Control character | |
Related topics | |
Character sets |
This character encoding article is a stub. You can help Misplaced Pages by expanding it. |
This writing system–related article is a stub. You can help Misplaced Pages by expanding it. |