Latin-1 Supplement - Misplaced Pages

(Redirected from C1 Controls and Latin-1 Supplement) Not to be confused with ISO/IEC 8859-1. For a list of all Latin characters encoded in Unicode, see Latin script in Unicode. See also: Basic Latin (Unicode block) Unicode character block

Latin-1 Supplement or C1 Controls and Latin-1 Supplement
Range	U+0080..U+00FF (128 code points)
Plane	BMP
Scripts	Latin (64 char.) Common (64 char.)
Major alphabets	French German Icelandic Portuguese Spanish
Symbol sets	Punctuation Mathematics Currency
Assigned	128 code points 33 Control or Format
Unused	0 reserved code points
Source standards	ISO/IEC 8859-1
Unicode version history

1.0.0 (1991)	128 (+128)

Unicode documentation
Code chart ∣ Web page
Note:

The Latin-1 Supplement (also called C1 Controls and Latin-1 Supplement) is the second Unicode block in the Unicode standard. It encodes the upper range of ISO 8859-1: 80 (U+0080) - FF (U+00FF). C1 Controls (0080–009F) are not graphic. This block ranges from U+0080 to U+00FF, contains 128 characters and includes the C1 controls, Latin-1 punctuation and symbols, 30 pairs of majuscule and minuscule accented Latin characters and 2 mathematical operators.

The C1 Controls and Latin-1 Supplement block has been included in its present form, with the same character repertoire since version 1.0 of the Unicode Standard. Its block name in Unicode 1.0 was simply Latin1.

Character table

Code	Result	Description	Acronym
C1 Controls
U+0080		Padding Character	PAD
U+0081		High Octet Preset	HOP
U+0082		Break Permitted Here	BPH
U+0083		No Break Here	NBH
U+0084		Index	IND
U+0085		Next Line	NEL
U+0086		Start of Selected Area	SSA
U+0087		End of Selected Area	ESA
U+0088		Character (Horizontal) Tabulation Set	HTS
U+0089		Character (Horizontal) Tabulation with Justification	HTJ
U+008A		Line (Vertical) Tabulation Set	LTS
U+008B		Partial Line Forward (Down)	PLD
U+008C		Partial Line Backward (Up)	PLU
U+008D		Reverse Line Feed (Index)	RI
U+008E		Single-Shift Two	SS2
U+008F		Single-Shift Three	SS3
U+0090		Device Control String	DCS
U+0091		Private Use One	PU1
U+0092		Private Use Two	PU2
U+0093		Set Transmit State	STS
U+0094		Cancel Character	CCH
U+0095		Message Waiting	MW
U+0096		Start of Protected Area	SPA
U+0097		End of Protected Area	EPA
U+0098		Start of String	SOS
U+0099		Single Graphic Character Introducer	SGCI
U+009A		Single Character Introducer	SCI
U+009B		Control Sequence Introducer	CSI
U+009C		String Terminator	ST
U+009D		Operating System Command	OSC
U+009E		Private Message	PM
U+009F		Application Program Command	APC
Latin-1 Punctuation and Symbols
U+00A0		Non-breaking space	NBSP
U+00A1	¡	Inverted exclamation mark
U+00A2	¢	Cent sign
U+00A3	£	Pound sign
U+00A4	¤	Currency sign
U+00A5	¥	Yen sign
U+00A6	¦	Broken bar
U+00A7	§	Section sign
U+00A8	¨	Diaeresis
U+00A9	©	Copyright sign
U+00AA	ª	Feminine ordinal indicator
U+00AB	«	Left-pointing double angle quotation mark
U+00AC	¬	Not sign
U+00AD		Soft hyphen	SHY
U+00AE	®	Registered sign
U+00AF	¯	Macron
U+00B0	°	Degree symbol
U+00B1	±	Plus-minus sign
U+00B2	²	Superscript two
U+00B3	³	Superscript three
U+00B4	´	Acute accent
U+00B5	µ	Micro sign
U+00B6	¶	Pilcrow sign
U+00B7	·	Middle dot
U+00B8	¸	Cedilla
U+00B9	¹	Superscript one
U+00BA	º	Masculine ordinal indicator
U+00BB	»	Right-pointing double angle quotation mark
U+00BC	¼	Vulgar fraction one quarter
U+00BD	½	Vulgar fraction one half
U+00BE	¾	Vulgar fraction three quarters
U+00BF	¿	Inverted question mark
Letters
U+00C0	À	Latin Capital Letter A with grave
U+00C1	Á	Latin Capital letter A with acute
U+00C2	Â	Latin Capital letter A with circumflex
U+00C3	Ã	Latin Capital letter A with tilde
U+00C4	Ä	Latin Capital letter A with diaeresis
U+00C5	Å	Latin Capital letter A with ring above
U+00C6	Æ	Latin Capital letter AE
U+00C7	Ç	Latin Capital letter C with cedilla
U+00C8	È	Latin Capital letter E with grave
U+00C9	É	Latin Capital letter E with acute
U+00CA	Ê	Latin Capital letter E with circumflex
U+00CB	Ë	Latin Capital letter E with diaeresis
U+00CC	Ì	Latin Capital letter I with grave
U+00CD	Í	Latin Capital letter I with acute
U+00CE	Î	Latin Capital letter I with circumflex
U+00CF	Ï	Latin Capital letter I with diaeresis
U+00D0	Ð	Latin Capital letter Eth
U+00D1	Ñ	Latin Capital letter N with tilde
U+00D2	Ò	Latin Capital letter O with grave
U+00D3	Ó	Latin Capital letter O with acute
U+00D4	Ô	Latin Capital letter O with circumflex
U+00D5	Õ	Latin Capital letter O with tilde
U+00D6	Ö	Latin Capital letter O with diaeresis
Mathematical operator
U+00D7	×	Multiplication sign
Letters
U+00D8	Ø	Latin Capital letter O with stroke
U+00D9	Ù	Latin Capital letter U with grave
U+00DA	Ú	Latin Capital letter U with acute
U+00DB	Û	Latin Capital Letter U with circumflex
U+00DC	Ü	Latin Capital Letter U with diaeresis
U+00DD	Ý	Latin Capital Letter Y with acute
U+00DE	Þ	Latin Capital Letter Thorn
U+00DF	ß	Latin Small Letter sharp S
U+00E0	à	Latin Small Letter A with grave
U+00E1	á	Latin Small Letter A with acute
U+00E2	â	Latin Small Letter A with circumflex
U+00E3	ã	Latin Small Letter A with tilde
U+00E4	ä	Latin Small Letter A with diaeresis
U+00E5	å	Latin Small Letter A with ring above
U+00E6	æ	Latin Small Letter AE
U+00E7	ç	Latin Small Letter C with cedilla
U+00E8	è	Latin Small Letter E with grave
U+00E9	é	Latin Small Letter E with acute
U+00EA	ê	Latin Small Letter E with circumflex
U+00EB	ë	Latin Small Letter E with diaeresis
U+00EC	ì	Latin Small Letter I with grave
U+00ED	í	Latin Small Letter I with acute
U+00EE	î	Latin Small Letter I with circumflex
U+00EF	ï	Latin Small Letter I with diaeresis
U+00F0	ð	Latin Small Letter Eth
U+00F1	ñ	Latin Small Letter N with tilde
U+00F2	ò	Latin Small Letter O with grave
U+00F3	ó	Latin Small Letter O with acute
U+00F4	ô	Latin Small Letter O with circumflex
U+00F5	õ	Latin Small Letter O with tilde
U+00F6	ö	Latin Small Letter O with diaeresis
Mathematical operator
U+00F7	÷	Division sign
Letters
U+00F8	ø	Latin Small Letter O with stroke
U+00F9	ù	Latin Small Letter U with grave
U+00FA	ú	Latin Small Letter U with acute
U+00FB	û	Latin Small Letter U with circumflex
U+00FC	ü	Latin Small Letter U with diaeresis
U+00FD	ý	Latin Small Letter Y with acute
U+00FE	þ	Latin Small Letter Thorn
U+00FF	ÿ	Latin Small Letter Y with diaeresis

Subheadings

The C1 Controls and Latin-1 Supplement block has four subheadings within its character collection: C1 controls, Latin-1 Punctuation and Symbols, Letters, and Mathematical operator(s).

C1 controls

The C1 controls subheading contains 32 supplementary control codes inherited from ISO/IEC 8859-1 and many other 8-bit character standards. The alias names for the C0 and C1 control codes are taken from ISO/IEC 6429:1992.

Latin-1 punctuation and symbols

The Latin-1 Punctuation and Symbols subheading contains 32 characters of common international punctuation characters, such as the inverted question and exclamation marks, a middle dot, and symbols such as currency signs, spacing diacritic marks, vulgar fractions, and superscript numbers.

Letters

The Letters subheading contains 30 pairs of majuscule and minuscule accented or novel Latin characters for western European languages, and two extra minuscule characters (ß and ÿ) not commonly used as the first letter of words.

Mathematical operator

The Mathematical operator subheading is used for the multiplication and division signs.

Number of symbols, letters and control codes

The table below shows the number of letters, symbols and control codes in each of the subheadings in the C1 Controls and Latin-1 Supplement block.

Type of subheading	Number of symbols	Range of characters
C1 controls	32 control codes	U+0080 to U+009F
Latin-1 punctuation and symbols	32 punctuation and symbols	U+00A0 to U+00BF
Letters	30 pairs of majuscule and minuscule accented Latin characters	U+00C0 to U+00D6, U+00D8 to U+00F6 and U+00F8 to U+00FF
Mathematical operators	The U+00D7 × MULTIPLICATION SIGN and U+00F7 ÷ DIVISION SIGN symbols.	U+00D7 and U+00F7

Compact table

C1 Controls and Latin-1 Supplement
Official Unicode Consortium code chart (PDF)

U+008x

XXX

BPH

NBH

IND

NEL

SSA

ESA

HTS

HTJ

VTS

PLD

PLU

SS2

SS3

U+009x

DCS

PU1

PU2

STS

CCH

SPA

EPA

SOS

XXX

SCI

CSI

OSC

APC

U+00Ax

NBSP

SHY

U+00Bx

U+00Cx

U+00Dx

U+00Ex

U+00Fx

1. As of Unicode version 16.0

Emoji

The Latin-1 Supplement block contains two emoji: U+00A9 and U+00AE.

The block has four standardized variants defined to specify emoji-style (U+FE0F VS16) or text presentation (U+FE0E VS15) for the two emoji, both of which default to a text presentation.

Emoji variation sequences
U+	00A9	00AE
base code point	©	®
base+VS15 (text)	©︎	®︎
base+VS16 (emoji)	©️	®️

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Latin-1 Supplement block:

Version	Final code points	Count	L2 ID	WG2 ID	Document
1.0.0	U+0080..009F	32	X3L2/95-002		PDAM No. 3 to ISO/IEC 10646-1 on coding of C1 controls, 1994-11-01
			X3L2/95-028	N1148	Nine tables of replies to repeated/extended votes, 1995-02-22
				N1203	Umamaheswaran, V. S.; Ksar, Mike (1995-05-03), "5.3", Unconfirmed minutes of SC2/WG2 Meeting 27, Geneva
			X3L2/95-061		DAM no.3 to ISO/IEC 10646-1 (Coding of C1 controls), 1995-06-01
				N1307	Table of replies to JTC1 letter ballot on 10646 DAM 3, Coding of C1 Controls, (SC2 N 2666), 1996-01-15
				N1309	Paterson, Bruce (1996-01-17), Report and Disposition of Comments on DAM 1, UTF 16 and DAM 2, UTF-8, DAM 3, Coding of C1 Controls, and DAM 4, Removal of Annex G: UTF1
				N1312	Paterson, Bruce (1996-01-17), Draft Final Text of 10646 AMD-3, Coding of C1 Controls
			L2/99-048		Umamaheswaran, V. S. (1999-02-04), C1 controls in the code charts
			L2/99-054R		Aliprand, Joan (1999-06-21), "C1 Controls", Approved Minutes from the UTC/L2 meeting in Palo Alto, February 3-5, 1999
				N3046	Suignard, Michel (2006-02-22), Improving formal definition for control characters
				N3103 (pdf, doc)	Umamaheswaran, V. S. (2006-08-25), "M48.33", Unconfirmed minutes of WG 2 meeting 48, Mountain View, CA, USA; 2006-04-24/27
	U+00A0..00FF	96			(to be determined)
			X3L2/94-077	N994	Davis, Mark (1994-03-03), ISO/IEC 10646-1 - Proposed Draft Corrigendum 1
			X3L2/94-098	N1033 (pdf, doc)	Umamaheswaran, V. S.; Ksar, Mike (1994-06-01), "8.1.15", Unconfirmed Minutes of ISO/IEC JTC 1/SC 2/WG 2 Meeting 25, Falez Hotel, Antalya, Turkey, 1994-04-18--22
			L2/11-016		Moore, Lisa (2011-02-15), "Correct mistakes in property assignments for super and subscripted letters (B.13.4) ", UTC #126 / L2 #223 Minutes
			L2/11-116		Moore, Lisa (2011-05-17), "Consensus 127-C14", UTC #127 / L2 #224 Minutes, Change the general category of to U+00AA FEMININE ORDINAL INDICATOR and U+00BA MASCULINE ORDINAL INDICATOR "Lo" for Unicode 6.1.
			L2/11-261R2		Moore, Lisa (2011-08-16), "Consensus 128-C6", UTC #128 / L2 #225 Minutes, Change the general category from "So" to "Po" ...
			L2/15-050R		Davis, Mark; et al. (2015-01-29), Additional variation selectors for emoji
Proposed code points and characters names may differ from final code points and names See also L2/13-207, L2/14-054, L2/14-063, L2/15-051A, L2/15-051B Refer to the history section of the Miscellaneous Symbols and Pictographs block for additional emoji-related documents

References

"Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
"Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
The Unicode Standard Version 1.0, Volume 1. Addison-Wesley Publishing Company, Inc. 1991 . ISBN 0-201-56788-1.
"3.8: Block-by-Block Charts" (PDF). The Unicode Standard. version 1.0. Unicode Consortium.
^ "Unicode 6.2 code charts" (PDF). The Unicode Standard. Retrieved 1 April 2013.
"UTR #51: Unicode Emoji". Unicode Consortium. 2023-09-05.
"UCD: Emoji Data for UTR #51". Unicode Consortium. 2023-02-01.
"UTS #51 Emoji Variation Sequences". The Unicode Consortium.

Unicode

Code points

Characters

Special purpose	BOM Combining grapheme joiner Left-to-right mark / Right-to-left mark Soft hyphen Variant form Word joiner Zero-width joiner Zero-width non-joiner Zero-width space
Lists	Characters CJK Unified Ideographs Combining character Duplicate characters Numerals Scripts Spaces Symbols Halfwidth and fullwidth Alias names and abbreviations Whitespace characters

Processing

Algorithms	Bidirectional text Collation ISO/IEC 14651 Equivalence Variation sequences International Ideographs Core
Comparison of encodings	BOCU-1 CESU-8 Punycode SCSU UTF-1 UTF-7 UTF-8 UTF-16/UCS-2 UTF-32/UCS-4 UTF-EBCDIC

On pairs of
code points

Usage

Related standards