Misplaced Pages

Arabic Presentation Forms-B

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
See also: Arabic Presentation Forms-A (Unicode block), Alphabetic Presentation Forms (Unicode block), and Specials (Unicode block) For a list of all Arabic characters encoded in Unicode, see Arabic script in Unicode. Unicode character block
Arabic Presentation Forms-B
RangeU+FE70..U+FEFF
(144 code points)
PlaneBMP
ScriptsArabic (140 char.)
Common (1 char.)
Symbol setscontextual and isolate forms of Arabic letters and points
Assigned141 code points
Unused3 reserved code points
Unicode version history
1.0.0 (1991)140 (+140)
3.2 (2002)141 (+1)
Unicode documentation
Code chart ∣ Web page
Note:
The characters in this block were re-ordered in Unicode 1.0.1, in the process of merging with ISO/IEC 10646.

Arabic Presentation Forms-B is a Unicode block encoding spacing forms of Arabic diacritics, and contextual letter forms. The special codepoint ZWNBSP (zero width no-break space) is also here, which is only meant for a byte order mark (that may precede text, Arabic or not, or be absent). The block name in Unicode 1.0 was Basic Glyphs for Arabic Language; its characters were re-ordered in the process of merging with ISO 10646 in Unicode 1.0.1 and 1.1.

The presentation forms are present only for compatibility with older standards, and are not currently needed for coding text.

Block

Arabic Presentation Forms-B
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+FE7x ﹿ
U+FE8x
U+FE9x
U+FEAx
U+FEBx ﺿ
U+FECx
U+FEDx
U+FEEx
U+FEFx ZW
NBSP
Notes
1. As of Unicode version 16.0
2. Grey areas indicate non-assigned code points

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Arabic Presentation Forms-B block:

Version Final code points Count UTC ID L2 ID WG2 ID Document
1.0.0 U+FE70..FE72, FE74, FE76..FE7F 14 UTC/1991-048B Whistler, Ken (1991-03-27), "14 addtional [sic] Arabic spacing diacritics", Draft Minutes from the UTC meeting #46 day 2, 3/27 at Apple
U+FE80..FEFC 125 (to be determined)
U+FEFF 1 UTC/1991-054 Whistler, Ken, FF Proposal
UTC/1991-048B Whistler, Ken (1991-03-27), "III.I.4", Draft Minutes from the UTC meeting #46 day 2, 3/27 at Apple
L2/05-137 Freytag, Asmus (2005-05-10), Handling "defective" names
L2/05-108R Moore, Lisa (2005-08-26), "Consensus 103-C7", UTC #103 Minutes, Create a "Normative Name Alias" property and file in the UCD. Populate the property with names from the sections "Typos" and "Bad or misleading names" from document L2/05-137.
3.2 U+FE73 1 L2/01-069 Davis, Mark (2001-01-29), Proposal Summary Form for Arabic character tail for final Seen family (Seen, Sheen, Saad, Daad)
L2/01-095 N2322 Umamaheswaran, V. S. (2001-02-05), Proposal to add "Arabic Tail Fragment" character
L2/01-012R Moore, Lisa (2001-05-21), "Motion 86-M30", Minutes UTC #86 in Mountain View, Jan 2001, Accept the addition of the character ARABIC TAIL FRAGMENT at U+FE73, with the properties of an extender.
L2/01-344 N2353 (pdf, doc) Umamaheswaran, V. S. (2001-09-09), "7.9", Minutes from SC2/WG2 meeting #40 -- Mountain View, April 2001
  1. Proposed code points and characters names may differ from final code points and names

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  3. ^ "Unicode 1.0.1 Addendum" (PDF). The Unicode Standard. 1992-11-03. Retrieved 2016-07-09.
  4. "Layout Controls" (PDF). The Unicode Standard, Version 12.0.0. The Unicode Consortium. p. 871.
  5. "3.8: Block-by-Block Charts" (PDF). The Unicode Standard. version 1.0. Unicode Consortium.
  6. The Unicode Consortium. The Unicode Standard, Version 6.0.0, (Mountain View, CA: The Unicode Consortium, 2011. ISBN 978-1-936213-01-6), Chapter 8

Notes

  1. As the name suggests, it was also used to prohibit line breaks at its position, but this usage was deprecated in Unicode 3.2.
Category: