Romanization of Arabic: Difference between revisions

Browse history interactively ← Previous edit Next edit →Content deleted Content addedVisual WikitextInline

Revision as of 05:22, 2 November 2007 editDrmaik (talk \| contribs)2,990 edits Revert to revision 168434204 dated 2007-11-01 02:24:21 by Mr.Slade using popups ← Previous edit		Revision as of 08:13, 23 November 2007 edit undoFiet Nam (talk \| contribs)920 edits →Comparison table: - Arabic doesn't have L in it's alphabet.Next edit →
Line 64:		Line 64:
	\| '		\| '
	\| {{unicode\|ʾ}}<!--br><small>(''zero word-initially'')</small-->		\| {{unicode\|ʾ}}<!--br><small>(''zero word-initially'')</small-->
	\| ' <!--small>(''disappears after 'al-' and where alif wa{{unicode\|ṣ}}l is.''</small-->		\| ' <!--small>(''disappears after 'al-' and where alif wa{{unicode\|ṣ}}r is.''</small-->
	\| {{IPA\|/ʔ/}}		\| {{IPA\|/ʔ/}}
	\|-		\|-
Line 138:		Line 138:
	\|-		\|-
	! <big>ﺩ</big>		! <big>ﺩ</big>
	\| {{unicode\|~~dāl~~}}		\| {{unicode\|dār}}
	\| D		\| D
	\|colspan="2"\| {{unicode\|d}}		\|colspan="2"\| {{unicode\|d}}
Line 148:		Line 148:
	\|-		\|-
	! <big>ﺫ</big>		! <big>ﺫ</big>
	\| {{unicode\|~~ḏāl~~}}		\| {{unicode\|ḏār}}
	\| Z		\| Z
	\|colspan="2"\| {{unicode\|dh}}		\|colspan="2"\| {{unicode\|dh}}
Line 287:		Line 287:
	\| k		\| k
	\| {{IPA\|/k/}}		\| {{IPA\|/k/}}
	\|-
	! <big>ﻝ</big>
	\| {{unicode\|lām}}
	\| L
	\|colspan="2"\| {{unicode\|l}}
	\|colspan="3"\| {{unicode\|l}}
	\| l
	\| l
	\| l
	\| {{IPA\|/l/}}
	\|-		\|-
	! <big>ﻡ</big>		! <big>ﻡ</big>
Line 377:		Line 367:
	\| à		\| à
	\| {{IPA\|/aː/}}		\| {{IPA\|/aː/}}
	\|-
	! <big>ﻻ</big>
	\| {{unicode\|lām ʼalif}}
	\| LA
	\|colspan="2"\| {{unicode\|lā}}
	\| {{unicode\|lā}} \|\| {{unicode\|laʾ}} \|\| {{unicode\|lā}}
	\| la
	\| {{unicode\|lʾ}}<!--br><small>(''with hamza'')</small><br-->; {{unicode\|lā}}<!--br><small>(''with lengthening alif'')</small-->
	\| <!--small>treated as laam then alif usually:</small--> laa
	\|{{IPA\|/lː/}}
	\|-
	! <big>ال</big>
	\| {{unicode\|ʼalif lām}}
	\| AL
	\|colspan="2"\| {{unicode\|al-}}
	\| {{unicode\|al-}} \|\| {{unicode\|ʾˈal}} \|\| {{unicode\|al-}}
	\| al
	\| al-
	\| al-;<!--small>When assimilation occurs:</small--> ál-
	\| var.
	\|}		\|}

Revision as of 08:13, 23 November 2007

This article may require cleanup to meet Misplaced Pages's quality standards. No cleanup reason has been specified. Please help improve this article if you can. (December 2006) (Learn how and when to remove this message)

Arabic alphabet
Arabic script
History Transliteration Diacritics Hamza Numerals Numeration
v t e

Different approaches and methods for romanizing Arabic exist. They vary in the way that they address the inherent problems of rendering written and spoken Arabic in the Latin alphabet; they also use different symbols for Arabic phonemes that do not exist in English or other European languages.

Romanization Issues

Any transliteration system has to make a number of decisions, dependent on its intended field of application. One basic problem is that written Arabic is normally unvocalized, i.e. many of the vowels are not written out, and must be supplied by a reader familiar with the language. But unvocalized Arabic writing does not give a reader unfamiliar with the language sufficient information for accurate pronunciation. An exact equivalent of e.g. صدام حسين would be Template:ArabDIN, which is meaningless to an untrained reader. The "full transliteration" adds information not in the text, which has to be supplied by a speaker of Arabic, Template:ArabDIN. Usually, newspapers and popular books use not a transliteration, but a transcription: instead of transliterating each written letter they try to reproduce the sound of the words according to the orthography rules of the target language: Saddam Hussein.

Most issues around the romanization of Arabic are about transliterating vs. transcribing – others, about what should be romanized:

transliteration ignores assimilation (sandhi) of the article before the "sun letters," and may be easily misread by non-Arabs. For instance an-nur (or an-nuur, or an-noor) would be more correctly transliterated along the lines of alnur. In the transcription an-nur, a hyphen is added and the unpronounced 'l' removed for the convenience of the uninformed non-Arab reader, who would otherwise pronounce an 'l', probably not understand the word to be nur, pronounce only one 'n', and be confused by the role of the double 'n'. Alternatively, if the shadda is not transliterated (since it is strictly not a letter), a hypercorrect transliteration would be alnur, which presents similar problems for the uninformed non-Arab reader.
a transliteration must render the "tied tā" (ta marbouta ة) faithfully, a transcription must render the sound ("a" like any other "a" or "t" like any other "at" — or in a vocalized text nothing vs. t)
- ISO 233 has a unique symbol, ẗ, ISO/R 233 uses superscript , .
"broken alif" (Template:ArabDIN, ى) must be transliterated with a special symbol, but is transcribed like standing alif, when it stands for a long a (ā)
Nunation: what is true elsewhere is also true for nunation: transliteration renders what you see, transcription what you hear.

A transcription may reflect the language as spoken, for example, by the people of Baghdad, or the official standard as spoken by a preacher in the mosque or a TV news reader. A transcription is free to add phonological (such as vowels) or morphological (such as word boundaries) information. Transcriptions will also vary depending on the writing conventions of the target language; compare English Omar Khayyam with German Omar Chajjam, both for عمر خيام (unvocalized Template:ArabDIN, vocalized Template:ArabDIN).

A transliteration is ideally fully reversible: a machine must be able to translate it into Arabic and back. A transliteration may be criticized as flawed for any of the following reasons:

A "loose" transliteration is ambiguous, rendering several Arabic phonemes with an identical transliteration, or digraphs for a single phoneme (such as sh) may be confused with two adjacent phonemes;
Symbols representing phonemes may be considered too similar (e.g., ` and ' or ʿ and ʾ for ayin and hamza);
ASCII transliterations using capital letters to disambiguate phonemes are easy to type but may be considered unaesthetic.

A fully accurate transcription may not be necessary for native Arabic speakers as they would be able to pronounce names and sentences correctly anyway, but it can be very useful for those not fully familiar with spoken Arabic and who are familiar with the Roman alphabet. An accurate transliteration serves as a valuable stepping stone for learning, pronouncing correctly, and distinguishing phonemes. It is a useful tool for anyone familiar with the sounds of Arabic but who are not fully conversant in the language.

One criticism is that a fully accurate system would require special learning that most do not have to actually pronounce names correctly, and that with a lack of a universal Romanization system they will not be pronounced correctly by non-native speakers anyway. The precision will be lost if special characters are not replicated and if someone is not familiar with Arabic pronunciation.

Transliteration standards

Deutsche Morgenländische Gesellschaft (1936): Adopted by the International Convention of Orientalist Scholars in Rome. It is the basis for the very influential Hans Wehr dictionary (ISBN 0-87950-003-4).
ISO/R 233 (1961). Replaced by ISO 233 in 1984 but still encountered.
BS 4280 (1968): Developed by the British Standards Institute.
SATTS: One-to-one mapping to Latin Morse equivalents.
UNGEGN (1972):
DIN-31635 (1982): Developed by the Deutsches Institut für Normung (German Institute for Standardization).
ISO 233 (1984).
Qalam (1985): A system that focuses upon preserving the spelling, rather than the pronunciation, and uses mixed case.
ISO 233-2(1993). Simplified transliteration.
Buckwalter Transliteration (1990s): Developed at Xerox by Tim Buckwalter ; doesn't require unusual diacritics.
ALA-LC (1997).
SAS: Spanish Arabists School (José Antonio Conde and others, early 19th century onwards).

A table comparing romanizations using DIN 31635, ISO 233, ISO/R 233, UN, ALA-LC, and Encyclopaedia of Islam systems is available here: .

Comparison table

Letter	Name	SATTS	UNGEGN	ALA-LC	DIN	ISO	ISO/R	Qalam	SAS	SM	IPA
ﺀ	hamza	E	ʼ, —	—, ’	ʾ	ˈ, ˌ	—, ’	'	ʾ	'	/ʔ/
ﺍ	ʼalif	A			ā	ʾ	ā	aa	a, i, u; ā	aa	/a(ː)/
ﺏ	bāʼ	B	b		b			b	b	b	/b/
ﺕ	tāʼ	T	t		t			t	t	t	/t/
ﺙ	ṯāʼ	C	th		ṯ			th	ṯ	ç	/θ/
ﺝ	ǧīm, jīm, gīm	J	j		ǧ			j	ŷ	j	/ʤ/ / /g/
ﺡ	ḥāʼ	H	ḩ	ḥ	ḥ			H	ḥ	ḥ	/ħ/
ﺥ	ḫāʼ	O	kh		ḫ	ẖ		kh	j	x	/x/
ﺩ	dār	D	d		d			d	d	d	/d/
ﺫ	ḏār	Z	dh		ḏ			dh	ḏ	đ	/ð/
ﺭ	rāʼ	R	r		r			r	r	r	/r/
ﺯ	zāy	;	z		z			z	z	z	/z/
ﺱ	sīn	S	s		s			s	s	s	/s/
ﺵ	šīn	:	sh		š			sh	š	š	/ʃ/
ﺹ	ṣād	X	ş	ṣ	ṣ			S	ṣ	ṣ	/sˁ/
ﺽ	ḍād	V	ḑ	ḍ	ḍ			D	ḍ	ḍ	/dˁ/
ﻁ	ṭāʼ	U	ţ	ṭ	ṭ			T	ṭ	ṭ	/tˁ/
ﻅ	ẓāʼ	Y	z̧	ẓ	ẓ			Z	ẓ	đ̣	/ðˁ/
ﻉ	ʻayn	`	ʻ		ʿ			`	ʿ	ř	/ʕ/
ﻍ	ġayn	G	gh		ġ		ḡ	gh	g	ğ	/ɣ/
ﻑ	fāʼ	F	f		f			f	f	f	/f/
ﻕ	qāf	Q	q		q			q	q	q	/q/
ﻙ	kāf	K	k		k			k	k	k	/k/
ﻡ	mīm	M	m		m			m	m	m	/m/
ﻥ	nūn	N	n		n			n	n	n	/n/
ﻩ	hāʼ	~	h		h			h	h	h	/h/
ﻭ	wāw	W	w		w			w	w; ū	w; o	/w/, /uː/
ﻱ	yāʼ	I	y		y			y	y; ī	y; e	/j/, /iː/
ﺁ	ʼalif mamdūda	AEA	ā	ā, ʼā	ʾā	ʾâ	ā, ʾā		ā	'aa	/ʔaː/
ﺓ	tāʼ marbūṭa	@	h, t		h, t	ẗ	,	h, t	t; —	ŧ	/a/, /at/
ﻯ	ʼalif maqṣūra	/	y		ā	ỳ		ae	à	à	/aː/

Online

Main article: Arabic Chat Alphabet

Online communication is sometimes restricted to an ASCII environment in which not only the Arabic letters themselves but also Roman characters with diacritics are unavailable. Even when Arabic letters and Roman characters with diacritics are available, they are often difficult to type. This problem is faced by most speakers of languages that use non-Roman alphabets, or heavily modified ones. An ad hoc solution consists of using Arabic numerals which mirror or resemble the relevant Arabic.

External links

Categories: