ARABIC TRANSLITERATION
I developed my transliteration system before XML days. To make it XML-friendly I would:
replace < with I (for hamza-under-alif)
replace > with O (for hamza-over-alifthe A is already used for bare alif)
replace & with W (for hamza-on-waw)
Transliteration | Arabic Windows | Unicode Value and Unicode Name |
' | C1 | U+0621 ARABIC LETTER HAMZA |
| | C2 | U+0622 ARABIC LETTER ALEF WITH MADDA ABOVE |
> | C3 | U+0623 ARABIC LETTER ALEF WITH HAMZA ABOVE |
& | C4 | U+0624 ARABIC LETTER WAW WITH HAMZA ABOVE |
< | C5 | U+0625 ARABIC LETTER ALEF WITH HAMZA BELOW |
} | C6 | U+0626 ARABIC LETTER YEH WITH HAMZA ABOVE |
A | C7 | U+0627 ARABIC LETTER ALEF |
b | C8 | U+0628 ARABIC LETTER BEH |
p | C9 | U+0629 ARABIC LETTER TEH MARBUTA |
t | CA | U+062A ARABIC LETTER TEH |
v | CB | U+062B ARABIC LETTER THEH |
j | CC | U+062C ARABIC LETTER JEEM |
H | CD | U+062D ARABIC LETTER HAH |
x | CE | U+062E ARABIC LETTER KHAH |
d | CF | U+062F ARABIC LETTER DAL |
* | D0 | U+0630 ARABIC LETTER THAL |
r | D1 | U+0631 ARABIC LETTER REH |
z | D2 | U+0632 ARABIC LETTER ZAIN |
s | D3 | U+0633 ARABIC LETTER SEEN |
$ | D4 | U+0634 ARABIC LETTER SHEEN |
S | D5 | U+0635 ARABIC LETTER SAD |
D | D6 | U+0636 ARABIC LETTER DAD |
T | D8 | U+0637 ARABIC LETTER TAH |
Z | D9 | U+0638 ARABIC LETTER ZAH |
E | DA | U+0639 ARABIC LETTER AIN |
g | DB | U+063A ARABIC LETTER GHAIN |
_ | DC | U+0640 ARABIC TATWEEL |
f | DD | U+0641 ARABIC LETTER FEH |
q | DE | U+0642 ARABIC LETTER QAF |
k | DF | U+0643 ARABIC LETTER KAF |
l | E1 | U+0644 ARABIC LETTER LAM |
m | E3 | U+0645 ARABIC LETTER MEEM |
n | E4 | U+0646 ARABIC LETTER NOON |
h | E5 | U+0647 ARABIC LETTER HEH |
w | E6 | U+0648 ARABIC LETTER WAW |
Y | EC | U+0649 ARABIC LETTER ALEF MAKSURA |
y | ED | U+064A ARABIC LETTER YEH |
F | F0 | U+064B ARABIC FATHATAN |
N | F1 | U+064C ARABIC DAMMATAN |
K | F2 | U+064D ARABIC KASRATAN |
a | F3 | U+064E ARABIC FATHA |
u | F5 | U+064F ARABIC DAMMA |
i | F6 | U+0650 ARABIC KASRA |
~ | F8 | U+0651 ARABIC SHADDA |
o | FA | U+0652 ARABIC SUKUN |
` | U+0670 ARABIC LETTER SUPERSCRIPT ALEF | |
{ | U+0671 ARABIC LETTER ALEF WASLA | |
P | 81 | U+067E ARABIC LETTER PEH |
J | 8D | U+0686 ARABIC LETTER TCHEH |
V | U+06A4 ARABIC LETTER VEH | |
G | 90 | U+06AF ARABIC LETTER GAF |
The full Arabic character set can be viewed at the Unicode website:
Arabic: U+0600 to U+06FF (PDF format)
Arabic Presentation Forms-A: U+FB50 to U+FDFF (PDF format)
Arabic Presentation Forms-B: U+FE70 to U+FEFF (PDF format)
The TITUS page for U+0600 through U+06FF displays the actual characters in your browser (UTF-8 encoding).
You can test your web browser's Arabic Unicode support at Alan Woods Unicode Resources website.
The Microsoft developer website has a useful table of the Arabic Windows (1256) and ISO 8859-6 code pages and their corresponding Unicode values.
HOME | CORPUS COMPILATION | WORD FREQUENCY COUNTS | CONCORDANCING | MORPHOLOGY ANALYSIS | ARABIC LEXICON
Copyright © 2002 QAMUS LLC