# © 2016 and later: Unicode, Inc. and others.
# License & terms of use: http://www.unicode.org/copyright.html
# Generated using tools/cldr/cldr-to-icu/build-icu-data.xml
#
# File: xh_xh_FONIPA.txt
# Generated from CLDR
#

# Pronunciation rules for isiXhosa.
#
# Author: mjansche@google.com (Martin Jansche)
#
# These rules transcribe isiXhosa into the phoneme inventory used within the
# NCHLT Speech Corpus (https://sites.google.com/site/nchltspeechcorpus/home).
#
# The rules were tested using the NCHLT-inlang isiXhosa pronunciation dictionary
# (http://rma.nwu.ac.za/index.php/resource-catalogue/nchlt-inlang-dictionaries.html).
# They correctly account for 14,999 out of 15,000 entries in the dictionary.
#
# The NCHLT 2013 phone set does not distinguish short and long vowels and does
# not indicate tone in any way. Transcription of tone is out of scope without a
# dictionary, since tone is generally not indicated in the orthography. Nasal
# clicks are not treated as separate phonemes in the NCHLT 2013 phone set and
# are transcribed as a sequence of nasal plus click instead.
#
# One minor notational deviation from the NCHLT 2013 phone set is that we use a
# tie bar within the complex (slack voiced) clicks, e.g. ɡ\u0361ǀ instead of ɡǀ, to
# avoid ambiguity and make the phoneme inventory uniquely decodable.
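#
# Illustrative trace (editorial sketch, not part of the CLDR-generated data).
# It assumes standard ICU rule semantics: ::Lower runs first, then at each
# cursor position the first matching rule in file order applies.
#   isiXhosa → isixhosa → i s i ǁʰ ɔ s a → isiǁʰɔsa
# Here <xh> is consumed by the digraph rule (xh → ǁʰ) because that rule
# precedes the single-letter rule (x → ǁ).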
::Lower;
nyh → ɲʰ;
n { tsh → t\u0361ʃʼ;
tsh → t\u0361ʃʰ;
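# Editorial note (a sketch, assuming ICU's "before { text" context syntax):
# the context rule for <tsh> fires only when the preceding text is n, so
#   ntsh → nt\u0361ʃʼ   (ejective after a nasal)
#   tsh  → t\u0361ʃʰ    (aspirated elsewhere)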
tyh → cʰ;
bh → bʰ;
ch → ǀʰ;
dl → ɮ;
dy → ɟ;
gc → ɡ\u0361ǀ;
gq → ɡ\u0361ǃ;
gr → ɣ;
gx → ɡ\u0361ǁ;
hl → ɬ;
kh → kʰ;
kr → k\u0361x;
mh } [^l] → mʰ;  # <mhl> denotes /mɬ/ instead
nh → nʰ;
ny → ɲ;
ph → pʰ;
qh → ǃʰ;
sh → ʃ;
th → tʰ;
tl → t\u0361ɬʼ;
ts → t\u0361sʼ;
ty → cʼ;
xh → ǁʰ;
aa → | a;
ee → | e;
ii → | i;
kc → | c;
kq → | q;
mm → | m;
oo → | o;
rh → | r;
uu → | u;
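# Editorial note (a sketch, assuming ICU's cursor marker "|"): the nine rules
# from <aa> through <uu> rewrite a doubled or redundant spelling to a single
# letter and leave the cursor before it, so the result is transcribed again by
# the ordinary rules, e.g.
#   aa → a → a      ee → e → ɛ      kc → c → ǀ      rh → r → r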
a → a;
b → ɓ;
c → ǀ;
d → d;
e → ɛ;
f → f;
g → ɡ;
h → h;
i → i;
j → d\u0361ʒ;
k → kʼ;
l → l;
m → m;
n } g → ŋ;
n → n;
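# Editorial note (a sketch, assuming ICU's "text } after" context syntax):
# n becomes ŋ only when g follows; the g itself is not consumed and is
# transcribed by its own rule, e.g.
#   nga → ŋɡa      na → na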
o → ɔ;
p → pʼ;
q → ǃ;
r → r;
s → s;
t → tʼ;
u → u;
v → v;
w → w;
x → ǁ;
y → j;
z → z;