Professional Documents
Culture Documents
Hausa Orthography LIN 207 - 1
Hausa Orthography LIN 207 - 1
This note brings together basic information about the Latin script and its use for
the Hausa language. It aims to provide a brief, descriptive summary of the modern,
printed orthography and typographic features, and to advise how to write Hausa
using Unicode.
Hausa has numerous dialects, each of which may pronounce letters differently. The
pronunciation in this document is generally based on the phonemic transcriptions
provided by Wiktionary, although in some places we describe differences with
other dialects, especially Kano, with more phonetic information.
The centre of the native speaker population is in northern Nigeria and southern
Niger, but Hausa is spoken across the width of the Sahel region, and mostly in
mutually intelligible form. There are significant minorities in Benin, Cameroon,
and Chad, and as a trade language it also reaches into Ghana, Togo, and parts of
Sudan. It is estimated that there are 100-150 million speakers.
Orthographic development & variants
This orthography was introduced by the British around the beginning of the 20th
century, and aligned with the Pan-Nigerian alphabet in the 1980s.5
There are or have been a number of variant practices for writing Hausa boko. They
include:
Language code ha
Script type alphabet
Total characters 16 + 30
Letters 9
Combining marks 4
Punctuation 3
Native digits 0
Possible other 30
Character counts exclude
ASCII.
Vowels 5 letters
Case distinction yes
Cursive script no
Text direction ltr
Baseline romn
Word separator space
Wraps at word
Hyphenation ?
Justification ?
Native speakers 43,928,100
Region afr
See the comparison table
See the key
Basic features
The Latin script is an alphabet. This means that it is largely phonetic in nature,
where each letter represents a basic sound. See the table to the right for a brief
overview of features for the modern Hausa orthography using the Latin script.
There is one systematic difference between the orthographies used in Niger and
Nigeria (see basicconsonants), but difficulties in accommodating non-ASCII
characters give rise to a number of variant representations. See Orthographic
development & variants.
Hausa text runs left-to-right in horizontal lines. Words are separated by spaces.
The orthography is bicameral. The visual forms of letters don't usually interact.
Hausa has 21 consonant letters (42 counting uppercase), one of which can be
written in two different ways. A number of sounds are written using digraphs.
Hausa has 10 vowel letters (20 including uppercase). Diphthongs are written using
digraphs.
Long and short vowel sounds are distinctive phonetically, but are not reflected in
the orthography.
Combining marks
Show
Punctuation
Show
Phonology
Click on the sounds to reveal locations in this document where they are mentioned.
Vowel sounds
Plain vowels
i iːu uːe eːo oːa aː
Diphthongs
iuuiai au
Consonant sounds
post-
labi dent alveol alveol retrofl palat glott
al al ar ar ex al velar al
pb td cɟ kɡ ʔ
ɓ ɗ cʼ kʷ ɡʷ
stop kʲ gʲ
kʼ kʷʼ
kʲʼ
affricate t͡ sʼ t͡ ʃ d͡ ʒ
f sz ʃʒ h
fricative
fʲ
nasal m n
approxim w l j
ant j̰
trill/flap ɾ ɽ
Tone
Hausa uses 3 tones, high, low and falling. They are not indicated in normal text 1,
eg.
Structure
Hausa has 3 syllable types: CV, CVV, and CVC, where VV can be a long vowel or
a diphthong.1 The long vs. short vowel distinction is phonemically important,
however when a syllable with a long vowel acquires and final consonant, the
vowel is shortened.
Consonant clusters may occur where syllables are side by side, but not within a
syllable. Gemination is, however, a distinctive feature.1
Vowels
Hausa has 10 vowel letters (20 including uppercase). Diphthongs are written using
digraphs.
Long and short vowel sounds are distinctive phonetically, but are not reflected in
the orthography.
Vowel letters
5 vowel letters are used, each with an upper and lower case form.
ii iː0069
uu uː0075
ee eː0065
oo oː006F
aa aː0061
5
I0049
U0055
E0045
O004F
A0041
Although long and short vowel sounds are phonemically distinctive, Hausa doesn't
distinguish between them in writing, eg.uƙuba ʔù.kʼúː.bàː misery, anguish
There was an attempt at one point to introduce vowel doubling to indicate long
vowels in Niger, but that was dropped.
Diphthongs
iuiu0069
0075
uiui0075
0069
aiai0061
0069
auau0061
0075
Long and short vowel sounds are distinctive phonetically, but are not reflected in
the orthography.
This section maps Hausa vowel sounds to common graphemes in the Latin
orthography. Click on a grapheme to find other mentions on this page (links
appear at the bottom of the page). Click on the character name to see examples
and for detailed descriptions of the character(s) shown.
Plain vowels
i iː
Diphthongs
iu
iu [U+0069 LATIN SMALL LETTER
I + U+0075 LATIN SMALL LETTER U]
ui
Tones
However, in academic and education text the tone can be indicated using one of the
following characters, representing high, low, and falling, respectively.
́ rare0301
̀ rare0300
̂ rare0302
Consonants
bb0062
ɓɓ0253
tt0074
dd0064
ɗɗ0257
kk006B
ƙkʼ0199
gɡ0067
B0042
Ɓ0181
T0054
D0044
Ɗ018A
K004B
Ƙ0198
G0047
both
ct͡ ʃ0063
jʒ (d͡ ʒ)006A
both
C0043
J004A
ff0066
ss0073
zz007A
hh0068
F0046
S0053
Z005A
H0048
both
mm006D
nn006E
both
M004D
N004E
ww0077
rɽ ɾ0072
ll006C
yj̰ 0079
ƴj̰ 01B4
ʼʔ02BC
W0057
R0052
L004C
Y0059
Ƴ01B3
Digraphs
A number of the consonant sounds in Hausa are written using multiple letters.
tst͡ sʼ sʼ0074
0073
kwkʷ006B
0077
kyc006B
0079
ƙwkʷʼ0199
0077
shʃ0073
0068
ʼyj̰ 02BC
0079
The last digraph in the list above is included here because it is the standard way to
write that sound in Nigeria (as opposed to the form in the previous section which is
used in Niger). Other implosives and ejectives may also be written as digraphs
(see Ejectives & implosives).
Allophonic pronunciations
Ejectives and implosives can be represented using special characters, but may also
be represented by a combination of apostrophe and the basic character 5, or often
just by the basic character itself. For example,ɗaya ɗá.já onemight be also
writtendʼayaor simplydaya
Wikipedia uses the special character forms, but the BBC, CRI, and VOA Hausa
pages just use ordinary characters for the ejectives.
The following table lists some single characters that may be written using a normal
letter and apostrophe. Note how the order of the character and apostrophe is
reversed in the final example.
Apostrophe
On its own, the apostrophe represents a glottal stop, eg.jamaʼa (d)ʒà.má.ʔà public,
communityIt is not written in initial position to represent a glottal stop.
It is also used in some cases to spell certain sounds as described in Ejectives &
implosives.
This section maps Hausa consonant sounds to common graphemes in the Latin
orthography. Click on a grapheme to find other mentions on this page (links
appear at the bottom of the page). Click on the character name to see examples
and for detailed descriptions of the character(s) shown.
Sounds listed as 'infrequent' are allophones, or sounds used for foreign words, etc.
Stops
p
Affricates
t͡ sʼ
Fricatives
f
Nasals
m
This section describes typographic features related to digits, dates, currencies, etc.
Writing styles
How are fonts grouped into recognisable writing styles? How is each writing style
used?
You can experiment with examples using the Hausa (boko) character app.
Transforming characters
Is the orthography bicameral? Are there other character pairings, especially when
transforms are needed to convert between the two?
Hausa is bicameral, and applications may need to enable transforms to allow the
user to switch between cases.
Graphemes
Grapheme clusters
Emphasis
How are emphasis and highlighting achieved? If lines are drawn alongside, over
or through the text, do they need to be a special distance from the text itself? Is it
important to skip characters when underlining, etc? How do things change for
vertically set text?
Other punctuation
Punctuation not already mentioned, such as dashes, connectors, separators, scare
quotes, etc.
Other inline text decoration
Any other form of highlighting or marking of text, such as underlining, numeric
overbars, etc. What characters or methods (eg. text decoration) are used to convey
information about a range of text? If lines are drawn alongside, over or through
the text, do they need to be a special distance from the text itself? Is it important to
skip characters when underlining, etc? How do things change for vertically set
text?
This section describes typographic features related to word boundaries, phrase &
section boundaries, bracketed text, quotations & citations, emphasis, abbreviation,
ellipsis & repetition, inline notes & annotations, other punctuation, and other
inline text decoration.
Word boundaries
The concept of 'word' is difficult to define in any language (see What is a word?).
Here, a word is a vaguely-defined, but recognisable semantic unit that is typically
smaller than a phrase and may comprise one or more syllables.
Words can also be hyphenated. This is especially, but not solely, true for words
that repeat the same sound, eg.wuƙi-wuƙi wù.kʼìː.wú.kʼíː children fidgetingdushi-
dushi dú.ʃí.dú.ʃí difficult to see, hazyatone-janar ʔà.tóː.nè.(d)ʒá.nàr attorney
general
What characters are used to indicate the boundaries of phrases, sentences, and
sections?
6
,002C
;003B
:003A
.002E
?003F
!0021
both
(0028
)0029
start end
standar ( [U+0028 LEFT ) [U+0029 RIGHT
d PARENTHESIS] PARENTHESIS]
Quotations & citations
“201C
”201D
‘2018
’2019
Hausa texts may use quotation marks around quotations. Of course, due to
keyboard design, quotations may also be surrounded by ASCII double and single
quote marks.
start end
initial “ [U+201C LEFT ” [U+201D RIGHT
DOUBLE DOUBLE QUOTATION
QUOTATION MARK] MARK]
neste ‘ [U+2018 LEFT ’ [U+2019 RIGHT
d SINGLE QUOTATION SINGLE QUOTATION
MARK] MARK]
Styling initials
Does the script use special styling of the initial letter of a line or paragraph, such
as for drop caps or similar? How about the size relationship between the large
letter and the lines alongide? where does the large letter anchor relative to the
lines alongside? is it normal to include initial quote marks in the large letter? is
the large letter really a syllable? Are dropped, sunken, and raised types found?
etc.
Are there special rules about the way text wraps when it hits the end of a line?
Does line-breaking wrap whole 'words' at a time, or characters, or something else
(such as syllables in Tibetan and Javanese)? What characters should not appear at
the end or start of a line, and what should be done to prevent that? Is hyphenation
used, or something else? What rules are used? What difficulties exist?
Does the script have special requirements for baseline alignment between mixed
scripts and in general? Is line height special for this script? Are there other
aspects that affect line spacing, or positioning of items vertically within a line?
This section describes typographic features related to general page layout &
progression; grids & tables, notes, footnotes, etc, forms & user interaction, and
page numbering, running headers, etc.
Online resources
1. Wikipedia
2. BBC News
3. Voice of America
4. CRI Online
5. Deutsche Welle
2Peter T. Daniels and William Bright, The World's Writing Systems, Oxford
University Press, ISBN 0-19-507993-0✓