the automatic annotation and analysis of speech

Persian Language

Download

This chapter describes the linguistic resources included in the file pes.zip of the "Ortolang repository".

List of phonemes

Consonant Plosives

SPPAS IPA Description Examples
b b voiced bilabial
d d voiced alveolar
k k voiceless velar
g g voiced velar
p p voiceless bilabial
q q voiceless uvular
t t voiceless alveolar
G\ ɢ voiced uvular
? ʔ glottal stop

Consonant Fricatives

SPPAS IPA Description Examples
f f voiceless labiodental
h h voiceless glottal
s s voiceless alveolar
S ʃ voiceless postalveolar
v v voiced labiodental
x x voiceless velar
z z voiced alveolar
Z ʒ voiced postalveolar

Consonant Nasals

SPPAS IPA Description Examples
m m bilabial
n n alveolar

Consonant Liquids

SPPAS IPA Description Examples
l l alveolar lateral
r r alveolar trill

Affricates

SPPAS IPA Description Examples
dZ d͡ʒ voiced postalveolar
tS t͡ʃ voiceless postalveolar

Semivowels

SPPAS IPA Description Examples
j j voiced palatal

Vowels

SPPAS IPA Description Examples
a a open front unrounded
A ɒ open back rounded
e e close-mid front unrounded
i i close front unrounded
o o close-mid back rounded
u u close back rounded
y y close front rounded

Fillers

SPPAS Description
laugh laughter
noise noises, unintelligible speech
fp filled pause (euh)
dummy un-transcribed speech

Acoustic Model

The acoustic model was created by Brigitte Bigi from the HMM prototypes extracted from other languages (mainly French and Spanish). The model was then trained with 3 minutes of manually time-aligned data and 26 minutes of manually phonetized data.

UBPA at 40ms of the initial model based on prototypes is 89.83% and UBPA of the final model is 89.96%.

The model was created using a Python script available in the SPPAS package: acmtrain.py.