Professional Documents
Culture Documents
Quran
Karim Aly Mohammad Ahmed H. Kandil
Systems and Biomedical Engineering Department Systems and Biomedical Engineering Department
Faculty of Engineering, Cairo university Faculty of Engineering, Cairo university
Giza, Egypt Giza, Egypt
karimaly08@gmail.com line 5: email address or ORCID
Abstract— A consistency analysis performed for one of the II. SEGMENTATION TECHNIQUES
famous Tajweed rules in the Holy Quran – Mudood rules -and
tested on records of one of the famous reference reciters – The core of this work is how to segment the medd pattern
Sheikh El-Hosary – trying to find a consistent boundaries to from the speech signal of the Quranic utterance, there are
evaluate other new recitation learners, a vowel detection many segmentation techniques addressed by AE Sakran [3] on
algorithm used to detect the lengthening time of the detected a review paper of segmentation techniques, the segmentation
Mudood, which applied on a dataset of 105 minutes of Quran approaches are different based on the speech features that used
records of Sheikh El-Hosary. The consistency result is normally to identify the segmentation boundaries such; short-time
distributed around the mean lengthening time as all human signal energy, short-time average zero crossing, spectral
activities with standard division of 58.2 msec and minimum centroid, spectral flux and so on [4] [5] [6].
consistency of 84.6% for 68% of the detected Mudood.
The speech features used in this work are the Mel-
Keywords—Consistency analysis, Mudood rules, Vowel Frequency Cepstral Coefficient (MFCC), short time energy,
detection. and time duration of the detected pattern. The MFCC is one of
the main speech features in the frequency domain, which it
I. INTRODUCTION scales the frequency in order to match more closely what the
It’s important for Quran readers to recite it with proper human ear can hear [7], and the short time energy is one of the
rules, these recitation rules called Tajweed rules, Tajweed main speech features in the time domain [7].
means articulating every letter from its articulation point and A mean square error function used to compare the MFCC
giving the letter its rights and dues of characteristics [1]. There and the short time energy between the vowels speech files and
are some of these rules repeated a lot through the Quran the target Aya sound file; which the minimum result of the
verses, one of these famous rules is the Mudood rules mean square error function presents the best matching
(Lengthening). Mudood means lengthening the sound with a similarity. The time duration of the matching patterns presents
letter of medd letters [1]. a good feature to neglect many matching patterns that have
There are three of medd letters in the Arabic language; alif time duration less than certain time threshold related to the
Sakinah preceded by fathah () َقال, ya’ Sakinah preceded by medd lengthening time of the reciter.
kasrah ( )قِيلand wow Sakinah preceded by dhammah ([ )هُو1].
The lengthening time change according to the position of the III. DESCRIPTION OF THE SYSTEM
medd letter between other adjacent letters, which it’s may be A vowel detection methodology applied to spot the
lengthened two, four, five and six counts. The time of each lengthening rules. The system block diagram explained in
count depends on the reciter speed but in the same time the Figure 1, which;
reciter should be consistent in his recitation with the same
count time [1]. The syllables in Arabic language classified to Verses
sound
5 combinations of consonant(C), vowels(V) and long vowels files
(V:) as follow: CV, CV:, CVC, CV: C and CVCC. [2], The
Medd represents the long vowel part of these syllables.
Database Mudood Mudood
In this work, we test the Mudood rules consistency level of Verses detection spotting
Consistency
of Sheikh El-Hosary; one of accredited reciters in the Islamic analysis
texts algorithm algorithm
community to be as a reference level for new Quran learners.
This paper is organized as follows; Section 2 describes the
used segmentation technique, Section 3 presents the Vowels
consistency computation System description, Section 4 sound
presents the system results and discussion, Section 5 presents files
the conclusion of this work.
Fig. 1. System block diagram
Figure 3 represents the consistency histogram of medd of medd, and 68% of the spotted medd data have a consistency
i: ( )المد بالياءresults. level higher than 85.4%
Table 3 represents the consistency analysis results of the
As explained in the figure 3 the consistency histogram is
almost normally distributed around the mean unit time of the medd of u: ()المد بالواو
Fig. 3. Consistency histogram of vowel a: ()مد بالياء Fig. 4. Consistency histogram of vowel u: ()مد بالواو
Figure 4 represents the consistency histogram of medd of
u: ( )المد بالواوresults.
As explained in the figure 4 the consistency histogram is Table 4 represents the consistency analysis results of all
almost normally distributed around the mean unit time of the medd types.
medd, and 68% of the spotted medd data have a consistency
level higher than 81.7%.
TABLE 4. consistency analysis of all medd types
As explained in the figure 5 and table 4 The consistency is [4] A. Hossain, N. Nahid, N. N. Khan, D. C. Gomes, and S. M. Mugab,
normally distributed around the mean unit time of the “Automatic silence/unvoiced/voiced classification of bangla velar
phonemes: New approach,” 8th ICCIT, Dhaka, 2005.
lengthening count, which represent one of the human activity
[5] E. A. Kaur and E. T. Singh, “Segmentation of continuous punjabi
performance have a mean and a standard division, a mean time speech signal into syllables,” in Proceedings of the World Congress on
of one lengthening count of 397 msec and a standard division Engineering and Computer Science, vol. 1. Citeseer, 2010, pp. 20–22.
of 58.2 msec, a worst consistency of about 84.6% calculated [6] M. Kalamani, S. Valarmathy, and S. Anitha, “Hybrid speech
at the boundary of mean – std, which represents a worst segmentation algorithm for continuous speech recognition.”,
consistency of 68% of the detected Mudood patterns, a International Journal on Applications of Information and
consistency of 69.3% calculated at the boundary of mean – Communication Engineering Volume 1: Issue 1: January 2015,
Pages:39-46
2std, which represents a worst consistency of 95% of the
[7] L. R. Rabiner and R. W. Schafer, Theory and Applications of Digital
detected Mudood patterns, a consistency of 53.9% calculated Speech, USA, NJ, Upper Saddle River:Prentice-Hall, 2010.
at the boundary of mean – 3std, which represents the worst
[8] Wikipedia contributors. "Normal distribution." Wikipedia, The Free
consistency of 99.7% of the detected Mudood patterns. Encyclopedia. Wikipedia, The Free Encyclopedia, 10 Jul. 2021. Web.
The consistency is larger in the case of Vowel a: ( المد 10 Jul. 2021.
)باأللفthan Vowel i: ( )المد بالياءand Vowel u: ())المد بالواو, which
represents minimum consistency of 87.3% at mean – std, and
the consistency is larger in the case of Vowel i: ( )المد بالياءthan
Vowel u: ()المد بالواو, which represents minimum consistency
of 85.4% at mean – std.
V. CONCLUSION
The time duration of one lengthening count is variable and
it’s an expected result because it’s represents a human activity,
which has a mean and standard division, but the main result is
that we can define a confidence boundary to the variability of
this time related to a reference reciter like Sheikh El-Hosary,
which represent a judgement boundary to the this rule for new
learners.
The worst confidence scores for Sheikh El-Hosary are
84.6%, 69.3% and 53.9% at the boundary of 68%, 95% and
99.7% of the detected lengthening patterns respectively.