Reda Elbarougy

Work place: Department of Mathematics, Faculty of Science, Damietta University, Egypt

E-mail:

Website:

Research Interests: Speech Synthesis, Speech Recognition, Computer Architecture and Organization, Pattern Recognition, Computational Science and Engineering

Biography

Reda Elbarougy received his B.Sc., and M.Sc., degrees from Mansoura University, Egypt, in May 1997, and February 2006, respectively. Both were in computer science. He was with the Faculty of Science, Mansoura University from 1999 to 2009. In July 2009, he joined the Japan Advanced Institute of Science and Technology (JAIST), Japan, as a Ph.D. student. Since 2014, he has been an Assistant Professor with Mathematics Department, Faculty of Science, Damietta University. Currently he is a post-doctor researcher funded from JSPS to conduct a research in Japan Advanced Institute of Science and Technology (JAIST) from June 2017 till now. His current research interests include speech analysis, speech emotion recognition, and synthesis.

Author Articles
A Dataset for Speech Recognition to Support Arabic Phoneme Pronunciation

By Moner N. M. Arafa Reda Elbarougy A. A. Ewees G. M. Behery

DOI: https://doi.org/10.5815/ijigsp.2018.04.04, Pub. Date: 8 Apr. 2018

It is difficult for some children to pronounce some phonemes such as vowels. In order to improve their pronunciation, this can be done by a human being such as teacher or parents. However, it is difficult to discover the error in the pronunciation without talking with each student individually. With a large number of students in classes nowadays, it is difficult for teachers to communicate with students separately. Therefore, this study proposes an automatic speech recognition system which has the capacity to detect the incorrect phoneme pronunciation. This system can automatically support children to improve their pronunciation by directly asking children to pronounce a phoneme and the system can tell them if it is correct or not. In the future, the system can give them the correct pronunciation and let them practise until they get the correct pronunciation. In order to construct this system, an experiment was done to collect the speech database. In this experiment 89, elementary school children were asked to produce 28 Arabic phonemes 10 times. The collected database contains 890 utterances for each phoneme. For each utterance, fundamental frequency f0, the first 4 formants are extracted and 13 MFCC co-efficients were extracted for each frame of the speech signal. Then 7 statics were applied for each signal. These statics are (max, min, range, mean, mead, variance and standard divination) therefore for each utterance to have 91 features. The second step is to evaluate if the phoneme is correctly pronounced or not using human subjects. In addition, there are six classifiers applied to detect if the phoneme is correctly pronounced or not by using the extracted acoustic features. The experimental results reveal that the proposed method is effective for detecting the miss pronounced phoneme ("أ").

[...] Read more.
Other Articles