Female voice recognition using artificial neural networks and MATLAB voicebox toolbox
College
Gokongwei College of Engineering
Department/Unit
Electronics and Communications Engineering
Document Type
Article
Source Title
Journal of Telecommunication, Electronic and Computer Engineering
Volume
10
Issue
1-4
First Page
133
Last Page
138
Publication Date
1-1-2018
Abstract
Voice and speaker recognition performance is measured in terms of accuracy, speed, and robustness. These three key performance indicators depend primarily on the voice feature extraction method and the voice recognition algorithm used. This paper discusses various studies in speech recognition that have yielded high accuracy rates of 95% and above. Mel-frequency cepstral coefficients (MFCCs) extracted with the MATLAB Voicebox toolbox were used as inputs to multilayer artificial neural networks (ANNs) for a female voice recognition algorithm. The study explored the recognition performance of the neural networks using a variable number of hidden neurons and layers, and determined the architecture that provides the optimum performance in terms of recognition rate. MATLAB simulation resulted in a training and testing recognition rate of 100.00% when using a 3-hidden-layer neural network on speech samples from a single speaker, and a highest training recognition rate of 98.11% and testing recognition rate of 87.20% when using a 4-hidden-layer neural network on speech samples from several speakers. When tested with homonyms, the best recognition rate was 75.00% from a 3-hidden-layer neural network trained on a single speaker, and 81.91% from a 4-hidden-layer neural network trained on multiple speakers. The deviations in recognition rates were primarily attributed to variations in the number of input neurons, hidden layers, and neurons of the speech recognition neural network. © 2018 Universiti Teknikal Malaysia Melaka. All rights reserved.
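The setup described in the abstract, MFCC feature vectors fed to a multilayer network with a variable number of hidden layers and scored by recognition rate, can be illustrated with a minimal sketch. This is not the authors' MATLAB implementation: it is written in Python, the weights are random and untrained, and the function names (`init_mlp`, `forward`, `recognition_rate`), the 12-coefficient input, the 20 neurons per hidden layer, the tanh and softmax activations, and the 5 output classes are all assumptions chosen only for illustration.

```python
import math
import random


def init_mlp(n_inputs, hidden_sizes, n_classes, seed=0):
    """Create randomly initialised (weights, biases) pairs, one per layer.

    hidden_sizes is a list such as [20, 20, 20] for a 3-hidden-layer network.
    All sizes are hypothetical; the record does not list the actual values.
    """
    rng = random.Random(seed)
    sizes = [n_inputs, *hidden_sizes, n_classes]
    layers = []
    for n_in, n_out in zip(sizes[:-1], sizes[1:]):
        weights = [[rng.gauss(0.0, 0.1) for _ in range(n_in)] for _ in range(n_out)]
        biases = [0.0] * n_out
        layers.append((weights, biases))
    return layers


def forward(layers, features):
    """Propagate one feature vector (e.g. MFCCs) through the network.

    Hidden layers use tanh; the output layer uses softmax so the result
    can be read as class probabilities. Both choices are assumptions.
    """
    activations = list(features)
    for index, (weights, biases) in enumerate(layers):
        sums = [
            sum(w * a for w, a in zip(row, activations)) + b
            for row, b in zip(weights, biases)
        ]
        if index < len(layers) - 1:
            activations = [math.tanh(s) for s in sums]
        else:
            peak = max(sums)  # subtract the maximum for numerical stability
            exps = [math.exp(s - peak) for s in sums]
            total = sum(exps)
            activations = [e / total for e in exps]
    return activations


def recognition_rate(predicted, actual):
    """Percentage of samples whose predicted label matches the true label."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("label lists must be non-empty and of equal length")
    correct = sum(p == a for p, a in zip(predicted, actual))
    return 100.0 * correct / len(actual)


if __name__ == "__main__":
    # Compare hypothetical 3- and 4-hidden-layer architectures on a dummy input.
    for hidden in ([20, 20, 20], [20, 20, 20, 20]):
        net = init_mlp(n_inputs=12, hidden_sizes=hidden, n_classes=5)
        probs = forward(net, [0.0] * 12)
        print(len(hidden), "hidden layers ->", len(probs), "class probabilities")
```

In a real experiment the networks would first be trained on labelled MFCC vectors, and `recognition_rate` would then be evaluated separately on the training and testing sets for each architecture.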
Recommended Citation
Brucal, S. E., Africa, A. M., & Dadios, E. P. (2018). Female voice recognition using artificial neural networks and MATLAB voicebox toolbox. Journal of Telecommunication, Electronic and Computer Engineering, 10(1-4), 133-138. Retrieved from https://animorepository.dlsu.edu.ph/faculty_research/1923
Disciplines
Electrical and Computer Engineering
Keywords
Automatic speech recognition; Neural networks (Computer science)