Filipino text-to-speech system: Tagapagsalita, 2
Date of Publication
2009
Document Type
Bachelor's Thesis
Degree Name
Bachelor of Science in Computer Science
College
College of Computer Studies
Department/Unit
Computer Science
Thesis Adviser
Jocelynn W. Cu
Defense Panel Member
Charibeth Cheng
Clement Ong
Christian Echavez
Abstract/Summary
One of the main types of speech processing technologies today is Text-To-Speech (TTS) synthesis. This technology converts normal language text into speech. Many studies have been conducted to develop TTS systems for various languages. In this Filipino TTS, there are 327 diphones extracted from sets of Filipino words, 234 are found valid. Diphones will undergo pre-processing and will be compressed using Linear Predictive Coding (LPC). Through inverse LPC, the diphones can be reproduce using the coefficients and excitations stored in the codebook.
After the diphones are synthesized, its pitch, volume and duration are manipulated by a scaling factor depending on the accent mark assigned to it. Once the accent is applied to the diphone, it will be concatenated with the other diphones with the means of Overlap-Add Method (OLA) to form the output signal of the system.
25 respondents were asked to evaluate the system based on ease, syllabication, stress, articulation, and speed with the score of five being the highest and one being the lowest. The average of results for all uttered speech scored 4.453 for listening ease, 4.42 for syllabication, 3.83 for stress, 4.06 for articulation and 3.51 for speed. The linguist's average score are 3.86 for listening ease, 3.36 for syllabication, 2.3 for stress, 3 for articulation and 3.51 for speed. Also, the respondents were asked to do the accent mark test by listening to 15 Filipino words and identify the word that they heard based on the choices indicated in the survey sheet. An average score 11.21 out of 15 questions were achieved by the respondents in identifying the Filipino Heteronyms while the linguist's score was 13 out of 15.
Abstract Format
html
Language
English
Format
Accession Number
TU15152
Shelf Location
Archives, The Learning Commons, 12F, Henry Sy Sr. Hall
Physical Description
1 v. (various foliations) : ill. (some col.) ; 28 cm.
Keywords
Filipino language; Technological innovations
Recommended Citation
Jimenez, J. T., Juliano, F. S., & Silva, E. P. (2009). Filipino text-to-speech system: Tagapagsalita, 2. Retrieved from https://animorepository.dlsu.edu.ph/etd_bachelors/7657