Singing voice transformation system
Date of Publication
2006
Document Type
Bachelor's Thesis
Degree Name
Bachelor of Science in Computer Science
Subject Categories
Computer Sciences
College
College of Computer Studies
Department/Unit
Computer Science
Thesis Adviser
Clement Y. Ong
Defense Panel Member
Joel P. Ilao
Jesus E. Gonzales
Karlo Shane O. Campos
Abstract/Summary
Speech processing is a broad field of study with a variety of applications such as speech recognition, analysis and synthesis. One of the less explored fields of study is its application on the singing voice. Singing voice transformation deals with modification of the characteristics seen in the frequency spectrum of an input singing voice signal such as the pitch, amplitude and spectral shape so that a transformed voice signal is given as output. The thesis is direct application of singing voice transformation using various digital signal processing techniques. The application is a singing voice transformation system where the characteristics of the user's singing voice such as the fundamental frequency, amplitude, and spectral shaped are modified such that they resemble the characteristics of the target voice while preserving the quality of the transformed voice in terms of continuity, smoothness and its overall perceptibility. Quantitative and qualitative tests are done to asses the performance of the system in transforming the fundamental frequency, amplitude and spectral shape of the user's singing voice in terms of quality and accuracy of the transformed voice. Specifically, the Mean Opinion Score (MOS) method of subjective testing proves that the system's performance on the male-to-male, male-to-female, female-to-female, and female-to-male voice transformation is satisfactory. From a scale of 1 to 5, with 5 being the highest score, the system has a performance rating of 3.7 in terms of quality and 4 in terms of accuracy. Moreover, the most notable performance of the system is on complete transformation of the user's voice into the target voice, i.e. the fundamental frequency, amplitude, and spectral shape characteristics of the user's voice are all transformed to resemble those of the target voice's characteristics. The system is able to perfectly transform the user's voice into the target voice. Furthermore, the quantitative tests prove that the system is able to perform fundamental frequency
Abstract Format
html
Language
English
Format
Accession Number
TU13493
Shelf Location
Archives, The Learning Commons, 12F, Henry Sy Sr. Hall
Physical Description
1 v. (various foliations) : ill. ; 28 cm.
Keywords
Speech processing systems; Digital signal processing; Signal procesing--Digital techniques; Electronic music
Recommended Citation
Bullen, J., Carandang, W. M., & Villamin, J. A. (2006). Singing voice transformation system. Retrieved from https://animorepository.dlsu.edu.ph/etd_bachelors/8958