Automatic video segmentation tool for laughter detection based on audio features
Date of Publication
2010
Document Type
Bachelor's Thesis
Degree Name
Bachelor of Science in Computer Science
Subject Categories
Computer Sciences
College
College of Computer Studies
Department/Unit
Computer Science
Thesis Adviser
Merlin T. Suarez
Defense Panel Member
Jocelyn Cu
Paul Inventado
Abstract/Summary
Non-linguistic signals, specifically, laughter offers a lot of information such as cues on the emotional state of a person a topic changes in meetings. The numerous benefits of laughter, ranging from identifying activities to improving speech-to-text accuracy, have gained the interests of many researchers. Most of the recent studies regarding laughter have successfully and effectively detected laughter using different modalities. However, they have encountered several problems in manually segmenting laughter as there is no automated segmentation tool for video containing laughter. This paper presents a prototype that would automatically segment laughter from videos of meetings. The prototype uses a segmentation algorithm pattered over the model-based segmentation approach. A SVM trained on audio features (such as MFCC, Pitch, and Formants) to classify instances. In addition, this research involves the creation of a Filipino Meeting corpus which consists of videos of spontaneous meetings. The classifier achieved an accuracy of over 90%. The segmentation algorithm achieved AUC-ROC of 0.79. The prototype is able to accurately segment laughter in videos however, errors are encountered (which is usually composed of onset and offsets of laughter and other sound accompanied with breathing). In addition, the research was also able to characterize the laughter models built and define laughter in the corpus.
Abstract Format
html
Language
English
Format
Accession Number
TU18468
Shelf Location
Archives, The Learning Commons, 12F, Henry Sy Sr. Hall
Physical Description
1v. various foliations : illustrations (some colored) ; 28 cm. + one computer optical disc.
Keywords
Laughter; Nonverbal communication; Image processing--Digital techniques
Recommended Citation
Bantiling, H., Gadi, S., Lee, J., & Yang, J. (2010). Automatic video segmentation tool for laughter detection based on audio features. Retrieved from https://animorepository.dlsu.edu.ph/etd_bachelors/11194