Intelligent audio surveillance using audio acquisition, blind source separation, and audio event detection

Date of Publication

2017

Document Type

Dissertation

Degree Name

Doctor of Philosophy in Electronics and Communications Engineering

Subject Categories

Engineering

College

Gokongwei College of Engineering

Department/Unit

Electronics and Communications Engineering

Thesis Adviser

Elmer P. Dadios

Defense Panel Chair

Laurence A. Gan Lim

Defense Panel Member

Raouf N. Naguib
Alexis M. Fillone
Edwin J. Calilung
Jennifer C. Dela Cruz

Abstract/Summary

This study presents the design and implementation of an audio surveillance system using audio acquisition, blind source separation, and audio event detection. The audio acquisition from multiple USB microphones was designed for future development of an embedded computer system of some real time audio processing and analysis. It was designed using multiple threads and was implemented in Java. Thread is a sequential program that has its own thread of control and can be executed concurrently with other threads. The algorithm successfully captured audio data from multiple USB microphones. The design and implementation of blind source separation was based on independent component analysis (ICA). Independent component analysis, such as kurtosis and mutual information, were used to measure in separating the independent sources. The estimated sound sources were obtained based on the maximization of kurtosis and minimization of mutual information. The optimization process was done by using genetic algorithm and was implemented in Java programming language. The simulation was successful in separating sources up to 4 mixed signals. However, the algorithm did not work well in real recorded signals because the coefficients obtained were not enough to represent the demixing matrix. The audio event detection algorithms were implemented using mel frequency cepstral coefficients as feature vector of the audio signals. Three different classifiers were designed: adaptive neuro fuzzy inference system, neural network, and fuzzy inference system.

Abstract Format

html

Language

English

Format

Electronic

Accession Number

CDTG007287

Shelf Location

Archives, The Learning Commons, 12F Henry Sy Sr. Hall

Physical Description

1 computer disc ; 4 3/4 in.

Keywords

Algorithms; Sound analyzers; Sound--Equipment and supplies; Audio equipment test recordings

This document is currently not available here.

Share

COinS