Open domain continuous Filipino speech recognition: Challenges and baseline experiments

College

College of Computer Studies

Department/Unit

Computer Technology

Document Type

Conference Proceeding

Source Title

IEICE Transactions on Information and Systems

Volume

E97-D

Issue

9

First Page

2443

Last Page

2452

Publication Date

1-1-2014

Abstract

In this paper, a new database suitable for HMM-based automatic Filipino speech recognition is described for the purpose of training a domain-independent, large-vocabulary continuous speech recognition system. High-performance speech recognition systems are known to depend on a high-quality speech database for training; lacking such a database, previous work on Filipino speech recognition had to contend with serious data sparsity issues. In this paper we alleviate such sparsity through appropriate data analysis that makes the evaluation results more reliable. The best system is identified by its low word error rate on a cross-validation set containing almost three hours of unseen speech data. Language-dependent problems are discussed, and their impact on accuracy is analyzed. The approach is currently data-driven; however, it serves as a competent baseline model for future development. Copyright © 2014 The Institute of Electronics, Information and Communication Engineers.

Digital Object Identifier (DOI)

10.1587/transinf.2013EDP7442

Recommended Citation

Ang, F., Guevara, R., Miyanaga, Y., Cajote, R., Ilao, J. P., Bayona, M., & Laguna, A. B. (2014). Open domain continuous Filipino speech recognition: Challenges and baseline experiments. IEICE Transactions on Information and Systems, E97-D (9), 2443-2452. https://doi.org/10.1587/transinf.2013EDP7442

Disciplines

Computer Sciences

Keywords

Automatic speech recognition; Filipino language—Data processing

Faculty Research Work
