Constituent structure for Filipino: Induction through probabilistic approaches

College

College of Computer Studies

Department/Unit

Software Technology

Document Type

Conference Proceeding

Source Title

Proceedings of the 22nd Pacific Asia Conference on Language, Information and Computation, PACLIC 22

First Page

113

Last Page

122

Publication Date

12-1-2008

Abstract

The current state of Philippine linguistic resources, which include formal grammars, electronic dictionaries and corpora, is not yet sufficient to support industrial-strength language technologies. This paper discusses a computational approach to automatically estimating constituent structures from a corpus using unsupervised probabilistic methods. Two models are presented, and the results show an F1 measure above 69%. Issues and phenomena of the Filipino language are identified and discussed. © 2008 by Danniel Alcantara and Allan Borra.

Disciplines

Computer Sciences | Software Engineering

Keywords

Computational linguistics; Filipino language—Context; Electronic dictionaries

This document is currently not available here.
