Gramatika: A grammar checker for the low-resourced Filipino language
College
College of Computer Studies
Department/Unit
Software Technology
Document Type
Conference Proceeding
Source Title
IEEE Region 10 Annual International Conference, Proceedings/TENCON
Volume
2017-December
First Page
471
Last Page
475
Publication Date
12-19-2017
Abstract
This research focuses on the implementation of Gramatika, a grammar checker designed for the Filipino language given its available resources and linguistic tools. The checker uses hybrid n-grams generated from n-grams of words, part-of-speech tags, and lemmas of grammatically correct texts. It covers a variety of error types, including those unique to Filipino: wrong word form and incorrectly merged or unmerged words. The grammar checker achieved 64% accuracy in producing the correct suggestions for erroneous phrases and 85% on error-free texts when using Part-of-Speech (POS) tags from a Hybrid POS tagger (HPOST) for Filipino. Recommendations to improve Gramatika include implementing linguistic tools such as a constituency parser, an incorrect-affix detection system, and a spell checker for the Filipino language.
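The abstract does not spell out how the hybrid n-grams are built, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes that n-grams from correct text are grouped by their POS sequence and that each slot keeps the most specific value shared by every instance: the word, otherwise the lemma, otherwise the POS tag. The function names, the `lemma:` prefix, and the tag names (`DET`, `NOUN`, `LM`, `VERB`) are hypothetical and chosen for illustration.

```python
from collections import defaultdict


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def hybrid_ngrams(sentences, n):
    """Build one hybrid pattern per POS sequence (illustrative assumption).

    sentences: list of sentences, each a list of (word, pos, lemma) tuples
    taken from grammatically correct text.
    """
    # Group word/POS/lemma n-grams that share the same POS sequence.
    groups = defaultdict(list)
    for sentence in sentences:
        for gram in ngrams(sentence, n):
            pos_key = tuple(pos for _, pos, _ in gram)
            groups[pos_key].append(gram)

    hybrids = {}
    for pos_key, grams in groups.items():
        pattern = []
        for i, tag in enumerate(pos_key):
            words = {gram[i][0].lower() for gram in grams}
            lemmas = {gram[i][2].lower() for gram in grams}
            if len(words) == 1:
                pattern.append(next(iter(words)))              # same word everywhere
            elif len(lemmas) == 1:
                pattern.append("lemma:" + next(iter(lemmas)))  # same lemma only
            else:
                pattern.append(tag)                            # fall back to POS tag
        hybrids[pos_key] = tuple(pattern)
    return hybrids


# Toy corpus with hypothetical tags and lemmas.
corpus = [
    [("ang", "DET", "ang"), ("bata", "NOUN", "bata"),
     ("ay", "LM", "ay"), ("kumain", "VERB", "kain")],
    [("ang", "DET", "ang"), ("aso", "NOUN", "aso"),
     ("ay", "LM", "ay"), ("tumakbo", "VERB", "takbo")],
]
patterns = hybrid_ngrams(corpus, 3)
```

On this toy corpus, the trigrams tagged `DET NOUN LM` collapse to the pattern `("ang", "NOUN", "ay")`: the fixed function words stay literal while the varying noun is generalized to its tag. A checker built on this idea could flag an input n-gram that matches no stored pattern, though how Gramatika scores matches and ranks suggestions is not stated in the abstract.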
Digital Object Identifier (DOI)
10.1109/TENCON.2017.8227910
Recommended Citation
Go, M., Nocon, N. S., & Borra, A. (2017). Gramatika: A grammar checker for the low-resourced Filipino language. IEEE Region 10 Annual International Conference, Proceedings/TENCON, 2017-December, 471-475. https://doi.org/10.1109/TENCON.2017.8227910
Disciplines
Computer Sciences
Keywords
Filipino language--Grammar; Computational linguistics