Gramatika: A grammar checker for the low-resourced Filipino language
College of Computer Studies
IEEE Region 10 Annual International Conference, Proceedings/TENCON
This research focuses on the implementation of Gramatika, a grammar checker designed for the Filipino language given its available resources and linguistic tools. The checker uses hybrid n-grams generated from n-grams of words, part-of-speech (POS) tags, and lemmas of grammatically correct texts. It covers a variety of error types, including those unique to Filipino: wrong word form and incorrectly merged or unmerged words. When using POS tags from a Hybrid POS Tagger (HPOST) for Filipino, the grammar checker achieved 64% accuracy in producing the correct suggestions for erroneous phrases and 85% accuracy on error-free texts. Recommendations for improving Gramatika include implementing linguistic tools such as a constituency parser, an incorrect-affix detection system, and a spell checker for the Filipino language.
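The hybrid n-gram idea described above can be illustrated with a minimal sketch. This is not Gramatika's implementation: the `Token` type, the `hybrid_ngrams` function, the `layer:value` labeling, and the POS tags are all hypothetical, chosen only to show how each position of an n-gram might be represented by a word, a POS tag, or a lemma.

```python
from itertools import product
from typing import List, NamedTuple, Tuple


class Token(NamedTuple):
    """One annotated token (hypothetical structure, not from the paper)."""
    word: str
    pos: str
    lemma: str


LAYERS = ("word", "pos", "lemma")


def hybrid_ngrams(tokens: List[Token], n: int) -> List[Tuple[str, ...]]:
    """Enumerate every n-gram in which each position is drawn from
    one of the three layers (word, POS tag, or lemma)."""
    results = []
    for start in range(len(tokens) - n + 1):
        window = tokens[start:start + n]
        # Try every assignment of a layer to each position in the window.
        for choice in product(LAYERS, repeat=n):
            results.append(
                tuple(f"{layer}:{getattr(tok, layer)}"
                      for tok, layer in zip(window, choice))
            )
    return results


# "Kumain ako ng kanin" ("I ate rice"); the tags are illustrative
# placeholders, not the HPOST tag set.
sentence = [
    Token("kumain", "VERB", "kain"),
    Token("ako", "PRON", "ako"),
    Token("ng", "PART", "ng"),
    Token("kanin", "NOUN", "kanin"),
]

for gram in hybrid_ngrams(sentence, 2)[:3]:
    print(gram)
```

In a checker built along these lines, such patterns would be collected from grammatically correct text and an input phrase would be compared against them. How Gramatika actually selects, stores, and matches its hybrid n-grams is not specified in this abstract.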
Digital Object Identifier (DOI)
Go, M., Nocon, N. S., & Borra, A. (2017). Gramatika: A grammar checker for the low-resourced Filipino language. IEEE Region 10 Annual International Conference, Proceedings/TENCON, 2017-December, 471-475. https://doi.org/10.1109/TENCON.2017.8227910
Filipino language--Grammar; Computational linguistics