Gramatika: A grammar checker for the low-resourced Filipino language

College

College of Computer Studies

Department/Unit

Software Technology

Document Type

Conference Proceeding

Source Title

IEEE Region 10 Annual International Conference, Proceedings/TENCON

Volume

2017-December

First Page

471

Last Page

475

Publication Date

12-19-2017

Abstract

This research focuses on the implementation of Gramatika, a grammar checker designed for the Filipino language given its available resources and linguistic tools. The checker uses hybrid n-grams generated from n-grams of words, part-of-speech tags, and lemmas of grammatically-correct texts. It covers a variety of error types including those unique in Filipino: wrong word form, and incorrectly merged/unmerged words. The grammar checker performed 64% accuracy on producing the correct suggestions on erroneous phrases and 85% on error-free texts when using Part-of-Speech (POS) tags from a Hybrid POS tagger (HPOST) for Filipino. Recommendations to improve Gramatika is to implement linguistic tools such as constituency parser, incorrect affix detection system, and a spell checker for the Filipino language.

html

Digitial Object Identifier (DOI)

10.1109/TENCON.2017.8227910

Disciplines

Computer Sciences

Keywords

Filipino language--Grammar; Computational linguistics

Upload File

wf_no

This document is currently not available here.

Share

COinS