Building a Filipino colloquialism translator using sequence-to-sequence model
Added Title
IEEE Region Annual International Conference, Proceedings (10th)
TENCON
College
College of Computer Studies
Department/Unit
Software Technology
Document Type
Conference Proceeding
Source Title
IEEE Region 10 Annual International Conference, Proceedings/TENCON
Volume
2018-October
First Page
2199
Last Page
2204
Publication Date
7-2-2018
Abstract
Colloquialism in the Philippines has been prominently used in day-to-day conversations. Its vast emergence is evident especially on social media platforms but poses issues in terms of understandability to certain groups. For this research, machine translators have been implemented to fill in that gap. The translators cover Filipino Textspeak or Shortcuts, Swardspeak or Gay-lingo, Conyo, and Datkilab-implemented on Tensorflow library and Moses tool. Implementing in Tensorflow achieved 85.88 BLEU score when evaluated to the training data and 14.67 to the test data, while Moses garnered 95.27 BLEU score on training data and 79.91 on test data. Analyses on both implementations include advantages and disadvantages in using each one. Through the analyses and development of this research, it is recommended to implement the following in the future: addition of colloquialism samples, experimentation on sequence-to-sequence configurations, applying Graphical User Interface (GUI) to the translators, implementing the translators to Natural Language Processing (NLP) tools, and to deploy the translators as a web application.
html
Digitial Object Identifier (DOI)
10.1109/TENCON.2018.8650118
Recommended Citation
Nocon, N. S., Kho, N. D., & Arroyo, J. (2018). Building a Filipino colloquialism translator using sequence-to-sequence model. IEEE Region 10 Annual International Conference, Proceedings/TENCON, 2018-October, 2199-2204. https://doi.org/10.1109/TENCON.2018.8650118
Disciplines
Computer Sciences
Keywords
Machine translating; Colloquial language--Translating; Filipino language--Spoken Filipino--Translating
Upload File
wf_no