Date of Publication

12-2004

Document Type

Master's Thesis

Degree Name

Master of Science in Computer Science

Subject Categories

Computer Sciences

College

College of Computer Studies

Department/Unit

Computer Science

Thesis Adviser

Rachel Edita O. Roxas

Defense Panel Chair

Raymund C. Sison

Defense Panel Member

Allan B. Borra
Kai Shan L. Fernandez

Abstract/Summary

T2CMT is a unidirectional machine translator that translates from Tagalog to Cebuano. Its morphological analysis is based on TagSA (Tagalog Stemming Algorithm) and an affix-correspondence-based part-of-speech (POS) tagger. The POS-tagging process uses a new method, but it does not resolve ambiguity and is limited to a one-to-one mapping between words and parts of speech. The syntax analyzer accepts the data passed by the POS tagger and checks it against the formal grammar defined by the system. Transfer is implemented through affix and root transfers. The rules used in morphological synthesis are the reverse of those used in morphological analysis. A Tagalog-to-Cebuano bilingual dictionary was developed and is used by the different components of the system. T2CMT was evaluated with the Book of Genesis as input, using GTM (General Text Matcher), which is based on precision and recall. The evaluation indicates good performance, with a precision of 0.8027 (80.27%) and a recall of 0.7992 (79.92%).
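
As a rough illustration of the two measures named in the abstract, the sketch below computes word-level precision and recall between a system translation and a reference translation. It is a simplified unigram version and not the GTM tool itself, which also takes runs of contiguous matching words into account. The function name `unigram_precision_recall` and the placeholder tokens are assumptions made for this example and do not come from the thesis.

```python
from collections import Counter


def unigram_precision_recall(candidate: str, reference: str) -> tuple[float, float]:
    """Return (precision, recall) from word overlap between two sentences.

    Simplified illustration only: each word is matched at most as many
    times as it occurs in both sentences, and word order is ignored.
    """
    cand_tokens = candidate.split()
    ref_tokens = reference.split()

    # Multiset intersection: per-word minimum of the two counts.
    overlap = sum((Counter(cand_tokens) & Counter(ref_tokens)).values())

    # Precision: share of the system output that also appears in the reference.
    precision = overlap / len(cand_tokens) if cand_tokens else 0.0
    # Recall: share of the reference that the system output covers.
    recall = overlap / len(ref_tokens) if ref_tokens else 0.0
    return precision, recall


# Placeholder tokens (hypothetical, not real translation output).
p, r = unigram_precision_recall("a b c d", "a b c e f")
print(p, r)
```

Here three of the four candidate words occur in the reference, and three of the five reference words are covered, so precision and recall differ whenever the two sentences have different lengths.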

Abstract Format

html

Language

English

Format

Electronic

Accession Number

CDTG003798

Shelf Location

Archives, The Learning Commons, 12F Henry Sy Sr. Hall

Physical Description

1 computer optical disc ; 4 3/4 in.

Keywords

Machine translating; Computational linguistics; Translating and interpreting; Language and languages
