SimText: Text simplication of medical literature
Date of Publication
2006
Document Type
Bachelor's Thesis
Degree Name
Bachelor of Science in Computer Science
Subject Categories
Computer Sciences
College
College of Computer Studies
Department/Unit
Computer Science
Thesis Adviser
Ethel C. Ong
Defense Panel Member
Rachel O. Roxas
Allan B. Borra
Abstract/Summary
Text simplification can be defined as any process that reduces the syntactic or lexical complexity of a text while attempting to preserve its meaning and information content. The aim text simplification is to make text easier to comprehend for human readers or process by programs.
SimText is a text simplification system that is capable of simplifying English medical literature through syntactic and lexical simplification. The level of syntactic and lexical simplification achieved is dependent on the syntactic and lexical information stored in the knowledge sources by the system. SimText is also extensible, as it can simplify other domains by changing the contents of the knowledge sources according to the preferences of the user. Through these, the system is able to simplify not only general English text but also literatures in specific fields, i.e. medicine, legal documents.
An average of 49.33% increase in Flesch Readability Ease Scores was seen after initial testing on 12 corpora proving the successful text simplification process of SimText.
Abstract Format
html
Language
English
Format
Accession Number
TU14560
Shelf Location
Archives, The Learning Commons, 12F, Henry Sy Sr. Hall
Recommended Citation
Damay, J. S., Lojico, G. D., Lu, K. L., & Tarantan, D. B. (2006). SimText: Text simplication of medical literature. Retrieved from https://animorepository.dlsu.edu.ph/etd_bachelors/14392