SimText: Text simplification of medical literature

Date of Publication

2006

Document Type

Bachelor's Thesis

Degree Name

Bachelor of Science in Computer Science

Subject Categories

Computer Sciences

College

College of Computer Studies

Department/Unit

Computer Science

Thesis Adviser

Ethel C. Ong

Defense Panel Member

Rachel O. Roxas
Allan B. Borra

Abstract/Summary

Text simplification can be defined as any process that reduces the syntactic or lexical complexity of a text while attempting to preserve its meaning and information content. The aim of text simplification is to make text easier for human readers to comprehend or for programs to process.

SimText is a text simplification system capable of simplifying English medical literature through syntactic and lexical simplification. The level of simplification achieved depends on the syntactic and lexical information stored in the system's knowledge sources. SimText is also extensible: it can simplify text from other domains when the contents of the knowledge sources are changed according to the preferences of the user. In this way, the system is able to simplify not only general English text but also literature in specific fields, e.g., medicine and law.
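The idea of lexical simplification driven by a swappable knowledge source can be illustrated with a minimal sketch. This is not SimText's implementation, which the abstract does not describe; the lexicon entries, the `simplify_lexically` function, and the dictionary-based substitution strategy are all assumptions made for illustration.

```python
import re

# Hypothetical lexical knowledge source: complex term -> plainer equivalent.
# Keys are lowercase; the entries are illustrative, not taken from SimText.
MEDICAL_LEXICON = {
    "myocardial infarction": "heart attack",
    "hypertension": "high blood pressure",
    "renal": "kidney",
}


def simplify_lexically(text: str, lexicon: dict[str, str]) -> str:
    """Replace every whole-word occurrence of a lexicon term with its simpler form."""
    if not lexicon:
        return text
    # Try longer terms first so multi-word entries win over shorter ones.
    terms = sorted(lexicon, key=len, reverse=True)
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(term) for term in terms) + r")\b",
        re.IGNORECASE,
    )
    # Matching ignores case; the replacement is inserted as stored in the lexicon.
    return pattern.sub(lambda match: lexicon[match.group(0).lower()], text)


if __name__ == "__main__":
    sentence = "The patient has hypertension and renal failure."
    print(simplify_lexically(sentence, MEDICAL_LEXICON))
```

Under this sketch, moving to another domain would only require passing a different dictionary (for example, one mapping legal terms to plain-language equivalents), which mirrors the extensibility described above.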

Initial testing on 12 corpora showed an average increase of 49.33% in Flesch Reading Ease scores, indicating that SimText's simplification process was successful.
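For reference, the Flesch Reading Ease score is computed as 206.835 − 1.015 × (words per sentence) − 84.6 × (syllables per word), with higher scores indicating easier text. The sketch below shows one way to compute it. The syllable counter is a rough vowel-group heuristic chosen for brevity, and the function names and sample sentences are assumptions; the abstract does not state how the thesis counted syllables or tokenized text.

```python
import re


def count_syllables(word: str) -> int:
    """Rough heuristic (an assumption): one syllable per group of consecutive vowels."""
    vowel_groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(vowel_groups))


def flesch_reading_ease(text: str) -> float:
    """Flesch Reading Ease: 206.835 - 1.015 * (words/sentences) - 84.6 * (syllables/words)."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(word) for word in words)
    return (
        206.835
        - 1.015 * (len(words) / len(sentences))
        - 84.6 * (syllables / len(words))
    )


if __name__ == "__main__":
    # Illustrative sentences, not drawn from the thesis corpora.
    original = "The patient exhibited myocardial infarction."
    simplified = "The patient had a heart attack."
    print(f"original:   {flesch_reading_ease(original):.2f}")
    print(f"simplified: {flesch_reading_ease(simplified):.2f}")
```

With this heuristic the simplified sentence scores higher than the original, which is the direction of change the reported result describes.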

Abstract Format

html

Language

English

Format

Print

Accession Number

TU14560

Shelf Location

Archives, The Learning Commons, 12F, Henry Sy Sr. Hall
