Date of Publication
12-18-2010
Document Type
Master's Thesis
Degree Name
Master of Science in Computer Science
Subject Categories
Computer Sciences
College
College of Computer Studies
Department/Unit
Computer Science
Thesis Adviser
Arnulfo P. Azcarraga
Defense Panel Chair
Nelson Marcos
Defense Panel Member
Arnulfo Azcarraga
Charibeth Cheng
Abstract/Summary
Keyword extraction is vital for Knowledge Management Systems, Information Re- trieval Systems, and Digital Libraries as well as for general browsing of the web. Keywords are often the basis of document processing methods such as clustering and retrieval since processing all the words in the document can be slow. Common models for automating the process of keyword extraction are usually done by using several statistics-based methods such as Bayesian, K-Nearest Neighbour, and Expectation-Maximization. These models are limited by word-related features that can be used since adding more features will make the models more complex and difficult to comprehend. In this research, a Neural Network, specifically a backpropagation network, will be used in generalizing the relationship of the title and the content of articles in the archive by following word features other than TF-IDF, such as position of word in the sentence, paragraph, or in the entire document, and formats such as heading, and other attributes defined beforehand. In order to explain how the backpropagation network works, a rule extraction method will be used to extract symbolic data from the resulting backpropagation network. The rules extracted can then be transformed into decision trees per- forming almost as accurate as the network plus the benefit of being in an easily comprehensible format.
Abstract Format
html
Language
English
Format
Electronic File Format
MS WORD
Accession Number
TG04916
Shelf Location
Archives, The Learning Commons, 12F Henry Sy Sr. Hall
Physical Description
x, 84 leaves ; ill. (some col.) ; 28 cm. + 1 computer optical disc.
Keywords
Back propagation; Keyword searching
Upload Full Text
wf_yes
Recommended Citation
Liu, M. S. (2010). Keyword extraction using a back propagation network and rule extraction. Retrieved from https://animorepository.dlsu.edu.ph/etd_masteral/4007