Information extraction from legal documents
Added Title
International Symposium on Natural Language Processing (8th : 2009)
SNLP '09
College
College of Computer Studies
Department/Unit
Computer Science
Document Type
Conference Proceeding
Source Title
2009 8th International Symposium on Natural Language Processing, SNLP '09
First Page
157
Last Page
162
Publication Date
12-28-2009
Abstract
Legal TRUTHS (TuRning Unstructured Texts to Helpful Structure) is a system that extracts relevant information from Philippine Supreme Court decisions, specifically on criminal cases. We describe here the processes involved in the development of Legal TRUTHS focusing on the issues relating to the domain and the geographical setting of the source documents, and the performance evaluation results are also presented. Pertinent information to be extracted for criminal cases such as the crime, the date and time of commission, the plaintiff, and the penalty were determined from a sample set of documents. Sections of these documents were identified for initial segmentation of the data. Automatic filtering of the data was involved in drawing out relevant information from the texts. From 25 training documents and also the same set for testing, performance showed over-all precision at 91.7%, recall at 99.5%, and F-measure at 95.6%. Testing on another 50 documents showed over-all precision at 84.3%, recall at 95.8%, and F-measure at 91.0%. © 2009 IEEE.
html
Digitial Object Identifier (DOI)
10.1109/SNLP.2009.5340925
Recommended Citation
Cheng, T., Cua, J., Tan, M., Yao, K., & Roxas, R. (2009). Information extraction from legal documents. 2009 8th International Symposium on Natural Language Processing, SNLP '09, 157-162. https://doi.org/10.1109/SNLP.2009.5340925
Disciplines
Computer Sciences
Keywords
Computational linguistics; Ad hoc networks (Computer networks)
Upload File
wf_yes