Journal article

Automated ancillary cancer history classification for mesothelioma patients from free-text clinical reports.

Richard A Wilson, Wendy W Chapman, Shawn J Defries, Michael J Becich, Brian E Chapman

J Pathol Inform | Medknow | Published : 2010


BACKGROUND: Clinical records are often unstructured, free-text documents that create information extraction challenges and costs. Healthcare delivery and research organizations, such as the National Mesothelioma Virtual Bank, require the aggregation of both structured and unstructured data types. Natural language processing offers techniques for automatically extracting information from unstructured, free-text documents. METHODS: Five hundred and eight history and physical reports from mesothelioma patients were split into development (208) and test sets (300). A reference standard was developed and each report was annotated by experts with regard to the patient's personal history of ancilla..

View full abstract