Extracting tumour prognostic factors from a diverse electronic record dataset in genito-urinary oncology
Richard C Khor, Anthony Nguyen, John O'Dwyer, Gargi Kothari, Joseph Sia, David Chang, Sweet Ping Ng, Gillian M Duchesne, Farshad Foroudi
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS | ELSEVIER IRELAND LTD | Published : 2019
OBJECTIVES: To implement a system for unsupervised extraction of tumor stage and prognostic data in patients with genitourinary cancers using clinicopathological and radiology text. METHODS: A corpus of 1054 electronic notes (clinician notes, radiology reports and pathology reports) was annotated for tumor stage, prostate specific antigen (PSA) and Gleason grade. Annotations from five clinicians were reconciled to form a gold standard dataset. A training dataset of 386 documents was sequestered. The Medtex algorithm was adapted using the training dataset. RESULTS: Adapted Medtex equaled or exceeded human performance in most annotations, except for implicit M stage (F-measure of 0.69 vs 0.84)..View full abstract
This research was conducted as part of the Hospira Withers-Peters Fellowship, provided by the Royal Australian and New Zealand College of Radiologists.