Rule-based approach for identifying assertions in clinical free-text data

, Nguyen, Anthony, , & (2010) Rule-based approach for identifying assertions in clinical free-text data. In Turpin, A, Scholer, F, & Trotman, A (Eds.) Proceedings of 15th Australasian Document Computing Symposium. School of Computer Science and IT, RMIT University, Australia, pp. 93-96.

Accepted Version (PDF 470kB)
_sun_2011004668.pdf.

Description

This paper presents a rule-based approach for assigning previously identified medical concepts in clinical free text to an assertion category. The task defines six assertion categories: Present, Absent, Possible, Conditional, Hypothetical and Not associated with the patient. The assertion classification algorithms were largely based on extensions of the popular NegEx and Context algorithms. In addition, the clinical terminology SNOMED CT and other publicly available dictionaries were used to classify assertions that did not fit the NegEx/Context model. The data for this task comprise discharge summaries from Partners HealthCare and from Beth Israel Deaconess Medical Centre, as well as discharge summaries and progress notes from University of Pittsburgh Medical Centre. The development set consists of 349 discharge reports, each paired with ground truth concept and assertion files, and the evaluation set consists of 477 reports. On the evaluation set, the system achieved a recall, precision and F1-measure of 0.83 each. Although the rule-based system shows promise, further improvements can be made by incorporating machine learning approaches.
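
To illustrate the kind of trigger matching that NegEx/Context-style rules rely on, here is a minimal Python sketch. It is not the system described in the paper: the trigger phrases, the five-token window, the scope-ending words and the priority order of the categories are all illustrative assumptions, and the SNOMED CT and dictionary lookups are left out entirely.

```python
import re

WINDOW = 5  # assumed scope: tokens inspected on each side of the concept
TERMINATORS = {"but", "however", "although"}  # assumed scope-ending words

# (category, triggers before the concept, triggers after the concept).
# Hypothetical trigger lists; the list order is an assumed priority.
RULES = [
    ("Not associated with the patient",
     ["family history of", "mother had", "father had"], []),
    ("Hypothetical", ["if", "return for", "should"], []),
    ("Conditional", [], ["on exertion", "when walking"]),
    ("Absent", ["no", "denies", "without", "negative for"], ["ruled out"]),
    ("Possible", ["possible", "probable", "suspicious for"], []),
]


def _has_trigger(tokens, triggers):
    """True if any trigger phrase occurs as whole words in the token window."""
    text = " ".join(tokens)
    return any(re.search(r"\b" + re.escape(t) + r"\b", text) for t in triggers)


def classify(sentence, concept):
    """Assign one assertion category to a concept found in a sentence."""
    lowered = sentence.lower()
    start = lowered.find(concept.lower())
    if start < 0:
        raise ValueError("concept not found in sentence")
    before = re.findall(r"[a-z']+", lowered[:start])
    after = re.findall(r"[a-z']+", lowered[start + len(concept):])

    # Cut each side at the terminator nearest to the concept.
    for i in range(len(before) - 1, -1, -1):
        if before[i] in TERMINATORS:
            before = before[i + 1:]
            break
    for i, token in enumerate(after):
        if token in TERMINATORS:
            after = after[:i]
            break

    before, after = before[-WINDOW:], after[:WINDOW]
    for category, pre_triggers, post_triggers in RULES:
        if _has_trigger(before, pre_triggers) or _has_trigger(after, post_triggers):
            return category
    return "Present"  # default when no trigger fires


if __name__ == "__main__":
    print(classify("The patient denies chest pain.", "chest pain"))
    print(classify("No fever, but she has a cough.", "cough"))
```

In the second call the word "but" ends the scope of "No", so the trigger does not reach "cough" and the default category applies. A real rule set would need far larger trigger lists and handling for concepts that occur more than once in a sentence.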

Impact and interest:

2 citations in Scopus

Full-text downloads:

121 since deposited on 08 Feb 2012
11 in the past twelve months

ID Code: 48508
Item Type: Chapter in Book, Report or Conference volume (Conference contribution)
ORCID iD:
Sitbon, Laurianne: orcid.org/0000-0003-2359-2515
Geva, Shlomo: orcid.org/0000-0003-1340-2802
Measurements or Duration: 4 pages
Keywords: Context, NegEx, SNOMED CT, assertion, medical concept, rule-based
ISBN: 978-1-921426-80-3
Pure ID: 32154987
Divisions: Past > QUT Faculties & Divisions > Faculty of Science and Technology
Past > QUT Faculties & Divisions > Science & Engineering Faculty
Current > Research Centres > Australian Research Centre for Aerospace Automation
Copyright Owner: Consult author(s) regarding copyright matters
Copyright Statement: This work is covered by copyright. Unless the document is being made available under a Creative Commons Licence, you must assume that re-use is limited to personal use and that permission from the copyright owner must be obtained for all other uses. If the document is available under a Creative Commons Licence (or other specified licence) then refer to the Licence for details of permitted re-use. It is a condition of access that users recognise and abide by the legal requirements associated with these rights. If you believe that this work infringes copyright, please provide details by email to qut.copyright@qut.edu.au.
Deposited On: 08 Feb 2012 01:16
Last Modified: 12 Mar 2024 00:19