New algorithm helps identify negative report findings

Mar 20, 2018

2018 03 19 17 22 4825 Doctor Computer 400

A new open-source algorithm developed at the U.S. National Institutes of Health (NIH) can identify negative and uncertain findings in radiology reports -- and could eventually help guide radiologists as they interpret images, according to a March 15 presentation at the American Medical Informatics Association (AMIA) meeting in San Francisco.

The tool, called NegBio, could help radiologists make better use of prior images -- making them more efficient, presenter Yifan Peng, PhD, told session attendees.

"Eventually, the algorithm could be integrated into a department's reporting system and make suggestions to radiologists as they review prior images," he said.

Peng is part of a research group led by Zhiyong Lu, PhD, of the National Center for Biotechnology Information (NCBI), a division of the National Library of Medicine (NLM) at the NIH. Other authors include Xiaosong Wang, PhD; Le Lu, PhD; Dr. Mohammadhadi Bagheri; and Dr. Ronald Summers, PhD.

Lu and colleagues hope that the fact that NegBio is an open-source tool will spark further research. In clinical practice, NegBio could help downstream applications, such as the reporting system or image diagnosis system, to return fewer irrelevant (i.e., negative) results.

"We believe NegBio can contribute to the research and development in the healthcare informatics community for real-world applications," Peng said.

New approach

Negative and uncertain findings are common in radiology reports, and identifying them is as important as identifying findings that are positive for pathology, Peng said. And although many natural language processing applications have been developed in recent years that can extract findings from medical reports, distinguishing between positive, negative, and uncertain findings remains challenging.

Applications for extracting findings tend to take one of two approaches: rule-based or machine learning. Rule-based algorithms rely on negative keywords and rules; a common application is NegEx. But this type of application relies on surface text and often can't capture more complex sentence constructions. And machine learning requires manually annotated data, which are often not available and can be difficult to generalize, according to Peng.

To address these extraction applications' limitations, the researchers developed NegBio, an open-source, rule-based tool for finding negative and uncertain results in radiology reports. The algorithm uses what the team calls a "universal dependency graph" to define patterns with "a simple description of the grammatical relationships in a sentence that can be easily understood by nonlinguists," Peng said.

The group evaluated NegBio using two datasets annotated for positive findings: a new radiology set Peng and colleagues created for the study called ChestX-ray, which included 900 reports from a larger radiology dataset, and a public set of general clinical texts called OpenI, which included 3,851 radiology reports. The team also included two public benchmarking report datasets annotated for negative findings: BioScope, which included 977 reports, and PK, which included 116 reports. The researchers then compared NegBio with NegEx, measuring performance by precision, recall, and F1-score, which is a measure of a test's accuracy that considers both precision and recall.

Peng and colleagues called a finding negative if it was ruled out by the radiologists, and uncertain if it was equivocal (i.e., "no evidence of pneumothorax" versus "suspicious pneumothorax"). The researchers identified all findings and their corresponding Unified Medical Language System (UMLS) concepts using a program called MetaMap, which plots biomedical text to the UMLS. With ChestX-ray and OpenI, they focused on 14 common disease finding types identified by radiologists:

Atelectasis
Cardiomegaly
Consolidation
Edema
Effusion
Emphysema
Fibrosis
Hernia
Infiltration
Mass
Nodule
Pleural thickening
Pneumonia
Pneumothorax

NegBio achieved significant improvement on all datasets compared with NegEx, the researchers found.

NegBio compared with NegEx on 4 datasets
Dataset	NegBio	NegEx
BioScope Precision Recall F1 score	96.1% 95.7% 95.9%	70.6% 98.7% 82.3%
ChestX-ray Precision Recall F1 score	94.4% 94.4% 94.4%	82.8% 95.5% 88.7%
OpenI Precision Recall F1 score	89.8% 85% 87.3%	77.2% 84.6% 80.7%
PK Precision Recall F1 score	98.4% 88.6% 93.3%	95.1% 91.2% 93.1%

Based on their results, "the use of negation and uncertainty detection on the syntactic level successfully removes false-positive cases of 'positive' findings," according to Peng and colleagues.

The algorithm did miss double-negation terms -- for example, "Findings cannot exclude increasing pleural effusions."

"Our method currently lacks rules to recognize double negatives and, thus, generates more false negatives," Peng said. "We see double negation as an open challenge that merits further investigation."

Going forward, the researchers plan to explore NegBio's performance with clinical texts other than radiology, he said.