Software tool provides data mining of radiology reports

VIENNA - A data-mining software platform can analyze text in German radiology reports and provide semantic links between images and textual descriptions, according to a presentation on Friday at the European Congress of Radiology (ECR).

The platform, developed via a collaboration between image software developer Averbis, the University of Freiburg, and the University of Leipzig, has a range of clinical, research, and teaching applications, said Dr. Philipp Daumke of Averbis. Daumke described the system during the Friday scientific session.

To improve access to images and free-text data in radiology reports, the developers wanted to create a tool that could provide semantic searching and data mining of radiology reports, diagnostic support through detection of similar cases, semantic links between reports and images, and quality assurance by cross-checking a diagnosis with external data, Daumke said.

The system employs a scalable text analysis platform that can be used to extract information from the German radiology reports. An image reference and context recognition component also facilitates the creation of semantic links between text and image data, according to the developers. In addition, radiologists can access key images and relevant text passages in a search engine that can be used for ad-hoc queries.

The system extracts information related to size, volume, time and date, image references, anatomical terms, descriptions, relations, negations, and abbreviations, Daumke said.

Its semantic analysis capabilities are such that a search for "lung metastasis" will also find reports with related terms such as lung metastases, metastases in the lung, metastasis in the lower part of the lung, pulmonary metastases, pulmonary metastatic relapse, pulmonary filiae, and lung filiae, he said.

"There is a huge variation [in terminology], and we are able to find all of these variations," he said.

The team evaluated the system in 133,325 radiology reports. The system was able to identify 32,533 image references and successfully retrieve 32,157 images from the local PACS server. Based on randomized validation of the identified image references, the researchers concluded that the correct reference was extracted with a mean average precision of 0.97.

Clinically, the system is used by radiologists to validate diagnoses and for looking for "best practice" reports, he said. It can also be used as a mobile application for clinicians and be integrated into portals for referring physicians.

For research, the system is used to collect patient cohorts, Daumke said. It can also be useful for teaching applications, such as creating a database for teaching cases and for presentation of lectures and talks, he said.

Page 1 of 603
Next Page