Feature | Information Technology | June 27, 2019 | By Greg Freiherr

Smart Algorithm Extracts Data from Radiology Reports

SIIM talk details early experience with data mining

Smart Algorithm Extracts Data from Radiology Reports
Greg Freiherr

Greg Freiherr

Radiology reports may contain information essential to figuring out a patient’s condition. But this information is often locked in free text, which can make it difficult to search, analyze and summarize report data.  That’s where natural language processing (NLP) might come in.

On June 26, 2019, IBM researcher Ashutosh Jadhav, Ph.D., described at the annual meeting of the Society for Imaging Informatics in Medicine (SIIM) in Denver an algorithm that uses artificial intelligence to extract information from the free text of radiology reports.

 “There is a lot of information hidden in these reports,” Jadhav told ITN after his presentation during the Natural Language Processing session. Included in that information are things like the physician’s findings. The extracted data might be used to automatically populate data fields in PACS.

Automatic structuring of radiology reports could unleash the power of information otherwise locked in these reports, he told an audience attending his presentation titled “Automatic Extraction of Structured Radiology Reports.” Jadhav and his colleagues are working to further improve the prototype, he said, noting that he cannot predict if or when the algorithm will be productized. “Our work is majorly focused on the research aspect,” he told ITN after his presentation. Jadhav is a research staff member at IBM Research Almaden, IBM's Silicon Valley innovation lab.


Extensive Research

In developing the NLP algorithm, Jadhav and colleagues had to overcome challenges including missing section labels, inconsistent section ordering, and inconsistent section formatting. Differences among institutions and individual radiologists should not be issues, he told ITN, because “we have a really big sample size and we are processing reports from multiple institutes.”

Radiology reports from more than 200,000 chest X-rays were used to develop the algorithm. These reports were randomly split into training and testing data sets. The training data set, comprised of about 80 percent of these reports, was used to develop NLP rules. The remaining 20 percent comprised the data set used for testing. Some 200 reports, randomly selected from the testing set, were used to determine how well the algorithm identified sections and corresponding section texts.


Greg Freiherr is a contributing editor to Imaging Technology News (ITN). Over the past three decades, he has served as business and technology editor for publications in medical imaging, as well as consulted for vendors, professional organizations, academia, and financial institutions.


Editor’s note: This article is the seventh piece in a content series by Greg Freiherr covering the Society for Imaging Informatics in Medicine (SIIM) conference in June.


Related content:

Why Blockchain Matters In Medical Imaging

DeepAAA Uses AI to Look Automatically For Aneurysms

Making AI Safe, Effective and Humane for Imaging

Cinebot: Efficient Creation of Movies and Animated Gifs for Presentation and Education Directly from PACS

5 Low-Cost Ways To Slow Hackers

Imaging on Verge of Game-changing Transformation

How to Fix Your Enterprise Imaging Network

Imaging on Verge of Game-changing Transformation

VIDEO: AI That Second Reads Radiology Reports and Deals With Incidental Findings

AI Detects Unsuspected Lung Cancer in Radiology Reports, Augments Clinical Follow-up 

Related Content

Low Doses of Radiation Promote Cancer-capable Cells
News | Radiation Dose Management | July 18, 2019
Low doses of radiation equivalent to three computed tomography (CT) scans, which are considered safe, give cancer-...
Paragon Biosciences Launches Qlarity Imaging to Advance FDA-cleared AI Breast Cancer Diagnosis System

Qlarity Imaging’s software is used to assist radiologists in the assessment and characterization of breast lesions. Imaging features are synthesized by an artificial intelligence algorithm into a single value, the QI score, which is analyzed relative to a database of reference abnormalities with known ground truth. Image courtesy of Business Wire.

Technology | Artificial Intelligence | July 18, 2019
Paragon Biosciences LLC announced the launch of its seventh portfolio company, Qlarity Imaging LLC, which was founded...
Johns Hopkins Named Qualified Provider-led Entity to Develop Criteria for Diagnostic Imaging
News | Clinical Decision Support | July 18, 2019
On June 30, 2019, the Centers for Medicare & Medicaid Services (CMS) announced the Johns Hopkins University School...
Anatomage Releases Anatomage Cloud Platform
News | Remote Viewing Systems | July 16, 2019
Anatomage Inc. released an update to the Anatomage Cloud platform that allows medical and dental professionals to...
Graphic courtesy Pixabay

Graphic courtesy Pixabay

Feature | Artificial Intelligence | July 15, 2019 | By Greg Freiherr
Siemens has long focused on automation as a way to make diagnostic equipment faster and more efficient.
Videos | Artificial Intelligence | July 12, 2019
Khan Siddiqui, M.D., founder and CEO of HOPPR, discusses the economic advantages and costs presented by...
Videos | Digital Pathology | July 11, 2019
Toby Cornish, M.D., Ph.D., associate professor and medical director of informatics at the University of Colorado Scho
FDA Clears Koios DS Breast 2.0 AI-based Software
News | Ultrasound Women's Health | July 11, 2019
Koios Medical announced its second 510(k) clearance from the U.S. Food and Drug Administration (FDA).
SimonMed Imaging Implements ProFound AI for 3-D Tomosynthesis
News | Mammography | July 10, 2019
Arizona-based SimonMed Imaging announced their implementation of the first U.S. Food and Drug Administration (FDA)-...