News | Artificial Intelligence | July 29, 2019

Artificial intelligence tool developed at Dana-Farber Cancer Institute uses natural language processing to rapidly assess unstructured data

New AI Tool Identifies Cancer Outcomes Using Radiology Reports

July 29, 2019 — Scientists at Dana-Farber Cancer Institute have demonstrated that an artificial intelligence (AI) tool can perform as well as human reviewers, and much more rapidly, in extracting clinical information regarding changes in tumors from unstructured radiology reports for patients with lung cancer.

The AI tool performed comparably to trained human “curators” in detecting the presence of cancer and in determining whether it was responding to treatment, stable or worsening.

The goal of the study, said corresponding author Kenneth Kehl, M.D., MPH, a medical oncologist and faculty member of the Population Sciences Department at Dana-Farber, was to determine whether AI tools can extract the most high-value cancer outcomes from radiology reports, which are a ubiquitous but unstructured data source.

Kehl noted that electronic health records (EHRs) now collect vast amounts of information on thousands of patients seen at a center like Dana-Farber. However, unless the patients are enrolled in clinical trials, information about their outcomes, such as whether their cancers grow or shrink in response to treatment, is recorded only in the text of the medical record. Historically, this unstructured information has not been amenable to computational analysis and therefore could not be used for research into the effectiveness of treatment.

Because of studies like the Profile initiative at Dana-Farber/Brigham and Women’s Cancer Center, which analyzes patient tumor samples and creates profiles that reveal genomic variants that may predict responsiveness to treatments, Dana-Farber researchers have accumulated a wealth of molecular information about patients’ cancers. “But it can be difficult to apply this information to understand what molecular patterns predict benefit from treatments without intensive review of patients’ medical records to measure their outcomes. This is a critical barrier to realizing the full potential of precision medicine,” said Kehl.

For the current study, Kehl and colleagues obtained more than 14,000 imaging reports for 1,112 patients and manually reviewed records using the “PRISSMM” framework. PRISSMM is a phenomic data standard developed at Dana-Farber that takes unstructured data from text reports in EHRs and structures them so that they can be readily analyzed. PRISSMM structures data pertaining to a patient’s pathology, radiology/imaging, signs/symptoms, molecular markers and a medical oncologist’s assessment to create a portrait of the cancer patient journey.

Human reviewers analyzed the imaging text reports and noted whether cancer was present and, if so, whether it was worsening or improving, and if the cancer had spread to specific body sites. These reports were then used to train a computational deep learning model to recognize these outcomes from the text reports. “Our hypothesis was that deep learning algorithms could use routinely generated radiology text reports to identify the presence of cancer and changes in its extent over time,” the authors wrote.
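The labels the reviewers recorded for each report could be held in a small structured record. The following is a minimal Python sketch; the class name, field names and allowed values are hypothetical illustrations drawn from the outcomes named in this article, not the PRISSMM specification or the study's actual schema.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Hypothetical status values, based on the outcomes named in the article.
ALLOWED_STATUSES = {"improving", "stable", "worsening"}


@dataclass
class ReportAnnotation:
    """One reading (by a curator or a model) of a single imaging report."""
    cancer_present: bool
    status: Optional[str] = None  # only meaningful when cancer is present
    spread_sites: List[str] = field(default_factory=list)

    def __post_init__(self) -> None:
        # A report with no cancer cannot carry a status or sites of spread.
        if not self.cancer_present and (self.status or self.spread_sites):
            raise ValueError("status and spread sites require cancer_present=True")
        if self.status is not None and self.status not in ALLOWED_STATUSES:
            raise ValueError(f"unknown status: {self.status!r}")


# Illustrative values only; not taken from any real report.
example = ReportAnnotation(
    cancer_present=True, status="worsening", spread_sites=["liver"]
)
print(example)
```

Holding human and model annotations in the same shape would make a field-by-field comparison between the two straightforward.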

The researchers compared human and computer measurements of outcomes such as disease-free survival, progression-free survival, and time to improvement or response, and found that the AI algorithm could replicate human assessment of these outcomes. The deep learning algorithms were then applied to annotate another 15,000 reports for 1,294 patients whose records had not been manually reviewed. The authors found that computer outcome measurements among these patients predicted survival with similar accuracy to human assessments among the manually reviewed patients.
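To illustrate how a time-based outcome might be derived from per-report labels, here is a simplified Python sketch. It computes only the time to the first report labeled as worsening, which is a rough proxy: the study's actual endpoint definitions are not given in this article, and a full progression-free survival calculation would also need to account for death. The function name and the label sequences are invented for illustration.

```python
from typing import List, Optional, Tuple


def days_to_first_worsening(reports: List[Tuple[int, str]]) -> Optional[int]:
    """Return days from treatment start to the first 'worsening' report.

    `reports` holds (days_since_treatment_start, status) pairs in any order.
    Returns None if no report is labeled 'worsening'.
    """
    worsening_days = [day for day, status in reports if status == "worsening"]
    return min(worsening_days) if worsening_days else None


# Made-up label sequences for one hypothetical patient.
human_labels = [(30, "improving"), (90, "stable"), (180, "worsening")]
model_labels = [(30, "improving"), (90, "stable"), (180, "worsening")]

human_days = days_to_first_worsening(human_labels)
model_days = days_to_first_worsening(model_labels)
print(human_days, model_days, human_days == model_days)
```

Running the same function over human and model labels for every patient would give two sets of outcome times that can then be compared.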

The human curators were able to annotate imaging reports for about three patients per hour, a rate at which one curator would need about six months to annotate all of the nearly 30,000 imaging reports for the patients in the cohort. By contrast, the artificial intelligence model that the researchers developed could annotate the imaging reports for the cohort in about 10 minutes, the researchers said in a report in JAMA Oncology.
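The scale of that difference can be checked with a little arithmetic. The Python sketch below uses only the figures quoted in this article and makes two assumptions that the article does not state: that the cohort is the 1,112 manually reviewed patients plus the 1,294 model-annotated patients, and that "six months" means 26 weeks.

```python
# Figures quoted in the article.
manually_reviewed_patients = 1112
model_annotated_patients = 1294
patients_per_curator_hour = 3   # "about three patients per hour"
model_minutes = 10              # "about 10 minutes"

# Assumption: the full cohort is the two patient groups combined.
total_patients = manually_reviewed_patients + model_annotated_patients

curator_hours = total_patients / patients_per_curator_hour
speedup = (curator_hours * 60) / model_minutes

# Assumption: "six months" is taken as 26 weeks.
implied_hours_per_week = curator_hours / 26

print(f"{total_patients} patients -> {curator_hours:.0f} curator hours")
print(f"implied workload: {implied_hours_per_week:.1f} hours/week over 26 weeks")
print(f"approximate speedup: {speedup:.0f}x")
```

Under these assumptions the manual effort comes to roughly 800 curator hours, which is consistent with six months of work at about 31 hours a week, and the model would be several thousand times faster.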

“To create a true learning health system for oncology and to facilitate delivery of precision medicine at scale, methods are needed to accelerate curation of cancer-related outcomes from electronic health records,” said the authors of the publication. If applied widely, the investigators said, “this technique could substantially accelerate efforts to use real-world data from all patients with cancer to generate evidence regarding effectiveness of treatment approaches.” Next steps will include testing this approach on EHR data from other cancer centers and using the data to discover which treatments work best for which patients.

The senior author of the study is Deborah Schrag, M.D., MPH, chief of the Division of Population Sciences at Dana-Farber and a medical oncologist.

For more information: www.jamanetwork.com/journals/jamaoncology

 

Reference

1. Kehl K.L., Elmarakeby H., Nishino M., et al. Assessment of Deep Natural Language Processing in Ascertaining Oncologic Outcomes From Radiology Reports. JAMA Oncology, published online July 25, 2019. doi:10.1001/jamaoncol.2019.1800

