News | Artificial Intelligence | May 21, 2024

In a new study of nearly 5,000 screening mammograms interpreted by an FDA-approved AI algorithm, patient characteristics such as race and age influenced false positive results. The study’s results, “Patient Characteristics Impact Performance of AI Algorithm in Interpreting Negative Screening Digital Breast Tomosynthesis Studies” were published today in the RSNA journal Radiology.

According to a newly-published study of nearly 5,000 screening mammograms interpreted by an FDA-approved AI algorithm, patient characteristics such as race and age influenced false positive results.

In a newly-published study of nearly 5,000 screening mammograms interpreted by an FDA-approved AI algorithm, patient characteristics such as race and age influenced false positive results. The study’s results, “Patient Characteristics Impact Performance of AI Algorithm in Interpreting Negative Screening Digital Breast Tomosynthesis Studies” were published today in the RSNA journal Radiology. Image courtesy: RSNA Radiology (Nguyen, DL and Ren, Y et al)


May 21, 2024 — According to a newly-published study of nearly 5,000 screening mammograms interpreted by an FDA-approved AI algorithm, patient characteristics such as race and age influenced false positive results. The study’s results, “Patient Characteristics Impact Performance of AI Algorithm in Interpreting Negative Screening Digital Breast Tomosynthesis Studies” were published today in Radiology, a journal of the Radiological Society of North America (RSNA).

“AI has become a resource for radiologists to improve their efficiency and accuracy in reading screening mammograms while mitigating reader burnout,” said Derek L. Nguyen, M.D., Duke Health, breast radiologist and assistant professor at Duke University in Durham, NC. “However, the impact of patient characteristics on AI performance has not been well studied,” added Nguyen.

Key Results

- In a retrospective study including 4855 breast screening patients, false-positive case scores were more likely in Black patients (odds ratio [OR] = 1.5) and less likely in Asian patients (OR = 0.7) compared with White patients.

- False-positive risk scores were more likely in Black patients (OR = 1.5) and patients with extremely dense breasts (OR = 2.8) compared with White patients and patients with fatty density breasts, respectively.

- The influence of patient characteristics on algorithm performance necessitates more demographically diverse data sets for testing and training and greater transparency.

In a written summary of the findings issued by RSNA, Nguyen said while preliminary data suggests that AI algorithms applied to screening mammography exams may improve radiologists’ diagnostic performance for breast cancer detection and reduce interpretation time, there are some aspects of AI to be aware of.

“There are few demographically diverse databases for AI algorithm training, and the FDA does not require diverse datasets for validation,” Nguyen said, adding: “Because of the differences among patient populations, it’s important to investigate whether AI software can accommodate and perform at the same level for different patient ages, races and ethnicities.”

Collaborating with Nguyen were Lars J. Grimm, M.D., M.S.,  and Joseph Y. Lo, PhD, both with Duke University School of Medicine Department of Radidology; Tyler M. Jones, B.S., Duke University Pratt School of Engineering; Samantha M. Thomas, M.S., Duke University Department of Biostastics and Bioinformatics, and for iCAD Senior Research Scientist, Yinhao Ren, Ph.D.

AI Breast Imaging Study Details

In the retrospective study, researchers identified patients with negative (no evidence of cancer) digital breast tomosynthesis screening examinations performed at Duke University Medical Center between 2016 and 2019. All patients were followed for a two-year period after the screening mammograms, and no patients were diagnosed with a breast malignancy.

The researchers randomly selected a subset of this group consisting of 4,855 patients (median age 54 years) broadly distributed across four ethnic/racial groups. The subset included 1,316 (27%) white, 1,261 (26%) Black, 1,351 (28%) Asian, and 927 (19%) Hispanic patients.

A commercially available AI algorithm interpreted each exam in the subset of mammograms, generating both a case score (or certainty of malignancy) and a risk score (or one-year subsequent malignancy risk).

“Our goal was to evaluate whether an AI algorithm’s performance was uniform across age, breast density types and different patient race/ethnicities,” Nguyen said.

Given all mammograms in the study were negative for the presence of cancer, anything flagged as suspicious by the algorithm was considered a false positive result. False positive case scores were significantly more likely in Black and older patients (71-80 years) and less likely in Asian patients and younger patients (41-50 years) compared to white patients and women between the ages of 51 and 60.

“This study is important because it highlights that any AI software purchased by a healthcare institution may not perform equally across all patient ages, races/ethnicities and breast densities,” Nguyen said. Notably, he added, “Moving forward, I think AI software upgrades should focus on ensuring demographic diversity.”

Nguyen said healthcare institutions should understand the patient population they serve before purchasing an AI algorithm for screening mammogram interpretation and ask vendors about their algorithm training.

“Having a baseline knowledge of your institution’s demographics and asking the vendor about the ethnic and age diversity of their training data will help you understand the limitations you’ll face in clinical practice,” he said. 

More information: www.rsna.org

Reference: https://doi.org/10.1148/radiol.232286


Related Content

News | Magnetic Resonance Imaging (MRI)

May 11, 2026 – At the International Society for Magnetic Resonance in Medicine (ISMRM) 2026 Annual Meeting, GE ...

Time May 11, 2026
arrow
News | FDA

May 6, 2026 — Artera, the developer of multimodal artificial intelligence (MMAI)-based prognostic and predictive cancer ...

Time May 07, 2026
arrow
News | Magnetic Resonance Imaging (MRI)

April 27, 2026 — SimonMed, one of the nation’s largest independent outpatient imaging providers, has announced the ...

Time May 04, 2026
arrow
News

April 30, 2026 — The American College of Radiology has congratulated Nicole B. Saphier, MD, on her nomination to be ...

Time April 30, 2026
arrow
News | Computed Tomography (CT)

April 23, 2026 — Royal Philips has received 510(k) clearance from the U.S. Food and Drug Administration (FDA) for its ...

Time April 30, 2026
arrow
News | X-Ray

April 29, 2026 — Results from a new study* presented at the American Roentgen Ray Society’s (ARRS) 2026 annual meeting ...

Time April 29, 2026
arrow
News | Radiology Business

April 28, 2026 — The American Society of Radiologic Technologists will award Life Member status to three longstanding ...

Time April 29, 2026
arrow
News | Cardiac Imaging

April 28, 2026 — Abbott has received U.S. Food and Drug Administration (FDA) clearance and CE Mark for its next ...

Time April 28, 2026
arrow
News | Radiology Business

April 24, 2026 — The 2026 vacancy rate for radiation therapists decreased to 11.4% and the vacancy rate for medical ...

Time April 24, 2026
arrow
News | Artificial Intelligence

April 20, 2026 — DeepTek, provider of the Augmento platform and deepc, the company behind deepcOS, have introduced a ...

Time April 23, 2026
arrow
Subscribe Now