Utilizing a Large Language Model to Answer Questions From Oncology Patients

A cross-sectional study evaluated the quality of artificial intelligence-generated responses to cancer care questions.

A cross-sectional study recently published in JAMA Network Open analyzed the quality of responses from an artificial intelligence (AI) large language model (LLM) in radiation oncology. The LLM's answers were compared with answers from established sources to determine their accuracy and readability.1

“AI LLMs demonstrate potential in simulating human-like dialogue. Their efficacy in accurate patient-clinician communication within radiation oncology has yet to be explored,” the study authors wrote.

The study authors drew questions from websites affiliated with the National Cancer Institute and the Radiological Society of North America and used them as queries for an AI LLM, ChatGPT version 3.5, to generate responses. Three radiation oncologists and three radiation physicists rated the LLM's responses for factual correctness, completeness, and conciseness and compared them with the expert answers available online.
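For readers who want to try a similar setup, the minimal sketch below sends a single patient question to ChatGPT 3.5 through the OpenAI Python client. This is an illustration only, not the authors' actual pipeline; the example question is hypothetical, whereas the study drew its questions from NCI- and RSNA-affiliated websites.

```python
# Minimal sketch (not the authors' pipeline): query ChatGPT 3.5 with a
# patient-style radiation oncology question via the OpenAI Python client.
from openai import OpenAI

client = OpenAI()  # reads the OPENAI_API_KEY environment variable

# Hypothetical example question; the study sourced its 115 questions
# from NCI- and RSNA-affiliated patient education websites.
question = "What side effects should I expect from radiation therapy?"

response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": question}],
)

print(response.choices[0].message.content)
```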

Of the 115 radiation oncology questions retrieved, the LLM's responses were rated the same as or better than the expert answers in 108 cases (94%) for relative correctness, 89 cases (77%) for completeness, and 105 cases (91%) for conciseness.

“To our knowledge, this study is one of the first to provide both domain-specific and domain-agnostic metrics to quantitatively evaluate ChatGPT-generated responses in radiation oncology,” the authors wrote. “We have shown via both sets of metrics that the LLM yielded responses comparable with those provided by human experts via online resources and were similar, and in some cases better, than answers provided by the relevant professional bodies on the internet. Overall, the responses provided by the LLM were complete and accurate.”

Although the LLM answered the questions well, it did so at a higher reading level than the expert sources. According to the authors, the mean difference was six grade levels for general radiation oncology answers but only two grade levels for modality-specific and subsite-specific answers.

“The LLM generated more complex responses to the general radiation oncology questions, with higher mean syllable and word counts, and a higher mean lexicon score. These scores between expert and the LLM responses were similar for the modality-specific and site-specific answers,” the authors added.

The study had limitations, the first being that the high reading level of the responses may present an obstacle to patients. According to the authors, the American Medical Association and the National Institutes of Health recommend that patient education resources be written at a third- to seventh-grade reading level, and not one LLM response met this criterion. However, the authors suggested that future research could examine the LLM's ability to generate tailored responses through specific prompts; techniques using multiple prompts may yield simpler responses.
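As an illustration of how such grade levels are estimated, the sketch below computes the widely used Flesch-Kincaid grade level. This is an assumption for illustration: the study's exact readability metrics are not specified here, and the syllable counter is a crude heuristic.

```python
# Illustrative sketch of one common readability measure, the Flesch-Kincaid
# grade level. Not necessarily the metric used in the study.
import re

def count_syllables(word: str) -> int:
    # Approximate syllables as runs of vowels (a crude but common heuristic).
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))

def fk_grade(text: str) -> float:
    # Flesch-Kincaid grade = 0.39*(words/sentences)
    #                      + 11.8*(syllables/words) - 15.59
    sentences = max(1, len(re.findall(r"[.!?]+", text)))
    words = re.findall(r"[A-Za-z']+", text)
    syllables = sum(count_syllables(w) for w in words)
    n = max(1, len(words))
    return 0.39 * (n / sentences) + 11.8 * (syllables / n) - 15.59

# Hypothetical answer text for demonstration only.
answer = "Radiation therapy uses high-energy beams to destroy cancer cells."
print(f"Estimated grade level: {fk_grade(answer):.1f}")
```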

Second, variations in question phrasing could change the LLM's responses. Patients also vary in their comfort with technology such as AI, as well as in their backgrounds and language proficiency. Additionally, the LLM is still an experimental model, and its responses may change over time as the technology evolves.

“This cross-sectional study found that the LLM provided mainly highly accurate and complete responses in a similar format to virtual communications between a patient and clinicians in a radiation oncology clinical environment,” the authors concluded. “Accordingly, these results suggest that the LLM has the potential to be used as an alternative to current online resources.”

Reference

1. Yalamanchili A, Sengupta B, Song J, et al. Quality of large language model responses to radiation oncology patient care questions. JAMA Netw Open. 2024;7(4):e244630. doi:10.1001/jamanetworkopen.2024.4630
