June 8, 2023
2 mins read

ChatGPT performs poorly at US’ urologists exam

The explanations provided by ChatGPT were longer than those provided by SASP, but “frequently redundant and cyclical in nature”, according to the authors…reports Asian Lite News

The much-acclaimed OpenAI’s ChatGPT chatbot has failed a urologist exam in the US, according to a study.

This comes at a time of growing interest in the potential role of artificial intelligence (AI) technology in medicine and healthcare.

The study, reported in the journal Urology Practice, showed that ChatGPT achieved less than a 30 per cent rate of correct answers on the American Urologist Association’s widely used Self-Assessment Study Program for Urology (SASP).

“ChatGPT not only has a low rate of correct answers regarding clinical questions in urologic practice, but also makes certain types of errors that pose a risk of spreading medical misinformation,” said Christopher M. Deibert, from University of Nebraska Medical Center.

The AUA’s Self-Assessment Study Program (SASP) is a 150-question practice examination addressing the core curriculum of medical knowledge in urology.

The study excluded 15 questions containing visual information such as pictures or graphs.

Overall, ChatGPT gave correct answers to less than 30 per cent of SASP questions, 28.2 per cent of multiple-choice questions and 26.7 per cent of open-ended questions.

The chatbot provided “indeterminate” responses to several questions. On these questions, accuracy was decreased when the LLM model was asked to regenerate its answers.

For most open-ended questions, ChatGPT provided an explanation for the selected answer.

The explanations provided by ChatGPT were longer than those provided by SASP, but “frequently redundant and cyclical in nature”, according to the authors.

“Overall, ChatGPT often gave vague justifications with broad statements and rarely commented on specifics,” Dr. Deibert said.

Even when given feedback, “ChatGPT continuously reiterated the original explanation despite it being inaccurate”.

The researchers suggest that while ChatGPT may do well on tests requiring recall of facts, it seems to fall short on questions pertaining to clinical medicine, which require “simultaneous weighing of multiple overlapping facts, situations and outcomes”.

“Given that LLMs are limited by their human training, further research is needed to understand their limitations and capabilities across multiple disciplines before it is made available for general use,” Dr. Deibert said.

“As is, utilisation of ChatGPT in urology has a high likelihood of facilitating medical misinformation for the untrained user.”

ALSO READ-US judge orders lawyers not to use ChatGPT-drafted content  

Previous Story

Here are the most expensive cities for expats

Next Story

Erdogan dials Putin, offers mediation in Ukraine conflict

Latest from -Top News

World Powers Gather for G7

The leaders had unveiled its slimmed-down agenda on Sunday, prioritising discussions on the global economy and energy security….reports Asian Lite News Several world leaders have gathered at the Canadian Rockies for the

Israel Takes Out Iran Spy Leaders

Among those killed were Mohammad Khatami, head of the IRGC Intelligence Organisation since 2022, and his deputy Mohammad Hassan Mahkaghi….reports Asian Lite News Israel on Monday announced that four high-ranking Iranian intelligence

Iran May Quit Nuclear Treaty

The NPT, a landmark international treaty that came into force in 1970, seeks to prevent the spread of nuclear weapons…reports Asian Lite News Amid rising tensions with Israel, Iran announced Monday that

Modi, Cyprus President Hold Talks

Both leaders explored avenues to deepen cooperation in trade, investment, security, and technology…reports Asian Lite News Prime Minister Narendra Modi on Monday held wide-ranging discussions with Cyprus President Nikos Christodoulides at the

Jaishankar Dials UAE, Armenia as Mideast Heats Up

EAM Jaishankar discussed the fast-evolving situation and emphasised the importance of dialogue and cooperation….reports Asian Lite News External Affairs Minister S. Jaishankar held telephonic conversations with his counterparts in the United Arab
Go toTop

Don't Miss

ChatGPT is politically biased, finds study

These multiple responses were then put through a 1000-repetition ‘bootstrap’

Intel working to build ChatGPT-like apps for customers

Together with a custom natural language chatbot interface powered by