Show simple item record

From Language Models to Medical Diagnoses: Assessing the Potential of GPT-4 and GPT-3.5-Turbo in Digital Health

dc.contributor.authorRoos, Jonas
dc.contributor.authorWilhelm, Theresa Isabelle
dc.contributor.authorMartin, Ron
dc.contributor.authorKaczmarczyk, Robert
dc.date.accessioned2025-08-08T10:54:18Z
dc.date.available2025-08-08T10:54:18Z
dc.date.issued02.12.2024
dc.identifier.urihttps://hdl.handle.net/20.500.11811/13331
dc.description.abstractBackground: Large language models (LLMs) like GPT-3.5-Turbo and GPT-4 show potential to transform medical diagnostics through their linguistic and analytical capabilities. This study evaluates their diagnostic proficiency using English and German medical examination datasets. Methods: We analyzed 452 English and 637 German medical examination questions using GPT models. Performance metrics included broad and exact accuracy rates for primary and three-model generated guesses, with an analysis of performance against varying question difficulties based on student accuracy rates. Results: GPT-4 demonstrated superior performance, achieving up to 95.4% accuracy when considering approximate similarity in English datasets. While GPT-3.5-Turbo showed better results in English, GPT-4 maintained consistent performance across both languages. Question difficulty was correlated with diagnostic accuracy, particularly in German datasets. Conclusions: The study demonstrates GPT-4's significant diagnostic capabilities and cross-linguistic flexibility, suggesting potential for clinical applications. However, further validation and ethical consideration are necessary before widespread implementation.en
dc.format.extent13
dc.language.isoeng
dc.rightsNamensnennung 4.0 International
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectAI
dc.subjectLLM
dc.subjectmedical examination
dc.subjectChatGPT
dc.subject.ddc004 Informatik
dc.subject.ddc610 Medizin, Gesundheit
dc.titleFrom Language Models to Medical Diagnoses: Assessing the Potential of GPT-4 and GPT-3.5-Turbo in Digital Health
dc.typeWissenschaftlicher Artikel
dc.publisher.nameMDPI
dc.publisher.locationBasel
dc.rights.accessRightsopenAccess
dcterms.bibliographicCitation.volume2024, vol. 5
dcterms.bibliographicCitation.issueiss. 4
dcterms.bibliographicCitation.pagestart2680
dcterms.bibliographicCitation.pageend2692
dc.relation.doihttps://doi.org/10.3390/ai5040128
dcterms.bibliographicCitation.journaltitleAI
ulbbn.pubtypeZweitveröffentlichung
dc.versionpublishedVersion
ulbbn.sponsorship.oaUnifundOA-Förderung Universität Bonn


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

The following license files are associated with this item:

Namensnennung 4.0 International