Academic Journal
Comparative performance of humans versus GPT-4.0 and GPT-3.5 in the self-assessment program of American Academy of Ophthalmology
Title: | Comparative performance of humans versus GPT-4.0 and GPT-3.5 in the self-assessment program of American Academy of Ophthalmology |
---|---|
Authors: | Andrea Taloni, Massimiliano Borselli, Valentina Scarsi, Costanza Rossi, Giulia Coco, Vincenzo Scorcia, Giuseppe Giannaccare |
Source: | Scientific Reports, Vol 13, Iss 1, Pp 1-7 (2023) |
Publisher Information: | Nature Portfolio, 2023. |
Publication Year: | 2023 |
Collection: | LCC:Medicine, LCC:Science |
Subject Terms: | Medicine, Science |
Description: | Abstract To compare the performance of humans, GPT-4.0 and GPT-3.5 in answering multiple-choice questions from the American Academy of Ophthalmology (AAO) Basic and Clinical Science Course (BCSC) self-assessment program, available at https://www.aao.org/education/self-assessments. In June 2023, text-based multiple-choice questions were submitted to GPT-4.0 and GPT-3.5. The AAO provides the percentage of humans who selected the correct answer, which was analyzed for comparison. All questions were classified by 10 subspecialties and 3 practice areas (diagnostics/clinics, medical treatment, surgery). Out of 1023 questions, GPT-4.0 achieved the best score (82.4%), followed by humans (75.7%) and GPT-3.5 (65.9%), with significant difference in accuracy rates (always P 50% of humans), both GPT models favorably compared to humans, without reaching significancy. The word count for answers provided by GPT-4.0 was significantly lower than those produced by GPT-3.5 (160 ± 56 and 206 ± 77 respectively, P |
Document Type: | article |
File Description: | electronic resource |
Language: | English |
ISSN: | 2045-2322 |
Relation: | https://doaj.org/toc/2045-2322 |
DOI: | 10.1038/s41598-023-45837-2 |
Open Access URL: | https://doaj.org/article/fecab778661c4adba5a3cc2bb3e9e82b |
Accession Number: | edsdoj.fecab778661c4adba5a3cc2bb3e9e82b |
Database: | Directory of Open Access Journals |