Comparison of Intern Doctors and ChatGPT in Emergency Cases Assessment

Kantar, Yusuf; İmamoğlu, Melih; Bülbül, Emre; Hayme, Serhat; Eden, Arif Onur; Bilgin, Yasin; Sari, Fatih Mehmet

doi:10.14744/cpr.2026.71836

Original Article

Yusuf Kantar¹, Melih İmamoğlu², Emre Bülbül³, Serhat Hayme⁴, Arif Onur Eden¹, Yasin Bilgin¹, Fatih Mehmet Sari¹

¹Department of Emergency Medicine, Erzincan Binali Yıldırım University, Faculty of Medicine, Erzincan, Türkiye
²Department of Emergency Medicine, Karadeniz Technical University, Faculty of Medicine, Trabzon, Türkiye
³Department of Emergency Medicine, Erciyes University, Faculty of Medicine, Kayseri, Türkiye
⁴Department of Biostatistics and Medical Informatics, Erzincan Binali Yıldırım University, Faculty of Medicine, Erzincan, Türkiye

J Clin Pract Res 2026; 48(2): 145-152 PMCID: PMC13267315 DOI: 10.14744/cpr.2026.71836

Abstract

Objective: Accurate and timely diagnosis is crucial in emergency departments because of high patient volume and the time-sensitive nature of care. In many countries, intern doctors nearing the completion of medical school frequently work in emergency departments, and after graduation they are often expected to assume critical patient care responsibilities despite limited experience. Artificial intelligence models can quickly analyze patient data and generate diagnoses, and may therefore help inexperienced physicians improve their diagnostic accuracy. This study aims to evaluate the diagnostic performance of ChatGPT-4 in emergency department case scenarios and to compare its accuracy with that of intern doctors.

Materials and Methods: This study involved intern doctors participating in the internship program during the 2024–2025 academic year. A total of 36 case-based questions, categorized by difficulty level, were administered to 155 interns and subsequently presented to ChatGPT-4. Descriptive statistics were used to summarize the data, and a one-sample t-test was conducted to compare the diagnostic accuracy of intern doctors with that of ChatGPT. Statistical significance was set at p<0.05.
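
The one-sample t-test named in the methods can be sketched as follows. This is a minimal illustration, not the authors' analysis code: the function name `one_sample_t` and the per-intern scores are hypothetical, and the reference value of 35 correct answers is an assumption (35 of 36 rounds to the reported 97.2%). The test treats ChatGPT's score as a fixed reference and asks whether the interns' mean score differs from it.

```python
import math
import statistics


def one_sample_t(sample, reference):
    """Return (t statistic, degrees of freedom) for H0: mean(sample) == reference."""
    n = len(sample)
    mean = statistics.fmean(sample)
    sd = statistics.stdev(sample)  # sample standard deviation (n - 1 denominator)
    standard_error = sd / math.sqrt(n)
    return (mean - reference) / standard_error, n - 1


# Hypothetical per-intern scores out of 36; NOT data from the study.
illustrative_scores = [18, 20, 21, 22, 24]
# Assumed reference: 35 of 36 correct, which rounds to the reported 97.2%.
chatgpt_score = 35

t_stat, df = one_sample_t(illustrative_scores, chatgpt_score)
print(f"t = {t_stat:.2f}, df = {df}")
```

A large negative t statistic would indicate that the sample mean lies well below the reference score. Obtaining a p-value additionally requires the t distribution with `df` degrees of freedom, which a statistics package would normally supply.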

Results: Intern doctors achieved an overall correct response rate of 58.3%, while ChatGPT achieved a rate of 97.2%. A statistically significant, moderate negative correlation was found between question difficulty and interns’ performance (r=-0.684; p<0.001), indicating decreased accuracy as question difficulty increased. ChatGPT consistently demonstrated significantly higher performance across all difficulty levels.

Conclusion: ChatGPT-4 may serve as a valuable diagnostic support tool in emergency departments, particularly for newly graduated physicians with limited clinical experience.

Keywords: Artificial intelligence, ChatGPT, emergency department, intern doctors, medical education.

Kantar Y, İmamoğlu M, Bülbül E, Hayme S, Eden AO, Bilgin Y, Sari FM. Comparison of Intern Doctors and ChatGPT in Emergency Cases Assessment. J Clin Pract Res. 2026;48(2):145-152. doi: 10.14744/cpr.2026.71836.

Journal Display Format:

Authors: Yusuf Kantar, Melih İmamoğlu, Emre Bülbül, Serhat Hayme, Arif Onur Eden, Yasin Bilgin, Fatih Mehmet Sari
Article Title: Comparison of Intern Doctors and ChatGPT in Emergency Cases Assessment
Journal Name: Journal of Clinical Practice and Research
Year: 2026
Volume: 48
Issue: 2
Pages: 145-152
DOI: 10.14744/cpr.2026.71836
