The performance of ChatGPT on short-answer questions in a psychiatry examination: A pilot study

Loading...
Thumbnail Image

Supplementary material

Other Title

Authors

Hsu, Che-Wei

Author ORCID Profiles (clickable)

Degree

Grantor

Date

2024

Supervisors

Type

Journal Article

Ngā Upoko Tukutuku (Māori subject headings)

Keyword

artificial intelligence
clinical skill assessment
examination
psychiatric education

Citation

Lin, C-C., du Plooy, K., Gray, A., Brown, D., Hobbs, L., Patterson, T., Tan, V., Fridberg, D., & Hsu, C-W. The performance of ChatGPT on short-answer questions in a psychiatry examination: A pilot study. Taiwanese Journal of Psychiatry, 38(2), 94-98.https://doi.org/10.4103/TPSY.TPSY_19_24

Abstract

Objectives: We compared ChatGPT’s performance to medical students’ on psychiatry examinations and investigated whether raters could distinguish answers between them. Methods: We used a copy of short-answer questions from a psychiatry examination to compare the performance of three randomized groups – ChatGPT, student, and hybrid (student-modified ChatGPT responses). Furthermore, we investigated raters’ ability to identify response origins. Results: ChatGPT-assisted answers, both ChatGPT alone (p < 0.001) and hybrid (p < 0.001), had significantly better examination performance than did independent students work. Raters showed high accuracy in identifying the origin of the responses, correctly identifying 92% of both students’ and ChatGPT-assisted responses. But raters were only 61% accurate in making the distinction between ChatGPT and hybrid answers. Conclusion: ChatGPT showed superior performance in a psychiatry examination compared to students’ work, but raters showed high accuracy in distinguishing them. Further investigation is warranted to optimize the advantages and mitigate the drawbacks of incorporating such technology into psychiatric education and health care.

Publisher

Wolters Kluwer Medknow Publications

Link to ePress publication

DOI

https://doi.org/10.4103/TPSY.TPSY_19_24

Copyright holder

Authors.

Copyright notice

CC BY-NC-SA V4.0

Copyright license

This item appears in: