From Virtual Tutors to Professional Identity: Generative AI and Large Language Models in Medical Education
DOI:
https://doi.org/10.12775/JEHS.2025.86.66953Keywords
medical education, AI, chatgpt, clinical reasoning, analysis of clinical studies, professionalism, current medical knowledgeAbstract
Background: Generative artificial intelligence (AI), particularly large language models (LLMs), is rapidly transforming medical education. These tools can act as virtual tutors, generate teaching materials and support clinical simulations, but they also raise concerns regarding accuracy, bias, academic integrity, data protection and professional identity formation. Objective: This narrative review aims to synthesise current knowledge on the use of generative AI - with a focus on LLMs - in medical education, identify key benefits and risks, and discuss implications for curricula and professionalism in medicine and other health professions. Methods: Publications from 2021–2025 were included if they addressed applications of generative AI/LLMs in medical or health professions education, or discussed their impact on curricula, assessment, professionalism or future physician competencies. Original empirical studies, reviews, conceptual papers and policy/guideline documents were included. Data were extracted and synthesised narratively according to four domains: areas of application, benefits, risks/limitations and recommendations/frameworks. Results: Generative AI is being used to support self-directed learning (on-demand explanations, personalised practice questions), clinical simulations and virtual patients, faculty work in content generation and assessment, and research in medical education. Reported benefits include increased accessibility, personalisation, scalability and opportunities to practise clinical reasoning and communication in low-stakes environments. Major risks comprise hallucinations and bias, threats to academic integrity, potential deskilling and over-reliance, and privacy and confidentiality concerns.
References
William Hersh. Generative Artificial Intelligence: Implications for Biomedical and Health Professions Education. Annual Review of Biomedical Data Science Volume 8, 2025, https://doi.org/10.1146/annurev-biodatasci-103123-094756 April 09, 2025
Thompson RAM, Shah YB, Aguirre F, Stewart C, Lallas CD, Shah MS. Artificial Intelligence Use in Medical Education: Best Practices and Future Directions. Curr Urol Rep. 2025 May 29;26(1):45. doi:10.1007/s11934-025-01277-1. PMID: 40439780; PMCID: PMC12122599.
Luordo, D.; Torres Arrese, M.; Tristán Calvo, C.; Shani Shani, K.D.; Rodríguez Cruz, L.M.; García Sánchez, F.J.; Lagares Gómez-Abascal, A.; Rubio García, R.; Delgado Jiménez, J.; Pérez Carreras, M.; et al. Application of Artificial Intelligence as an Aid for the Correction of the Objective Structured Clinical Examination (OSCE). Appl. Sci. 2025, 15, 1153. https://doi.org/10.3390/app15031153
Almansour, M.; Alfhaid F.; Generative artificial intelligence and the personalization of health professional education: A narrative review. PMID: 39093806 doi:10.1097/MD.0000000000038955; Aug 02, 2024;103 (31): e38955.
Scherr R, Halaseh FF, Spina A, Andalib S, Rivera R. ChatGPT Interactive Medical Simulations for Early Clinical Education: Case Study. JMIR Med Educ. 2023 Nov 10;9:e49877. doi: 10.2196/49877. PMID: 37948112; PMCID: PMC10674152.
Pendergrast T, Chalmers Z. Anki Tagger: A Generative AI Tool for Aligning Third-Party Resources to Preclinical Curriculum. JMIR Med Educ. 2023 Sep 20;9:e48780. doi: 10.2196/48780. PMID: 37728965; PMCID: PMC10551781.
Skryd A, Lawrence K, ChatGPT as a Tool for Medical Education and Clinical Decision-Making on the Wards: Case Study, JMIR Form Res 2024;8:e51346, doi: 10.2196/51346, PMID: 38717811, PMCID: 11112466
Metze K, Morandin-Reis RC, Lorand-Metze I, Florindo JB. Bibliographic Research with ChatGPT may be Misleading: The Problem of Hallucination. J Pediatr Surg. 2024 Jan;59(1):158. doi: 10.1016/j.jpedsurg.2023.08.018. Epub 2023 Aug 30. PMID: 37735041.
Metze K, Morandin-Reis RC, Lorand-Metze I, Florindo JB. Bibliographic research with large language model ChatGPT-4: instability, hallucinations and sometimes alerts. Clinics (Sao Paulo). 2024 Oct 15;79:100409. doi: 10.1016/j.clinsp.2024.100409. PMID: 39413495; PMCID: PMC11530843.
Scherr R, Halaseh FF, Spina A, Andalib S, Rivera R. ChatGPT Interactive Medical Simulations for Early Clinical Education: Case Study. JMIR Med Educ. 2023 Nov 10;9:e49877. doi: 10.2196/49877. PMID: 37948112; PMCID: PMC10674152.
Sheth, U.; Lo, M.; McCarthy, J.; Baath, N.; Last, N.; Guo, E.; Monteiro, S.; Sibbald, M. Understanding the Role of Large Language Model Virtual Patients in Developing Communication and Clinical Skills in Undergraduate Medical Education. Int. Med. Educ. 2025, 4, 39. https://doi.org/10.3390/ime4040039
Laupichler MC, Rother JF, Grunwald Kadow IC, Ahmadi S, Raupach T. Large Language Models in Medical Education: Comparing ChatGPT- to Human-Generated Exam Questions. Acad Med. 2024 May 1;99(5):508-512. doi: 10.1097/ACM.0000000000005626. Epub 2023 Dec 28. PMID: 38166323.
Komasawa N, Yokohira M. Generative Artificial Intelligence (AI) in Medical Education: A Narrative Review of the Challenges and Possibilities for Future Professionalism. Cureus. 2025 Jun 18;17(6):e86316. doi: 10.7759/cureus.86316. PMID: 40688936; PMCID: PMC12276793.
Metze K, Morandin-Reis RC, Lorand-Metze I, Florindo JB. Bibliographic Research with ChatGPT may be Misleading: The Problem of Hallucination. J Pediatr Surg. 2024 Jan;59(1):158. doi: 10.1016/j.jpedsurg.2023.08.018. Epub 2023 Aug 30. PMID: 37735041.
Wong RSY. ChatGPT in medical education: promoting learning or killing critical thinking? Education in Medicine Journal. 2024;16(2):177–183. https://doi.org/10.21315/eimj2024.16.2.13
Cione, Nicholas J.; Pillow, Malford T. MD, MEd. Creating More Reliable Large Language Models in Medical Education. Academic Medicine 100(9):p 1001, September 2025. | DOI: 10.1097/ACM.0000000000006058
Vrdoljak J, Boban Z, Vilović M, Kumrić M, Božić J. A Review of Large Language Models in Medical Education, Clinical Decision Support, and Healthcare Administration. Healthcare (Basel). 2025 Mar 10;13(6):603. doi: 10.3390/healthcare13060603. PMID: 40150453; PMCID: PMC11942098.
Wong K, Fayngersh A, Traba C, et al. (May 31, 2024) Using ChatGPT in the Development of Clinical Reasoning Cases: A Qualitative Study. Cureus 16(5): e61438. doi:10.7759/cureus.61438
Hadi A, Tran E, Nagarajan B, Kirpalani A (2024) Evaluation of ChatGPT as a diagnostic tool for medical learners and clinicians. PLoS ONE 19(7): e0307383. https://doi.org/10.1371/journal.pone.0307383
Öncü, S., Torun, F. & Ülkü, H.H. AI-powered standardised patients: evaluating ChatGPT-4o’s impact on clinical case management in intern physicians. BMC Med Educ 25, 278 (2025). https://doi.org/10.1186/s12909-025-06877-6
Hui, Z., Zewu, Z., Jiao, H. et al. Application of ChatGPT-assisted problem-based learning teaching method in clinical medical education. BMC Med Educ 25, 50 (2025). https://doi.org/10.1186/s12909-024-06321-1
Thesen, T., Park, S.H. A generative AI teaching assistant for personalized learning in medical education. npj Digit. Med. 8, 627 (2025). https://doi.org/10.1038/s41746-025-02022-1
Simoni, J., Urtubia-Fernandez, J., Mengual, E. et al. Artificial intelligence in undergraduate medical education: an updated scoping review. BMC Med Educ 25, 1609 (2025). https://doi.org/10.1186/s12909-025-08188-2
Yang Y et al. Beyond Multiple-Choice Accuracy: Real-World Challenges of Implementing Large Language Models in Healthcare. Annual Review of Biomedical Data Science 2025. annualreviews.org https://doi.org/10.1146/annurev-biodatasci-103123-094851
Usuyama N, et al. Biomedical Natural Language Processing in the Era of Large Language Models. Annual Review of Biomedical Data Science. 2025. https://www.annualreviews.org/doi/10.1146/annurev-biodatasci-071023-023244
Chan JTN, Kwek RK. Uncovering Bias and Variability in How Large Language Models Attribute Cardiovascular Risk. Frontiers in Digital Health. 2025. https://doi.org/10.3389/fdgth.2025.1508461
Jaleel A, et al. The Impact of ChatGPT on Academic Integrity in Medical Education. Frontiers in Education. 2024. https://doi.org/10.3389/feduc.2024.1356163
Ekaterina K, Ana M, Maia Z. Academic Integrity Within the Medical Curriculum in the Age of Generative Artificial Intelligence. Health Sci Rep. 2025 Feb 19;8(2):e70489. doi:10.1002/hsr2.70489. PMID: 39980818; PMCID: PMC11839754.
TEQSA. Gen AI – Academic Integrity and Assessment Reform. Gen AI Knowledge Hub. 2025. https://www.teqsa.gov.au/gen-ai-academic-integrity-and-assessment-reform
Natali C, et al. AI-Induced Deskilling in Medicine: A Mixed Method Literature Review for Setting a New Research Agenda. Artificial Intelligence Review. 2025. https://doi.org/10.1007/s10462-025-11352-1
Wong RSY. ChatGPT in Medical Education: Promoting Learning or Killing Critical Thinking? Education in Medicine Journal. 2024;16(2):177–183. https://doi.org/10.21315/eimj2024.16.2.13
Budzyń K, et al. Endoscopist Deskilling Risk After Exposure to Artificial Intelligence in Colonoscopy: A Multicentre, Observational Study. Lancet Gastroenterology & Hepatology. 2025.
Zhui L, Fenghe L, Xuehu W, Qining F, Wei R. Ethical Considerations and Fundamental Principles of Large Language Models in Medical Education: Viewpoint. J Med Internet Res. 2024 Aug 1;26:e60083. doi:10.2196/60083. PMID: 38971715; PMCID: PMC11327620.
Mbakwe AB, Lourentzou I, Celi LA, Mechanic OJ, Dagan A (2023) ChatGPT passing USMLE shines a spotlight on the flaws of medical education. PLOS Digit Health 2(2): e0000205. https://doi.org/10.1371/journal.pdig.0000205
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Kinga Marciniak, Sabina Sciążko-Gancarczyk, Jagoda Węgrzyn, Dominika Żukowiecka-Sęga, Maciej Gancarczyk

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
The periodical offers access to content in the Open Access system under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0
Stats
Number of views and downloads: 48
Number of citations: 0