Detecting Artificial Intelligence-Generated Versus Human-Written Medical Student Essays: Semirandomized Controlled Study.

Journal: JMIR Medical Education
PMID:

Abstract

BACKGROUND: Large language models, exemplified by ChatGPT, have reached a level of sophistication that makes distinguishing between human- and artificial intelligence (AI)-generated texts increasingly challenging. This has raised concerns in academia, particularly in medicine, where the accuracy and authenticity of written work are paramount.

Authors

  • Berin Doru
    University Hospital of Paediatrics and Adolescent Medicine, St. Josef-Hospital, Ruhr University Bochum, Bochum, Germany.
  • Christoph Maier
    University Hospital of Paediatrics and Adolescent Medicine, St. Josef-Hospital, Ruhr University Bochum, Bochum, Germany.
  • Johanna Sophie Busse
    University Hospital of Paediatrics and Adolescent Medicine, St. Josef-Hospital, Ruhr University Bochum, Bochum, Germany.
  • Thomas Lücke
    University Hospital of Paediatrics and Adolescent Medicine, St. Josef-Hospital, Ruhr University Bochum, Bochum, Germany.
  • Judith Schönhoff
    Department of German Philology, General and Comparative Literary Studies, Ruhr University Bochum, Bochum, Germany.
  • Elena Enax-Krumova
    Department of Neurology, BG University Hospital Bergmannsheil gGmbH Bochum, Ruhr University Bochum, Bochum, Germany.
  • Steffen Hessler
    German Department, German Linguistics, Ruhr University Bochum, Bochum, Germany.
  • Maria Berger
    German Department, Digital Forensic Linguistics, Ruhr University Bochum, Bochum, Germany.
  • Marianne Tokic
    Department of Medical Informatics, Biometry and Epidemiology, Ruhr University Bochum, Bochum, Germany.