Between human and AI: assessing the reliability of AI text detection tools.

Journal: Current Medical Research and Opinion
Published Date:

Abstract

OBJECTIVE: Large language models (LLMs) such as ChatGPT-4 have raised critical questions about whether their output can be distinguished from human-generated content. In this study, we evaluated the effectiveness of online detection tools in distinguishing ChatGPT-4-generated text from human-written text.

Authors

  • Valentina Bellini
    Anesthesiology, Critical Care and Pain Medicine Division, Department of Medicine and Surgery, University of Parma, Viale Gramsci 14, 43126, Parma, Italy.
  • Federico Semeraro
    European Resuscitation Council, Niel, Belgium; Department of Anesthesia, Intensive Care and Prehospital Emergency, Maggiore Hospital Carlo Alberto Pizzardi, Bologna, Italy. Electronic address: federico.semeraro@erc.edu.
  • Jonathan Montomoli
    Department of Anesthesia and Intensive Care, Infermi Hospital, Romagna Local Health Authority, Rimini, Italy.
  • Marco Cascella
    Department of Medicine, Surgery and Dentistry, University of Salerno, 84081, Baronissi, Italy.
  • Elena Bignami
    Department of Anesthesia and Intensive Care, IRCCS San Raffaele Scientific Institute, Milan, Italy. Electronic address: bignami.elena@hsr.it.