Comparison of ChatGPT-4, Copilot, Bard and Gemini Ultra on an Otolaryngology Question Bank.

Journal: Clinical otolaryngology : official journal of ENT-UK ; official journal of Netherlands Society for Oto-Rhino-Laryngology & Cervico-Facial Surgery
Published Date:

Abstract

OBJECTIVE: To compare the performance of Google Bard, Microsoft Copilot, GPT-4 with vision (GPT-4) and Gemini Ultra on the OTO Chautauqua, a student-created, faculty-reviewed otolaryngology question bank.

Authors

  • Rashi Ramchandani
    Faculty of Medicine, University of Ottawa, Ottawa, Ontario, Canada.
  • Eddie Guo
    Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada.
  • Michael Mostowy
    Department of Otolaryngology, Jacobs School of Medicine and Biomedical Sciences at the University of Buffalo, Buffalo, New York, USA.
  • Jason Kreutz
    Division of Dermatology, Department of Medicine, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.
  • Nick Sahlollbey
    Department of Otolaryngology-Head and Neck Surgery, University of Calgary, Calgary, Alberta, Canada.
  • Michele M Carr
    Department of Otolaryngology, Jacobs School of Medicine and Biomedical Sciences at the University of Buffalo, Buffalo, New York, USA.
  • Janet Chung
    Department of Otolaryngology-Head and Neck Surgery, University of Toronto, Toronto, Ontario, Canada.
  • Lisa Caulley
    Department of Otolaryngology-Head and Neck Surgery, University of Ottawa, Ottawa, Ontario, Canada.