The Role of Artificial Intelligence Large Language Models in Literature Search Assistance to Evaluate Inguinal Hernia Repair Approaches.

Journal: Journal of laparoendoscopic & advanced surgical techniques. Part A
Published Date:

Abstract

This study assesses the reliability of artificial intelligence (AI) large language models (LLMs) in identifying relevant literature comparing inguinal hernia repair techniques. We used LLM chatbots (Bing Chat AI, ChatGPT versions 3.5 and 4.0, and Gemini) to find comparative studies and randomized controlled trials on inguinal hernia repair techniques. The results were then compared with existing systematic reviews (SRs) and meta-analyses, and the listed articles were checked for authenticity. The LLMs returned 22 studies published from 2006 to 2023 across eight journals, while the SRs encompassed a total of 42 studies. External validation confirmed 63.6% of the studies (14 out of 22) as authentic: 10 identified through ChatGPT 4.0 and 6 via Bing AI, with an overlap of 2 studies between them. Conversely, 36.4% (8 out of 22) were revealed as fabrications produced by Google Gemini (Bard), and two (25.0%) of these fabrications were mistakenly linked to valid DOIs. Four (28.6%) of the 14 real studies were included in the SRs, which represents 18.2% of all LLM-generated studies. The LLMs missed 38 (90.5%) of the 42 studies included in the previous SRs, while 10 real studies were found by the LLMs but were not included in the previous SRs. Among those 10 studies, 6 were reviews and 1 was published after the SRs, leaving three comparative studies that the reviews had missed. This study reveals the mixed reliability of AI language models in scientific literature searches. It underscores the need for cautious application of AI in academia and for continuous evaluation of AI tools in scientific investigations.
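The proportions in the abstract can be recomputed from the raw counts it reports. The following is a minimal sketch of that arithmetic, not the authors' analysis code; the variable names and the `pct` helper are illustrative assumptions, and only the counts themselves come from the abstract.

```python
# Counts as stated in the abstract; all names here are hypothetical.
LLM_TOTAL = 22          # studies listed by the LLMs
CHATGPT4_REAL = 10      # authentic studies from ChatGPT 4.0
BING_REAL = 6           # authentic studies from Bing AI
OVERLAP = 2             # authentic studies returned by both
FAKE_WITH_VALID_DOI = 2 # fabrications linked to valid DOIs
SR_TOTAL = 42           # studies included in the systematic reviews
REAL_IN_SRS = 4         # authentic LLM studies also present in the SRs
REVIEWS = 6             # LLM-only studies that were reviews
PUBLISHED_AFTER = 1     # LLM-only studies published after the SRs


def pct(part: int, whole: int) -> float:
    """Percentage of `part` in `whole`, rounded to one decimal place."""
    return round(100 * part / whole, 1)


authentic = CHATGPT4_REAL + BING_REAL - OVERLAP   # union of the two sets
fabricated = LLM_TOTAL - authentic
missed_by_llms = SR_TOTAL - REAL_IN_SRS
llm_only = authentic - REAL_IN_SRS
comparative_missed_by_srs = llm_only - REVIEWS - PUBLISHED_AFTER

summary = {
    "authentic_pct": pct(authentic, LLM_TOTAL),
    "fabricated_pct": pct(fabricated, LLM_TOTAL),
    "fake_with_valid_doi_pct": pct(FAKE_WITH_VALID_DOI, fabricated),
    "real_in_srs_pct_of_real": pct(REAL_IN_SRS, authentic),
    "real_in_srs_pct_of_all": pct(REAL_IN_SRS, LLM_TOTAL),
    "missed_by_llms_pct": pct(missed_by_llms, SR_TOTAL),
}

if __name__ == "__main__":
    for name, value in summary.items():
        print(f"{name}: {value}%")
    print(f"comparative studies missed by the SRs: {comparative_missed_by_srs}")
```

Percentages are rounded to one decimal place, so values may differ in the last digit from figures that were truncated instead.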

Authors

  • Joao P G Kasakewitch
    Department of Surgery, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA.
  • Diego L Lima
    Department of Surgery, Montefiore Medical Center, The Bronx, New York, USA.
  • Carlos A Balthazar da Silveira
    Escola Bahiana de Medicina e Saúde Pública, Salvador, Brasil.
  • Valberto Sanha
    Department of Surgery, Universidade Federal de Ciências da Saúde de Porto Alegre, Porto Alegre, Brasil.
  • Ana Caroline Rasador
    Escola Bahiana de Medicina e Saúde Pública, Salvador, Brasil.
  • Leandro Totti Cavazzola
    Department of Surgery, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brasil.
  • Julio Mayol
    Hospital Clinico San Carlos, Instituto de Investigación Sanitaria San Carlos, Universidad Complutense de Madrid, Madrid, Spain.
  • Flavio Malcher
    Division of General Surgery, NYU Langone Health, New York, New York, USA.
