The Role of Artificial Intelligence Large Language Models in Literature Search Assistance to Evaluate Inguinal Hernia Repair Approaches.

Journal: Journal of laparoendoscopic & advanced surgical techniques. Part A

Published Date: Apr 26, 2025

Abstract

This study assesses the reliability of artificial intelligence (AI) large language models (LLMs) in identifying relevant literature comparing inguinal hernia repair techniques. We used LLM chatbots (Bing Chat AI, ChatGPT versions 3.5 and 4.0, and Gemini) to find comparative studies and randomized controlled trials on inguinal hernia repair techniques. The results were then compared with existing systematic reviews (SRs) and meta-analyses and checked for the authenticity of listed articles. LLMs screened 22 studies from 2006 to 2023 across eight journals, while the SRs encompassed a total of 42 studies. Through thorough external validation, 63.6% of the studies (14 out of 22), including 10 identified through Chat GPT 4.0 and 6 via Bing AI (with an overlap of 2 studies between them), were confirmed to be authentic. Conversely, 36.3% (8 out of 22) were revealed as fabrications by Google Gemini (Bard), with two (25.0%) of these fabrications mistakenly linked to valid DOIs. Four (25.6%) of the 14 real studies were acknowledged in the SRs, which represents 18.1% of all LLM-generated studies. LLMs missed a total of 38 (90.5%) of the studies included in the previous SRs, while 10 real studies were found by the LLMs but were not included in the previous SRs. Between those 10 studies, 6 were reviews, and 1 was published after the SRs, leaving a total of three comparative studies missed by the reviews. This study reveals the mixed reliability of AI language models in scientific searches. Emphasizing a cautious application of AI in academia and the importance of continuous evaluation of AI tools in scientific investigations.

Authors

Joao P G Kasakewitch

Department of Surgery, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA.
Diego L Lima

Department of Surgery, Montefiore Medical Center, The Bronx, New York, USA.
Carlos A Balthazar da Silveira

Escola Bahiana de Medicina e Saúde Pública, Salvador, Brasil.
Valberto Sanha

Department of Surgery, Universidade Federal de Ciências da Saúde de Porto Alegre, Porto Alegre, Brasil.
Ana Caroline Rasador

Escola Bahiana de Medicina e Saúde Pública, Salvador, Brasil.
Leandro Totti Cavazzola

Department of Surgery, Universidade Federal do Rio Grande Do Sul, Porto Alegre, Brasil.
Julio Mayol

Hospital Clinico San Carlos, Instituto de Investigación Sanitaria San Carlos, Universidad Complutense de Madrid, Madrid, Spain.
Flavio Malcher

Division of General Surgery, NYU Langone Health, New York, New York, USA.

Keywords

No keywords available for this article.

External Resources

View on PubMed Access via DOI PubMed (40285461)

The Role of Artificial Intelligence Large Language Models in Literature Search Assistance to Evaluate Inguinal Hernia Repair Approaches.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals