An AI Agent for Fully Automated Multi-Omic Analyses.

Journal: Advanced science (Weinheim, Baden-Wurttemberg, Germany)
Published Date:

Abstract

With the fast-growing and evolving omics data, the demand for streamlined and adaptable tools to handle bioinformatics analysis continues to grow. In response to this need, Automated Bioinformatics Analysis (AutoBA) is introduced, an autonomous AI agent designed explicitly for fully automated multi-omic analyses based on large language models (LLMs). AutoBA simplifies the analytical process by requiring minimal user input while delivering detailed step-by-step plans for various bioinformatics tasks. AutoBA's unique capacity to self-design analysis processes based on input data variations further underscores its versatility. Compared with online bioinformatic services, AutoBA offers multiple LLM backends, with options for both online and local usage, prioritizing data security and user privacy. In comparison to ChatGPT and open-source LLMs, an automated code repair (ACR) mechanism in AutoBA is designed to improve its stability in automated end-to-end bioinformatics analysis tasks. Moreover, different from the predefined pipeline, AutoBA has adaptability in sync with emerging bioinformatics tools. Overall, AutoBA represents an advanced and convenient tool, offering robustness and adaptability for conventional multi-omic analyses.

Authors

  • Juexiao Zhou
  • Bin Zhang
    Department of Psychiatry, Sleep Medicine Center, Nanfang Hospital, Southern Medical University, Guangzhou, China.
  • Guowei Li
    Center for Clinical Epidemiology and Methodology, Guangdong Second Provincial General Hospital, Guangzhou, China.
  • Xiuying Chen
    Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia.
  • Haoyang Li
  • Xiaopeng Xu
    Computational Bioscience Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Makkah, Saudi Arabia.
  • Siyuan Chen
    First author: Department of Computer Science, Columbia University in the City of New York, 10027; second, fourth, and sixth authors: Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853; third author: Department of Mechanical Engineering, Columbia University; fifth author: Uber AI Labs, San Francisco 94103; seventh author: Plant Pathology and Plant-Microbe Biology Section, School of Integrative Plant Science, Cornell University; and eighth author: Department of Mechanical Engineering and Institute of Data Science, Columbia University.
  • Wenjia He
    School of Software at Shandong University, China.
  • Chencheng Xu
    Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia.
  • Liwei Liu
    Shenzhen Key Laboratory of Ultrafast Laser Micro/Nano Manufacturing, Key Laboratory of Optoelectronic Devices and Systems of Ministry of Education/Guangdong Province, College of Physics and Optoelectronic Engineering, Shenzhen University, Shenzhen 518060, China.
  • Xin Gao
    Department of Computer Science, New Jersey Institute of Technology, Newark, New Jersey, USA.