Towards artificial general intelligence via a multimodal foundation model.

Journal: Nature communications
Published Date:

Abstract

The fundamental goal of artificial intelligence (AI) is to mimic the core cognitive activities of human. Despite tremendous success in the AI research, most of existing methods have only single-cognitive ability. To overcome this limitation and take a solid step towards artificial general intelligence (AGI), we develop a foundation model pre-trained with huge multimodal data, which can be quickly adapted for various downstream cognitive tasks. To achieve this goal, we propose to pre-train our foundation model by self-supervised learning with weak semantic correlation data crawled from the Internet and show that promising results can be obtained on a wide range of downstream tasks. Particularly, with the developed model-interpretability tools, we demonstrate that strong imagination ability is now possessed by our foundation model. We believe that our work makes a transformative stride towards AGI, from our common practice of "weak or narrow AI" to that of "strong or generalized AI".

Authors

  • Nanyi Fei
    Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China.
  • Zhiwu Lu
    Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China. luzhiwu@ruc.edu.cn.
  • Yizhao Gao
    Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China.
  • Guoxing Yang
    Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China.
  • Yuqi Huo
    Beijing Key Laboratory of Big Data Management and Analysis Methods, Beijing, China.
  • Jingyuan Wen
    School of Pharmacy, Faculty of Medical and Health Sciences, The University of Auckland, Auckland, New Zealand.
  • Haoyu Lu
    Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China.
  • Ruihua Song
    Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China.
  • Xin Gao
    Department of Computer Science, New Jersey Institute of Technology, Newark, New Jersey, USA.
  • Tao Xiang
  • Hao Sun
    Department of Gastrointestinal Surgery, Harbin Medical University Cancer Hospital, Harbin, China.
  • Ji-Rong Wen
    Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China. jrwen@ruc.edu.cn.