Enhancing robotic skill acquisition with multimodal sensory data: A novel dataset for kitchen tasks.

Journal: Scientific data
PMID:

Abstract

The advent of large language models has transformed human-robot interaction by enabling robots to execute tasks via natural language commands. However, these models primarily depend on unimodal data, which limits their ability to integrate diverse and essential environmental, physiological, and physical data. To address the limitations of current unimodal dataset problems, this paper investigates the novel and comprehensive multimodal data collection methodologies which can fully capture the complexity of human interaction in the complex real-world kitchen environments. Data related to the use of 17 different kitchen tools by 20 adults in dynamic scenarios were collected, including human tactile information, EMG signals, audio data, whole-body movement, and eye-tracking data. The dataset is comprised of 680 segments (~11 hours) with data across seven modalities and includes 56,000 detailed annotations. This paper bridges the gap between real-world multimodal data and embodied AI, paving the way for a new benchmark in utility and repeatability for skill learning in robotics areas.

Authors

  • Ruochen Ren
    Shanghai Research Institute for Intelligent Autonomous Systems, Tongji University, Shanghai, 201210, China.
  • Zhipeng Wang
    Department of Pharmacy, Changzheng Hospital, Second Military Medical University, Shanghai, 200003, PR China.
  • Chaoyun Yang
    National Key Laboratory of Autonomous Intelligent Unmanned Systems, Shanghai, 201109, China.
  • Jiahang Liu
    National Key Laboratory of Autonomous Intelligent Unmanned Systems, Shanghai, 201109, China.
  • Rong Jiang
    Institute of Intelligence Applications, Yunnan University of Finance and Economics, Kunming, China.
  • Yanmin Zhou
    State Key Laboratory of Intelligent Autonomous Systems, Shanghai, 201210, China. yanmin.zhou@tongji.edu.cn.
  • Shuo Jiang
    The State Key Laboratory of Mechanical System and Vibration, School of Mechanical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China.
  • Bin He
    Clinical Translational Medical Center, The Affiliated Dongguan Songshan Lake Central Hospital, Guangdong Medical University, Dongguan, Guangdong, China.