A Comprehensive Behavioral Dataset for the Abstraction and Reasoning Corpus.

Journal: Scientific data

Published Date: Aug 7, 2025

Abstract

The Abstraction and Reasoning Corpus (ARC) is a visual program synthesis benchmark designed to test out-of-distribution generalization in machines. Comparing AI algorithms to human performance is essential to measure progress on these problems. In this paper, we present H-ARC (Human-ARC): a novel large-scale dataset containing solution attempts from over 1700 humans on ARC problems. The dataset spans the full set of 400 training and 400 evaluation tasks from the original ARC benchmark, and it is the largest human evaluation to date. By publishing the dataset, we contribute human responses to each problem, step-by-step behavioral action traces from the ARC user-interface, and natural-language solution descriptions of the inferred program/rule. We believe this dataset will be of value to researchers, both in cognitive science and AI, since it offers the potential to facilitate the discovery of underlying mechanisms supporting abstraction and reasoning in people. The insights to be gained from these data not only have value for cognitive science, but could in turn inform the design of more efficient, human-like AI algorithms.

Authors

Solim LeGris

Department of Psychology, NYU, New York, USA. solim.legris@nyu.edu.
Wai Keen Vong

Center for Data Science, New York University, New York, NY, USA.
Brenden M Lake

Center for Data Science, New York University, 726 Broadway, New York, NY 10003, USA. brenden@nyu.edu.
Todd M Gureckis

Department of Psychology, NYU, New York, USA.

Keywords

Algorithms Artificial Intelligence Behavior Humans Problem Solving

External Resources

View on PubMed Access via DOI PubMed (40775224)

A Comprehensive Behavioral Dataset for the Abstraction and Reasoning Corpus.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals

A Comprehensive Behavioral Dataset for the Abstraction and Reasoning Corpus.

Abstract

Authors

Keywords

External Resources

Stay Ahead of Medical AI

Popular Topics

Recent Journals