Which deep learning model can best explain object representations of within-category exemplars?

Journal: Journal of vision
PMID:

Abstract

Deep neural network (DNN) models realize human-equivalent performance in tasks such as object recognition. Recent developments in the field have enabled testing the hierarchical similarity of object representation between the human brain and DNNs. However, the representational geometry of object exemplars within a single category using DNNs is unclear. In this study, we investigate which DNN model has the greatest ability to explain invariant within-category object representations by computing the similarity between representational geometries of visual features extracted at the high-level layers of different DNN models. We also test for the invariability of within-category object representations of these models by identifying object exemplars. Our results show that transfer learning models based on ResNet50 best explained both within-category object representation and object identification. These results suggest that the invariability of object representations in deep learning depends not on deepening the neural network but on building a better transfer learning model.

Authors

  • Dongha Lee
    BK21 PLUS Project for Medical Science, Yonsei University College of Medicine, Seoul, Republic of Korea; Department of Nuclear Medicine, Department of Radiology, Department of Psychiatry, Severance Hospital, Yonsei University College of Medicine, Seoul, Republic of Korea; Severance Biomedical Science Institute, Yonsei University College of Medicine, Seoul, Republic of Korea.