A generative vision model that trains with high data efficiency and breaks text-based CAPTCHAs.

Journal: Science (New York, N.Y.)
Published Date:

Abstract

Learning from a few examples and generalizing to markedly different situations are capabilities of human visual intelligence that are yet to be matched by leading machine learning models. By drawing inspiration from systems neuroscience, we introduce a probabilistic generative model for vision in which message-passing-based inference handles recognition, segmentation, and reasoning in a unified way. The model demonstrates excellent generalization and occlusion-reasoning capabilities and outperforms deep neural networks on a challenging scene text recognition benchmark while being 300-fold more data efficient. In addition, the model fundamentally breaks the defense of modern text-based CAPTCHAs (Completely Automated Public Turing test to tell Computers and Humans Apart) by generatively segmenting characters without CAPTCHA-specific heuristics. Our model emphasizes aspects such as data efficiency and compositionality that may be important in the path toward general artificial intelligence.

Authors

  • Dileep George
    Vicarious,Union City,CA 94587.dileep@vicarious.comwww.vicarious.com.
  • Wolfgang Lehrach
    Vicarious AI, 2 Union Square, Union City, CA 94587, USA.
  • Ken Kansky
    Vicarious AI, 2 Union Square, Union City, CA 94587, USA.
  • Miguel Lázaro-Gredilla
    Vicarious AI, 2 Union Square, Union City, CA 94587, USA. dileep@vicarious.com miguel@vicarious.com.
  • Christopher Laan
    Vicarious AI, 2 Union Square, Union City, CA 94587, USA.
  • Bhaskara Marthi
    Vicarious AI, 2 Union Square, Union City, CA 94587, USA.
  • Xinghua Lou
    Vicarious AI, 2 Union Square, Union City, CA 94587, USA.
  • Zhaoshi Meng
    Vicarious AI, 2 Union Square, Union City, CA 94587, USA.
  • Yi Liu
    Department of Interventional Therapy, Ningbo No. 2 Hospital, Ningbo, China.
  • Huayan Wang
    Vicarious AI, 2 Union Square, Union City, CA 94587, USA.
  • Alex Lavin
    Vicarious AI, 2 Union Square, Union City, CA 94587, USA.
  • D Scott Phoenix
    Vicarious AI, 2 Union Square, Union City, CA 94587, USA.