Optimised weight programming for analogue memory-based deep neural networks.

Journal: Nature communications
Published Date:

Abstract

Analogue memory-based deep neural networks provide energy-efficiency and per-area throughput gains relative to state-of-the-art digital counterparts such as graphics processing units. Recent advances focus largely on hardware-aware algorithmic training and improvements to circuits, architectures, and memory devices. Optimal translation of software-trained weights into analogue hardware weights-given the plethora of complex memory non-idealities-represents an equally important task. We report a generalised computational framework that automates the crafting of complex weight programming strategies to minimise accuracy degradations during inference, particularly over time. The framework is agnostic to network structure and generalises well across recurrent, convolutional, and transformer neural networks. As a highly flexible numerical heuristic, the approach accommodates arbitrary device-level complexity, making it potentially relevant for a variety of analogue memories. By quantifying the limit of achievable inference accuracy, it also enables analogue memory-based deep neural network accelerators to reach their full inference potential.

Authors

  • Charles Mackin
    IBM Research-Almaden, 650 Harry Road, San Jose, CA, USA. charles.mackin@ibm.com.
  • Malte J Rasch
    State Key Laboratory of Cognitive Neuroscience and Learning, IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing 100875, China, and wusi@bnu.edu.cn malte.rasch@bnu.edu.cn.
  • An Chen
    Radiology Department, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, HaiNing Rd.100, Shanghai, 200080, China.
  • Jonathan Timcheck
    Stanford University, 450 Serra Mall, Stanford, CA, 94305, USA.
  • Robert L Bruce
    IBM Research-Yorktown Heights, 1101 Kitchawan Road, Yorktown Heights, NY, USA.
  • Ning Li
    Department of Respiratory and Critical Care Medicine, Center for Respiratory Medicine, the Fourth Affiliated Hospital of School of Medicine, and International School of Medicine, International Institutes of Medicine, Zhejiang University, Yiwu, China.
  • Pritish Narayanan
    IBM Research-Almaden, 650 Harry Road, San Jose, CA, USA.
  • Stefano Ambrogio
    IBM Research-Almaden, 650 Harry Road, San Jose, CA, USA.
  • Manuel Le Gallo
    IBM Research - Zurich, Säumerstrasse 4, 8803, Rüschlikon, Switzerland.
  • S R Nandakumar
    IBM Research - Zurich, Säumerstrasse 4, 8803, Rüschlikon, Switzerland.
  • Andrea Fasoli
    IBM Research-Almaden, 650 Harry Road, San Jose, CA, USA.
  • Jose Luquin
    IBM Research-Almaden, 650 Harry Road, San Jose, CA, USA.
  • Alexander Friz
    IBM Research-Almaden, 650 Harry Road, San Jose, CA, USA.
  • Abu Sebastian
    IBM Research - Zurich, Säumerstrasse 4, 8803, Rüschlikon, Switzerland. ase@zurich.ibm.com.
  • Hsinyu Tsai
    IBM Research-Almaden, 650 Harry Road, San Jose, CA, USA.
  • Geoffrey W Burr