The -MDA: An Invariant to Shifting, Scaling, and Rotating Variance for 3D Object Recognition Using Diffractive Deep Neural Network.

Journal: Sensors (Basel, Switzerland)
PMID:

Abstract

The diffractive deep neural network (DNN) can efficiently accomplish 2D object recognition based on rapid optical manipulation. Moreover, the multiple-view DNN array (MDA) possesses the obvious advantage of being able to effectively achieve 3D object classification. At present, 3D target recognition should be performed in a high-speed and dynamic way. It should be invariant to the typical shifting, scaling, and rotating variance of targets in relatively complicated circumstances, which remains a shortcoming of optical neural network architectures. In order to efficiently recognize 3D targets based on the developed DNN, a more robust MDA (-MDA) is proposed in this paper. Through utilizing a new training strategy to tackle several random disturbances introduced into the optical neural network system, a trained -MDA model constructed by us was numerically verified, demonstrating that the training strategy is able to dynamically recognize 3D objects in a relatively stable way.

Authors

  • Liang Zhou
    Department of Otorhinolaryngology, Eye & ENT Hospital, Fudan University, Shanghai, 200031, China. liang.zhou@fdeent.org.
  • Jiashuo Shi
  • Xinyu Zhang
    Wenzhou Medical University Renji College, Wenzhou, Zhejiang, China.