The -MDA: An Invariant to Shifting, Scaling, and Rotating Variance for 3D Object Recognition Using Diffractive Deep Neural Network.

Journal: Sensors (Basel, Switzerland)

PMID: 36298105

Abstract

The diffractive deep neural network (DNN) can efficiently accomplish 2D object recognition based on rapid optical manipulation. Moreover, the multiple-view DNN array (MDA) possesses the obvious advantage of being able to effectively achieve 3D object classification. At present, 3D target recognition should be performed in a high-speed and dynamic way. It should be invariant to the typical shifting, scaling, and rotating variance of targets in relatively complicated circumstances, which remains a shortcoming of optical neural network architectures. In order to efficiently recognize 3D targets based on the developed DNN, a more robust MDA (-MDA) is proposed in this paper. Through utilizing a new training strategy to tackle several random disturbances introduced into the optical neural network system, a trained -MDA model constructed by us was numerically verified, demonstrating that the training strategy is able to dynamically recognize 3D objects in a relatively stable way.

Authors

Liang Zhou

Department of Otorhinolaryngology, Eye & ENT Hospital, Fudan University, Shanghai, 200031, China. liang.zhou@fdeent.org.
Jiashuo Shi
Xinyu Zhang

Wenzhou Medical University Renji College, Wenzhou, Zhejiang, China.

Keywords

Neural Networks, Computer Pattern Recognition, Visual Recognition, Psychology Visual Perception

External Resources

View on PubMed Access via DOI PubMed (36298105)

The -MDA: An Invariant to Shifting, Scaling, and Rotating Variance for 3D Object Recognition Using Diffractive Deep Neural Network.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals