Mind the Remainder: Taylor's Theorem View on Recurrent Neural Networks.

Journal: IEEE transactions on neural networks and learning systems

Published Date: Apr 4, 2022

Abstract

Recurrent neural networks (RNNs) have gained tremendous popularity in almost every sequence modeling task. Despite the effort, these kinds of discrete unstructured data, such as texts, audio, and videos, are still difficult to be embedded in the feature space. Studies in improving the neural networks have accelerated since the introduction of more complex or deeper architectures. The improvements of previous methods are highly dependent on the model at the expense of huge computational sources. However, few of them pay attention to the algorithm. In this article, we bridge the Taylor series with the construction of RNN. Training RNN can be considered as a parameter estimate for the Taylor series. However, we found that there is a discrete term called the remainder in the finite Taylor series that cannot be optimized using gradient descent, which is part of the reason for the truncation error and the model falling into the local optimal solution. To address this, we propose a training algorithm that estimates the range of remainder and introduces the remainder obtained by sampling in this continuous space into the RNN to assist in optimizing the parameters. Notably, the performance of RNN can be improved without changing the RNN architecture in the testing phase. We demonstrate that our approach is able to achieve state-of-the-art performance in action recognition and cross-modal retrieval tasks.

Authors

Xiang Guan
Yang Yang

Department of Gastrointestinal Surgery, The Third Hospital of Hebei Medical University, Shijiazhuang, China.
Jingjing Li

Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, 430074, China.
Xing Xu
Heng Tao Shen

Center for Future Media, School of Computer Science & Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China; Sichuan Artificial Intelligence Research Institute, Yibin 644000, China. Electronic address: shenhengtao@uestc.edu.cn.

Keywords

Algorithms Neural Networks, Computer

External Resources

View on PubMed Access via DOI PubMed (33444144)

Mind the Remainder: Taylor's Theorem View on Recurrent Neural Networks.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals

Mind the Remainder: Taylor's Theorem View on Recurrent Neural Networks.

Abstract

Authors

Keywords

External Resources

Stay Ahead of Medical AI

Popular Topics

Recent Journals