Memory-efficient Transformer-based network model for Traveling Salesman Problem.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

Combinatorial optimization problems such as Traveling Salesman Problem (TSP) have a wide range of real-world applications in transportation, logistics, manufacturing. It has always been a difficult problem to solve large-scale TSP problems quickly because of memory usage limitations. Recent research shows that the Transformer model is a promising approach. However, the Transformer has several severe problems that prevent it from quickly solving TSP combinatorial optimization problems, such as quadratic time complexity, especially quadratic space complexity, and the inherent limitations of the encoder and decoder itself. To address these issues, we developed a memory-efficient Transformer-based network model for TSP combinatorial optimization problems, termed Tspformer, with two distinctive characteristics: (1) a sampled scaled dot-product attention mechanism with O(Llog(L)) (L is the length of input sequences) time and space complexity, which is the most different between our work and other works. (2) due to the reduced space complexity, GPU/CPU memory usage is significantly reduced. Extensive experiments demonstrate that Tspformer significantly outperforms existing methods and provides a new solution to the TSP combinatorial optimization problems. Our Pytorch code will be publicly available on GitHub https://github.com/yhnju/tspFormer.

Authors

  • Hua Yang
  • Minghao Zhao
    College of Computer Science and Technology, Jilin University, Changchun 130012, China.
  • Lei Yuan
    Department of Pharmacy, Baodi People's Hospital, Tianjin, China.
  • Yang Yu
    Division of Cardiology, the Central Hospital of Wuhan, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China.
  • Zhenhua Li
    Affiliated Hospital of Changchun University of Chinese Medicine, 1035 Boshuo Road, Changchun, 130117, Jilin, China. 19390078790@163.com.
  • Ming Gu
    School of Software, Tsinghua University, Beijing, China.