Mode combinability: Exploring convex combinations of permutation aligned models.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

We explore element-wise convex combinations of two permutation-aligned neural network parameter vectors Θ and Θ of size d. We conduct extensive experiments by examining various distributions of such model combinations parametrized by elements of the hypercube [0,1] and its vicinity. Our findings reveal that broad regions of the hypercube form surfaces of low loss values, indicating that the notion of linear mode connectivity extends to a more general phenomenon which we call mode combinability. We also make several novel observations regarding linear mode connectivity and model re-basin. We demonstrate a transitivity property: two models re-based to a common third model are also linear mode connected, and a robustness property: even with significant perturbations of the neuron matchings the resulting combinations continue to form a working model. Moreover, we analyze the functional and weight similarity of model combinations and show that such combinations are non-vacuous in the sense that there are significant functional differences between the resulting models.

Authors

  • Adrián Csiszárik
    HUN-REN Alfréd Rényi Institute of Mathematics, Reáltanoda utca 13-15., Budapest, 1053, Hungary; Eötvös Loránd University, Pázmány Péter sétány 1/C, Budapest, 1117, Hungary. Electronic address: csadrian@renyi.hu.
  • Melinda F Kiss
    HUN-REN Alfréd Rényi Institute of Mathematics, Reáltanoda utca 13-15., Budapest, 1053, Hungary; Eötvös Loránd University, Pázmány Péter sétány 1/C, Budapest, 1117, Hungary. Electronic address: mfkiss@renyi.hu.
  • Péter Kőrösi-Szabó
    HUN-REN Alfréd Rényi Institute of Mathematics, Reáltanoda utca 13-15., Budapest, 1053, Hungary. Electronic address: koszpe@renyi.hu.
  • Márton Muntag
    HUN-REN Alfréd Rényi Institute of Mathematics, Reáltanoda utca 13-15., Budapest, 1053, Hungary. Electronic address: muntag@renyi.hu.
  • Gergely Papp
    EMBL Grenoble, 71 Avenue des Martyrs, 38042 Grenoble, France.
  • Dániel Varga
    HUN-REN Alfréd Rényi Institute of Mathematics, Reáltanoda utca 13-15., Budapest, 1053, Hungary. Electronic address: daniel@renyi.hu.