Lower dimensional kernels for video discriminators.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

This work presents an analysis of the discriminators used in Generative Adversarial Networks (GANs) for Video. We show that unconstrained video discriminator architectures induce a loss surface with high curvature which make optimization difficult. We also show that this curvature becomes more extreme as the maximal kernel dimension of video discriminators increases. With these observations in hand, we propose a methodology for the design of a family of efficient Lower-Dimensional Video Discriminators for GANs (LDVD-GANs). The proposed methodology improves the performance and efficiency of video GAN models it is applied to and demonstrates good performance on complex and diverse datasets such as UCF-101. In particular, we show that LDVDs can double the performance of Temporal-GANs and provide for state-of-the-art performance on a single GPU using the proposed methodology.

Authors

  • Emmanuel Kahembwe
    Robust Autonomy and Decisions Group, The School of Informatics, The University of Edinburgh, 10 Crichton St, Edinburgh EH8 9AB, United Kingdom; The Edinburgh Centre of Robotics, The University of Edinburgh's Bayes Centre, 47 Potterrow, Edinburgh EH8 9BT, United Kingdom; The School of Engineering and Physical Sciences, The Robotarium, Heriot-Watt University, Edinburgh, EH14 4AS, United Kingdom. Electronic address: e.kahembwe@ed.ac.uk.
  • Subramanian Ramamoorthy
    Robust Autonomy and Decisions Group, The School of Informatics, The University of Edinburgh, 10 Crichton St, Edinburgh EH8 9AB, United Kingdom; The Edinburgh Centre of Robotics, The University of Edinburgh's Bayes Centre, 47 Potterrow, Edinburgh EH8 9BT, United Kingdom; FiveAI, 5th Floor, Greenside, 12 Blenheim Place, Edinburgh, EH7 5JH, United Kingdom.