Training Neural Networks and Mean-field Langevin dynamics

Date: 
Friday, 28 August, 2020 - 15:00 - 16:00
Venue: 
https://cuhk.zoom.us/j/92775210812
Seminar Type: 
MATH-IMS Joint Applied Mathematics Colloquium Series
Speaker Name: 
Prof. Zhenjie REN
Affiliation: 
CEREMADE, Université Paris-Dauphine
Abstract: 

The neural networks have become an extremely useful tool in various applications such as statistical learning and sampling. The empirical success urges a theoretical investigation based on mathematical models. Recently it has become popular to treat the training of the neural networks as an optimization on the space of probability measures. In this talk we show that the optimizer of such optimization can be approximated using the so-called mean-field Langevin dynamics. This theory sheds light on the efficiency of the (stochastic) gradient descent algorithm for training the neural networks. Based on the theory, we also propose a new algorithm for training the generative adversarial networks (GAN), and test it to produce sampling of simple probability distributions.