Gradient Descent on Infinitely Wide Neural Networks

Oct 17, 2022 · 1 min read
seminars

Event

Weekly DL Theory & Stat Phy Seminar

Short summary

In this seminar, I will talk about the paper “Gradient Descent on Infinitely Wide Neural Networks: Global Convergence and Generalization” (Bach and Chizat, arXiv 2021) and the references therein.

Papers

Paper discussed in the seminar:

  • Main: Bach, Francis and Chizat, Lénaïc. Gradient Descent on Infinitely Wide Neural Networks: Global Convergence and Generalization. In arXiv 2021.
  • Chizat, Lénaïc and Bach, Francis. On the Global Convergence of Gradient Descent for Over-parametrized Models using Optimal Transport. In NeurIPS 2018.
Junghyun Lee
Authors
PhD Candidate in Artificial Intelligence
PhD candidate at KAIST AI, jointly advised by Se-Young Yun and Chulhee Yun. I work on interactive machine learning, theoretical aspects of LLMs, learning/optimization theory, and statistical analysis of large networks.