One paper accepted to NeurIPS 2023 Workshop on Mathematics of Modern Machine Learning (M3L) as an *oral presentation*!
Large Catapults in Momentum Gradient Descent with Warmup: An Empirical Study
One paper (Large Catapults in Momentum Gradient Descent with Warmup: An Empirical Study) has been accepted to the NeurIPS 2023 Workshop on Mathematics of Modern Machine Learning (M3L) as an oral presentation! This is joint work with the wonderful undergraduate intern Prin Phunyaphibarn (KAIST Math, equal contribution), my advisor Chulhee Yun (KAIST AI), and collaborators Bohan Wang (USTC) and Huishuai Zhang (Microsoft Research Asia - Theory Centre).
I’ll be attending and presenting in person. See you all in New Orleans, USA!