Statistical Problems Related to (LLM) Alignment and Preference Learning
Project #1. Bandit/Statistical Problems Related to Reward-Based RLHF
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits
- Accepted to NeurIPS 2024.
- Accepted as an oral presentation to the ICML 2024 Workshop on Aligning Reinforcement Learning Experimentalists and Theorists (ARLET).
- Joint work with Se-Young Yun (KAIST AI) and Kwang-Sung Jun (Univ. of Arizona CS).
Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set Conversion
- Accepted to AISTATS 2024.
- Joint work with Se-Young Yun (KAIST AI) and Kwang-Sung Jun (Univ. of Arizona CS).
Project #2. Bandit/Statistical Problems Related to General Preference Learning
GL-LowPopArt: A Nearly Instance-Wise Minimax-Optimal Estimator for Generalized Low-Rank Trace Regression
- Accepted to ICML 2025 (Spotlight).
- Joint work with Kyoungseok Jang (CAU AI), Kwang-Sung Jun (Univ. of Arizona CS), Milan Vojnović (LSE Stat), and Se-Young Yun (KAIST AI).