Theoretical Analyses of Reinforcement Learning with Human Feedback (RLHF) and Related Problems
Logistic and Generalized Linear Bandits, Dueling Bandits, etc.
Project #2. A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits
- Accepted to NeurIPS 2024.
- Accepted to ICML 2024 Workshop on Aligning Reinforcement Learning Experimentalists and Theorists (ARLET) as oral.
- Joint work with Se-Young Yun (KAIST AI) and Kwang-Sung Jun (Univ. of Arizona CS).
Project #1. Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set Conversion
- Accepted to AISTATS 2024.
- Joint work with Se-Young Yun (KAIST AI) and Kwang-Sung Jun (Univ. of Arizona CS).
``General’’ Theoretical Questions in RLHF

Junghyun Lee
PhD Student
PhD student at GSAI, KAIST, jointly advised by Profs. Se-Young Yun and Chulhee Yun. Research focuses on interactive machine learning, particularly at the intersection of RLHF and preference learning, and statistical analyses of large networks, with an emphasis on community detection. Broadly interested in mathematical and theoretical AI and related problems in mathematics.
Posts
After AISTATS 2024, I’ll be at Universitat Pompeu Fabra to give a talk at a mini-workshop on RL theory, hosted by Prof. Gergely Neu. I’ll be presenting my AISTATS 2024 paper (Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set Conversion).
Junghyun Lee
May 6, 2024
1 min read
Publications
We present GL-LowPopArt, a novel Catoni-style estimator for generalized linear low-rank trace regression. Building on LowPopArt (Jang …
Junghyun Lee, Kyoungseok Jang, Kwang-Sung Jun, Milan Vojnović, Se-Young Yun
We present a unified likelihood ratio-based confidence sequence (CS) for any (self-concordant) generalized linear model (GLM) that is …
Junghyun Lee, Se-Young Yun, Kwang-Sung Jun