Junghyun Lee
Junghyun Lee
Home
Experiences
Publications
Seminars
Organizer
Korean AI Theory Community Workshop
SNU-KAIST ML/AI Theory Workshop
Machine/Deep Learning Theory + Physics Seminar
Contact
Light
Dark
Automatic
RLHF
Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set Conversion
Event Weekly OptiML Lab Group Meeting Short summary In this seminar, I will talk about my own paper “Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set Conversion” (Lee et al.
Jun 18, 2024
Introduction to Reinforcement Learning with Human Feedback (RLHF): A Theoretically Biased Overview
Event Weekly OptiML Lab Group Meeting Short summary In this talk, I will first (somewhat rigorously) introduce the framework of reinforcement learning with human feedback (RLHF). Then I will go over three recent breakthroughs in the analysis and improvement of RLHF.
Nov 30, 2023
Regularized Online RLHF with Generalized Bilinear Preferences
We consider the problem of contextual online RLHF with general preferences, where the goal is to identify the Nash Equilibrium. We …
Junghyun Lee
,
Minju Hong
,
Kwang-Sung Jun
,
Chulhee Yun
,
Se-Young Yun
Cite
Cite
×