Junghyun Lee
Junghyun Lee
Home
Experiences
Publications
Projects
Posts
Seminars
Contact
Light
Dark
Automatic
Kwang-Sung Jun
Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set Conversion
Logistic bandit is a ubiquitous framework of modeling users’ choices, e.g., click vs. no click for advertisement recommender …
Junghyun Lee
,
Se-Young Yun
,
Kwang-Sung Jun
PDF
Cite
Project
Poster
Slides
Likelihood Loss-based Confidence Sequence
Aim to derive tight likelihood loss-based confidence sequence with time-uniform guarantees, with applications to sequential decision making and RL.
Junghyun Lee
Theoretical Analyses of Reinforcement Learning with Human Feedback (RLHF)
(tbd)
Junghyun Lee
Cite
×