Theoretical Analyses of Reinforcement Learning with Human Feedback (RLHF)
Project #1. brainstorming…
Work in progress with Se-Young Yun, Chulhee Yun (KAIST AI), Kwang-Sung Jun (Univ. of Arizona CS), and Milan Vojnović (LSE Stat).
Work in progress with Se-Young Yun, Chulhee Yun (KAIST AI), Kwang-Sung Jun (Univ. of Arizona CS), and Milan Vojnović (LSE Stat).