(Statistically/Theoretically) Principled Approaches to LLM Reasoning

Junghyun Lee

Oct 27, 2020 project

Adaptive Appraoches to Improving Sample Efficiency of LLM Reasoning

TBD

My Adobe internship

AdaSTaR: Adaptive Data Sampling for Training Self-Taught Reasoners

arXiv preprint arXiv:2505.16322
Led by Woosung Koh (Yonsei Univ.) along with Wonbeen Oh, Jaein Jang, MinHyung Lee, Hyeongjin Kim, Ah Yeon Kim (Yonsei Univ.). Also joint work with Joonkee Kim & Taehyeon Kim (LG AI Research) and Se-Young Yun (KAIST AI).

LLM RL Statistics Reasoning