OSI Lab Seminars

[previously] The ``RLHF Study Group’’ has been dissolved on 2025.01.05.

[previously] OSI Lab has divided the global seminar into several divisions based on the topics. I was in charge of the Theory division til 2024.06.11, when it dissolved and converted to ``RLHF Study Group.''

[previously] OSI Lab (led by Prof. Se-Young Yun) holds a weekly seminar where each of 2 members, whose orders are assigned based on a fixed circular order, presents a paper of his/her own interest. The seminar is called off only when it overlaps with some major conference or exam period.

Bandits 101: A Maximally Non-technical Tutorial

Event OSI Lab Seminar Series Short summary In this (rather sudden) seminar, I will give a maximally non-technical tutorial on bandits, with emphasis on diferent variants and their …

Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex Networks

Event Weekly OSI Lab Seminar Short summary In this seminar, I will talk my recent work on proposing a new, efficient network protocol for networked, multi-agent bandits with …

Conference Day - Theory Division

Event OSI Lab Conference Day Short summary In this talk, as the manager of the Theory seminar, I will summarize all the papers covered by the Theory seminar for the last semester, …

Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set Conversion

Event Weekly OSI Lab Seminar Short summary In this seminar, I will talk my recent work on the new, state-of-the-art regret bound for (multinomial) logistic bandits, and the …

Introduction to Reinforcement Learning with Human Feedback (RLHF): A Theoretically Biased Overview

Event Weekly OSI Lab Seminar Short summary In this talk, I will first (somewhat rigorously) introduce the framework of reinforcement learning with human feedback (RLHF). Then I …

Community Detection in Block Models: From SBMs to Block Markov Chains

Event Weekly OSI Lab Seminar Short summary In this seminar, I will first give a brief yet comprehensive overview of the seminal results on community detection in stochastic block …

A Primer on (Combinatorial Semi-) Bandits

Event Weekly OSI Lab Seminar Short summary In this seminar, I start by introducing the problem of bandits, overall proof techniques for the fundamental lower bounds, and a brief …

Fair Streaming Principal Component Analysis: Statistical and Algorithmic Viewpoint

Event Weekly OSI Lab Seminar Short summary In this seminar, I will talk my recent work (submitted to NeurIPS) on a streaming variant of fair PCA. Abstract Fair Principal Component …

Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPs

Event Weekly OSI Lab Seminar Short summary In this seminar, I will talk about the paper “Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPs” …

Exact Dynamics of Stochastic Gradient Descent in High Dimensions and Volterra (Integral) Equations

Event Weekly OSI Lab Seminar Short summary In this seminar, I will about two recent papers on the high-dimensional limit of SGD and their statistical theories, which were recently …