Se-Young Yun

Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set Conversion

Logistic bandit is a ubiquitous framework of modeling users' choices, e.g., click vs. no click for advertisement recommender system. We observe that the prior works overlook or …

Junghyun Lee

• Jan 20, 2024 • 1 min read

Fairness

Fair Streaming Principal Component Analysis: Statistical and Algorithmic Viewpoint

Proposes a framework for performing fair PCA in memory limited, streaming setting. Sample complexity results and empirical discussions show the superiority of our approach compared …

Junghyun Lee

• Dec 10, 2023 • 1 min read

Bandits

Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex Networks

A novel problem setting where heterogeneous multi-agent bandits collaborate over a network to minimize their group regret. To deal with the high communication complexity of the …

Junghyun Lee

• Oct 30, 2023 • 1 min read

Nearly Optimal Latent State Decoding in Block MDPs

First theoretical analysis of model estimation and reward-free RL of block MDP, without resorting to function approximation frameworks. Lower bounds and algorithms with …

yassir-jedra

• Apr 27, 2023 • 1 min read

No results found

Se-Young Yun

Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set Conversion

Fair Streaming Principal Component Analysis: Statistical and Algorithmic Viewpoint

Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex Networks

Nearly Optimal Latent State Decoding in Block MDPs