Information Theory

Instance-Optimal Estimation with Multiple LLM Judges on a Budget

Evaluating large language models increasingly relies on LLM-as-a-judge protocols, but such evaluations remain costly: different judges have different prices and reliabilities, and …

Junghyun Lee

• May 25, 2026 • 1 min read

Online Learning

Looking Through the Mirror: Minimax-Optimal Regularized Regrets in Online Learning and Bandits

We revisit regularized regret minimization under full-information and bandit feedback, where a learner optimizes an objective of the form $\langle r, \pi \rangle - \eta^{-1} …

Junghyun Lee

• May 21, 2026 • 1 min read

Markov Chain

Near-Optimal Clustering in Mixture of Markov Chains

We study the problem of clustering T trajectories of length H, each generated by one of K unknown ergodic Markov chains over a finite state space of size S. The goal is to …

Junghyun Lee

• May 2, 2026 • 1 min read

Nearly Optimal Latent State Decoding in Block MDPs

First theoretical analysis of model estimation and reward-free RL of block MDP, without resorting to function approximation frameworks. Lower bounds and algorithms with …

yassir-jedra

• Apr 27, 2023 • 1 min read

No results found

Information Theory

Instance-Optimal Estimation with Multiple LLM Judges on a Budget

Looking Through the Mirror: Minimax-Optimal Regularized Regrets in Online Learning and Bandits

Near-Optimal Clustering in Mixture of Markov Chains

Nearly Optimal Latent State Decoding in Block MDPs