Se Young Yun

On the Estimation of Linear Softmax Parametrized Markov Chains

In reinforcement learning and deep learning, softmax parameterization is commonly used to represent discrete probability distributions.In this work, we study three possible softmax …

kunwoo-na

• Jun 26, 2024 • 1 min read

Bandits

A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits

We present a unified likelihood ratio-based confidence sequence (CS) for any (self-concordant) generalized linear model (GLM) that is guaranteed to be convex and numerically tight. …

Junghyun Lee

• Jun 19, 2024 • 1 min read

Bandits

Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set Conversion

Logistic bandit is a ubiquitous framework of modeling users' choices, e.g., click vs. no click for advertisement recommender system. We observe that the prior works overlook or …

Junghyun Lee

• Jan 20, 2024 • 1 min read

Empirical Analyses of Corruption in the Clustering of Block MDPs

We show that a simple trick of randomly corrupting the trajectories in Block MDPs allow for us to use the the clustering algorithm proposed of Jedra et al. (2023) for general …

Junghyun Lee

• Jan 19, 2024 • 1 min read

On the Estimation of Linear Softmax Parametrized Probability Distributions

Linear softmax parametrization (LSP) of a discrete probability distribution is ubiquitous in many areas, such as deep learning, RL, NLP, and social choice models. Instead of trying …

murad-aghazada

• Dec 20, 2023 • 1 min read

Fairness

Fair Streaming Principal Component Analysis: Statistical and Algorithmic Viewpoint

Proposes a framework for performing fair PCA in memory limited, streaming setting. Sample complexity results and empirical discussions show the superiority of our approach compared …

Junghyun Lee

• Dec 10, 2023 • 1 min read

Bandits

Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex Networks

A novel problem setting where heterogeneous multi-agent bandits collaborate over a network to minimize their group regret. To deal with the high communication complexity of the …

Junghyun Lee

• Oct 30, 2023 • 1 min read

Nearly Optimal Latent State Decoding in Block MDPs

First theoretical analysis of model estimation and reward-free RL of block MDP, without resorting to function approximation frameworks. Lower bounds and algorithms with …

yassir-jedra

• Apr 27, 2023 • 1 min read

Preliminary Empirical Analyses of Clustering in Block MDPs

We empirically validate the clustering algorithm proposed in (Jedra et al., 2022).

Junghyun Lee

• Dec 20, 2022 • 1 min read

A Statistical Analysis of Stochastic Gradient Noises for GNNs

Inspired from (Wang et al., ICLR'22), we provide a preliminary statistical analysis of stochastic gradient noises (SGNs) of GIN and GCN in Cora node classification task.

Junghyun Lee

• Jun 30, 2022 • 1 min read

No results found

Se Young Yun