Se Young Yun

On the Estimation of Linear Softmax Parametrized Markov Chains featured image

On the Estimation of Linear Softmax Parametrized Markov Chains

In reinforcement learning and deep learning, softmax parameterization is commonly used to represent discrete probability distributions.In this work, we study three possible softmax …

kunwoo-na
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits featured image

A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits

We present a unified likelihood ratio-based confidence sequence (CS) for any (self-concordant) generalized linear model (GLM) that is guaranteed to be convex and numerically tight. …

avatar
Junghyun Lee
Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set Conversion featured image

Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set Conversion

Logistic bandit is a ubiquitous framework of modeling users' choices, e.g., click vs. no click for advertisement recommender system. We observe that the prior works overlook or …

avatar
Junghyun Lee
Empirical Analyses of Corruption in the Clustering of Block MDPs featured image

Empirical Analyses of Corruption in the Clustering of Block MDPs

We show that a simple trick of randomly corrupting the trajectories in Block MDPs allow for us to use the the clustering algorithm proposed of Jedra et al. (2023) for general …

avatar
Junghyun Lee
On the Estimation of Linear Softmax Parametrized Probability Distributions featured image

On the Estimation of Linear Softmax Parametrized Probability Distributions

Linear softmax parametrization (LSP) of a discrete probability distribution is ubiquitous in many areas, such as deep learning, RL, NLP, and social choice models. Instead of trying …

murad-aghazada
Fair Streaming Principal Component Analysis: Statistical and Algorithmic Viewpoint featured image

Fair Streaming Principal Component Analysis: Statistical and Algorithmic Viewpoint

Proposes a framework for performing fair PCA in memory limited, streaming setting. Sample complexity results and empirical discussions show the superiority of our approach compared …

avatar
Junghyun Lee
Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex Networks featured image

Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex Networks

A novel problem setting where heterogeneous multi-agent bandits collaborate over a network to minimize their group regret. To deal with the high communication complexity of the …

avatar
Junghyun Lee
Nearly Optimal Latent State Decoding in Block MDPs featured image

Nearly Optimal Latent State Decoding in Block MDPs

First theoretical analysis of model estimation and reward-free RL of block MDP, without resorting to function approximation frameworks. Lower bounds and algorithms with …

yassir-jedra
Preliminary Empirical Analyses of Clustering in Block MDPs featured image

Preliminary Empirical Analyses of Clustering in Block MDPs

We empirically validate the clustering algorithm proposed in (Jedra et al., 2022).

avatar
Junghyun Lee
A Statistical Analysis of Stochastic Gradient Noises for GNNs featured image

A Statistical Analysis of Stochastic Gradient Noises for GNNs

Inspired from (Wang et al., ICLR'22), we provide a preliminary statistical analysis of stochastic gradient noises (SGNs) of GIN and GCN in Cora node classification task.

avatar
Junghyun Lee