Report

Cumulative Distribution Regret Minimization with Max- Quantile Threshold in Multi-Armed Bandit

We study a new risk-averse bandit setting motivated by semiconductor manufacturing, where the quality of a recipe is judged not by its mean performance but by its weakest outcomes. …

jaeyoung-cha
Preliminary Empirical Study of Low-Rank, Hierarchical Gaussian Linear Bandits featured image

Preliminary Empirical Study of Low-Rank, Hierarchical Gaussian Linear Bandits

Inspired by recent advances in multi-task bandits, we propose a new problem setting called low-rank, hierarchical Gaussian linear bandits, which combines low-rank structure with …

avatar
Junghyun Lee
On the Estimation of Linear Softmax Parametrized Markov Chains featured image

On the Estimation of Linear Softmax Parametrized Markov Chains

In reinforcement learning and deep learning, softmax parameterization is commonly used to represent discrete probability distributions.In this work, we study three possible softmax …

kunwoo-na
Empirical Analyses of Corruption in the Clustering of Block MDPs featured image

Empirical Analyses of Corruption in the Clustering of Block MDPs

We show that a simple trick of randomly corrupting the trajectories in Block MDPs allow for us to use the the clustering algorithm proposed of Jedra et al. (2023) for general …

avatar
Junghyun Lee
On the Estimation of Linear Softmax Parametrized Probability Distributions featured image

On the Estimation of Linear Softmax Parametrized Probability Distributions

Linear softmax parametrization (LSP) of a discrete probability distribution is ubiquitous in many areas, such as deep learning, RL, NLP, and social choice models. Instead of trying …

murad-aghazada
Preliminary Empirical Analyses of Clustering in Block MDPs featured image

Preliminary Empirical Analyses of Clustering in Block MDPs

We empirically validate the clustering algorithm proposed in (Jedra et al., 2022).

avatar
Junghyun Lee
A Statistical Analysis of Stochastic Gradient Noises for GNNs featured image

A Statistical Analysis of Stochastic Gradient Noises for GNNs

Inspired from (Wang et al., ICLR'22), we provide a preliminary statistical analysis of stochastic gradient noises (SGNs) of GIN and GCN in Cora node classification task.

avatar
Junghyun Lee