On the Estimation of Linear Softmax Parametrized Markov Chains
Kunwoo Na, Junghyun Lee, Se-Young Yun
June, 2024
Abstract
In reinforcement learning and deep learning, softmax parameterization is commonly used to represent discrete probability distributions.In this work, we study three possible softmax parametrizations of the transition matrix of the Markov chain. Through theoretical and empirical lenses, we provide several insights into the effect of such parametrizations on estimating the Markov transition matrix.
Publication
In Korea Computer Congress
PhD Student
PhD student at GSAI, KAIST, jointly advised by Profs. Se-Young Yun and Chulhee Yun. Research focuses on interactive machine learning, particularly at the intersection of RLHF and preference learning, and statistical analyses of large networks, with an emphasis on community detection. Broadly interested in mathematical and theoretical AI and related problems in mathematics.