Ryan A. Rossi

Learning to Reason in LLMs by Expectation Maximization featured image

Learning to Reason in LLMs by Expectation Maximization

Large language models (LLMs) solve reasoning problems by first generating a rationale and then answering. We formalize reasoning as a latent variable model and derive a …

avatar
Junghyun Lee