Michal Valko's picture

Open to Collab

2 2 1

Michal Valko

misovalko

·

https://misovalko.github.io/

AI & ML interests

large language models, reasoning, fine-tuning, test-time computation, reinforcement learning with human feedback, world models

Recent Activity

upvoted a paper 2 days ago

A General Theoretical Paradigm to Understand Learning from Human Preferences

authored a paper 2 days ago

Optimal Design for Reward Modeling in RLHF

authored a paper 2 days ago

Sharp Deviations Bounds for Dirichlet Weighted Sums with Application to analysis of Bayesian algorithms

View all activity

Organizations

Papers 82

arxiv:2505.19731

arxiv:2503.19612

arxiv:2410.17055

arxiv:2410.12138

models 0

None public yet

datasets 1

misovalko/my-research-papers

Updated 3 days ago • 9