Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Michal Valko's picture
Open to Collab
2 2 1

Michal Valko

misovalko
kashif's profile picture hugolb's profile picture Doge-GPT's profile picture
·
https://misovalko.github.io/
  • misovalko
  • misovalko
  • michalvalko
  • misovalko.bsky.social

AI & ML interests

large language models, reasoning, fine-tuning, test-time computation, reinforcement learning with human feedback, world models

Recent Activity

upvoted a paper 2 days ago
A General Theoretical Paradigm to Understand Learning from Human Preferences
authored a paper 2 days ago
Optimal Design for Reward Modeling in RLHF
authored a paper 2 days ago
Sharp Deviations Bounds for Dirichlet Weighted Sums with Application to analysis of Bayesian algorithms
View all activity

Organizations

Centre Inria de l'Université de Rennes's profile picture huggingPartyParis's profile picture Institut National de Recherche en Informatique et en Automatique's profile picture Paris AI Running Club's profile picture Hugging Face Discord Community's profile picture Hugging Face MCP Course's profile picture

Papers 82

arxiv:2505.19731
arxiv:2503.19612
arxiv:2410.17055
arxiv:2410.12138

models 0

None public yet

datasets 1

misovalko/my-research-papers

Updated 3 days ago • 9
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs