RLVR-World - a thuml Collection

thuml's Collections

RLVR-World

updated May 26, 2025

RLVR-World: Training World Models with Reinforcement Learning
Paper • 2505.13934 • Published May 20, 2025 • 16

thuml/rt1-frame-tokenizer
Updated May 22, 2025 • 20

thuml/rt1-world-model-single-step-base
0.1B • Updated May 22, 2025 • 17

thuml/rt1-world-model-single-step-rlvr
Updated May 26, 2025 • 5

thuml/rt1-compressive-tokenizer
Updated May 22, 2025 • 17

thuml/rt1-world-model-multi-step-base
0.1B • Updated May 22, 2025 • 130

thuml/rt1-world-model-multi-step-rlvr
Updated May 26, 2025 • 4

thuml/webarena-world-model-cot
Viewer • Updated May 26, 2025 • 6.41k • 134

thuml/webarena-world-model-sft
2B • Updated May 26, 2025 • 8

thuml/webarena-world-model-rlvr
2B • Updated May 26, 2025 • 4

thuml/bytesized32-world-model-cot
Viewer • Updated May 26, 2025 • 304k • 58 • 3

thuml/bytesized32-world-model-sft
2B • Updated May 26, 2025 • 7

thuml/bytesized32-world-model-rlvr-binary-reward
2B • Updated May 26, 2025 • 5

thuml/bytesized32-world-model-rlvr-task-specific-reward
2B • Updated May 26, 2025 • 5
