Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
3
alphaXiv
PRO
alphaXiv
Follow
rodolphogurgel's profile picture
robaray's profile picture
mmahdi-sz's profile picture
10 followers
·
0 following
AI & ML interests
None yet
Recent Activity
updated
a collection
1 day ago
spurious-rewards
updated
a dataset
1 day ago
alphaXiv/spurious-rewards-data
published
a dataset
1 day ago
alphaXiv/spurious-rewards-data
View all activity
Organizations
None yet
alphaXiv
's models
10
Sort: Recently updated
alphaXiv/spurious-rewards-reasoning-traces
Updated
1 day ago
alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-400
2B
•
Updated
7 days ago
•
14
alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-1000
2B
•
Updated
7 days ago
•
9
alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-200
2B
•
Updated
7 days ago
•
7
alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-50
2B
•
Updated
7 days ago
•
13
alphaXiv/Qwen-2.5-1.5b-instruct-ppo
2B
•
Updated
12 days ago
•
29
alphaXiv/Qwen-2.5-1.5b-instruct-grpo
2B
•
Updated
12 days ago
•
17
alphaXiv/trm-model-arc-agi-1
Updated
Oct 22, 2025
•
4
alphaXiv/trm-model-sudoku
Updated
Oct 22, 2025
•
3
alphaXiv/trm-model-maze
Updated
Oct 22, 2025
•
5