-
alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-400
2B • Updated • 14 -
alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-1000
2B • Updated • 9 -
alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-200
2B • Updated • 7 -
alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-50
2B • Updated • 13
alphaXiv PRO
alphaXiv
AI & ML interests
None yet
Recent Activity
updated
a collection
2 days ago
spurious-rewards
updated
a dataset
2 days ago
alphaXiv/spurious-rewards-data
published
a dataset
2 days ago
alphaXiv/spurious-rewards-data
Organizations
None yet