Trained ExpRL checkpoints. Paper link: https://arxiv.org/abs/2606.17024
Violet Xiang PRO
violetxi
AI & ML interests
None yet
Recent Activity
published a model about 11 hours ago
violetxi/qwen35-4b-terminal-action-clean-lora-r16 published a model about 11 hours ago
violetxi/qwen35-4b-terminal-mixed-think-lora-r16 published a model about 11 hours ago
violetxi/qwen35-4b-terminal-combo-lora-rl