arxiv:2603.28342
Zixian Huang
njuhzx
AI & ML interests
None yet
Recent Activity
upvoted a paper 2 days ago
TIP: Token Importance in On-Policy Distillation upvoted a paper 2 days ago
Self-Distillation Zero: Self-Revision Turns Binary Rewards into Dense Supervision updated a dataset 4 days ago
CoopReason/TESSY-Code-80K