shipeng luo
luoagent
·
AI & ML interests
ML AI
Recent Activity
liked a model 20 days ago
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled upvoted an article about 1 month ago
使用 DPO 微调 Llama 2 upvoted a paper about 1 month ago
Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video ModelsOrganizations
None yet