Yuhang Wen
Necolizer
AI & ML interests
Agent RL, RL Post-train, LLM
Recent Activity
new activity
2 days ago
Quark-LLM/SSP: docs: update readme
new activity
about 2 months ago
Quark-LLM/SSP: feat: upload training and evaluation data
commented on a paper
2 months ago
Search Self-play: Pushing the Frontier of Agent Capability without Supervision