arxiv:2510.15862
Jiuqi Wang
LeonardoWjq
AI & ML interests
reinforcement learning
Recent Activity
upvoted
a
paper
about 2 months ago
PokeeResearch: Effective Deep Research via Reinforcement Learning from
AI Feedback and Robust Reasoning Scaffold
authored
a paper
2 months ago
PokeeResearch: Effective Deep Research via Reinforcement Learning from
AI Feedback and Robust Reasoning Scaffold
new activity
about 1 year ago
stable-diffusion-v1-5/stable-diffusion-v1-5:Update README.md