Lei Wang
demolei
AI & ML interests
LLMs
Recent Activity
upvoted a paper about 15 hours ago
Learning to Build the Environment: Self-Evolving Reasoning RL via Verifiable Environment Synthesis
upvoted a paper about 15 hours ago
Self-Distilled Agentic Reinforcement Learning
upvoted a paper 3 days ago
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards