- Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control
  Paper • 2506.01943 • Published • 25
- LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks
  Paper • 2506.00411 • Published • 31
- SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics
  Paper • 2506.01844 • Published • 146
Ron Zhu
RzZ
AI & ML interests
None yet
Recent Activity
Updated a collection 2 days ago: VLM
Updated a collection 6 months ago: VLM
Updated a collection 6 months ago: VLM
Organizations
None yet