[ICLR 2026] Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models
Zengbin Wang
MuMing0102
·
AI & ML interests
Agentic AI, Multimodal LLM, Computer Vision
Recent Activity
authored a paper about 19 hours ago
Visually-Guided Policy Optimization for Multimodal Reasoning updated a collection 1 day ago
SpatialGenEval updated a collection 1 day ago
VGPO-RLOrganizations
None yet