PAGER: Bridging the Semantic-Execution Gap in Point-Precise Geometric GUI Control • Paper • 2605.15963 • Published 19 days ago • 17
Programming with Data: Test-Driven Data Engineering for Self-Improving LLMs from Raw Corpora • Paper • 2604.24819 • Published Apr 27 • 89
The Trinity of Consistency as a Defining Principle for General World Models • Paper • 2602.23152 • Published Feb 26 • 202
Decouple to Generalize: Context-First Self-Evolving Learning for Data-Scarce Vision-Language Reasoning • Paper • 2512.06835 • Published Dec 7, 2025 • 5
Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights • Paper • 2512.01816 • Published Dec 1, 2025 • 95
GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models • Paper • 2511.11134 • Published Nov 14, 2025 • 33