Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation Paper • 2510.08673 • Published Oct 9, 2025
Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting Paper • 2505.14059 • Published May 20, 2025