Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning Paper • 2601.06943 • Published 5 days ago • 201
Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation Paper • 2512.13495 • Published Dec 15, 2025 • 11
VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models Paper • 2511.11007 • Published Nov 14, 2025 • 15
Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow Paper • 2509.21789 • Published Sep 26, 2025 • 9
SVFR Demo ✨ Unified Framework for Generalized Video Face Restoration • 186