VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression? • Paper • 2512.15649 • Published 17 days ago • 6
MLLM-CL: Continual Learning for Multimodal Large Language Models • Paper • 2506.05453 • Published Jun 5, 2025 • 1
LongLive: Real-time Interactive Long Video Generation • Paper • 2509.22622 • Published Sep 26, 2025 • 184
Practical Continual Forgetting for Pre-trained Vision Models • Paper • 2501.09705 • Published Jan 16, 2025 • 1
MRS: A Fast Sampler for Mean Reverting Diffusion based on ODE and SDE Solvers • Paper • 2502.07856 • Published Feb 11, 2025 • 5
OpenSatMap: A Fine-grained High-resolution Satellite Dataset for Large-scale Map Construction • Paper • 2410.23278 • Published Oct 30, 2024 • 2
Llama 3.2 • Collection • This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 649