V-JEPA 2 Collection A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13 • 177
StreamingVLM: Real-Time Understanding for Infinite Video Streams Paper • 2510.09608 • Published Oct 10 • 50
ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published May 7 • 65
Running on Zero Featured 1.73k Dia 1.6B 👯 1.73k Generate realistic dialogue from a script, using Dia!
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated 19 days ago • 274k • 1.55k