TencentARC/TimeLens-7B • Video-Text-to-Text • 8B
ARC focuses on computer vision, speech, and natural language processing, including speech and video generation, enhancement, retrieval, and understanding, as well as AutoML. Guided by research developments and industry trends, ARC pursues exploration, innovation, and breakthroughs in these technologies.
VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control
TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs