venti1217

AI & ML interests

None yet

Organizations

None yet

commented a paper 3 months ago

SightSound-R1: Cross-Modal Reasoning Distillation from Vision to Audio Language Models

Paper • 2509.15661 • Published Sep 19, 2025