CASA: Cross-Attention via Self-Attention for Efficient Vision-Language Fusion
Paper
•
2512.19535
•
Published
•
9
None defined yet.
Image Diffusion Preview with Consistency Solver
The FACTS Leaderboard: A Comprehensive Benchmark for Large Language Model Factuality