Pillar-0: A New Frontier for Radiology Foundation Models Paper • 2511.17803 • Published Nov 21, 2025 • 20
Pillar-0 Collection A New Frontier for Radiology Foundation Models • 5 items • Updated Nov 20, 2025 • 9
Puzzled by Puzzles: When Vision-Language Models Can't Take a Hint Paper • 2505.23759 • Published May 29, 2025 • 5
Learning Adaptive Parallel Reasoning with Language Models Paper • 2504.15466 • Published Apr 21, 2025 • 44
Describe Anything: Detailed Localized Image and Video Captioning Paper • 2504.16072 • Published Apr 22, 2025 • 63
Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling Paper • 2504.13169 • Published Apr 17, 2025 • 39
Visual Haystacks Collection Official datasets and checkpoints of the paper -- Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark (ICLR 2025) • 4 items • Updated Apr 18, 2025 • 2
VisionArena: 230K Real World User-VLM Conversations with Preference Labels Paper • 2412.08687 • Published Dec 11, 2024 • 13
view article Article Are We Ready for Multi-Image Reasoning? Launching VHs: The Visual Haystacks Benchmark! Jul 23, 2024 • 3
CLAIR-A: Leveraging Large Language Models to Judge Audio Captions Paper • 2409.12962 • Published Sep 19, 2024 • 2