7 14 6

Patrick (Tsung-Han) Wu

tsunghanwu

https://patrickthwu.com/

AI & ML interests

Vision and Language

Recent Activity

upvoted a paper about 1 month ago

Pillar-0: A New Frontier for Radiology Foundation Models

upvoted a collection about 1 month ago

Pillar-0

updated a dataset about 1 month ago

tsunghanwu/HaloQuest

View all activity

Organizations

upvoted a paper about 1 month ago

Pillar-0: A New Frontier for Radiology Foundation Models

Paper • 2511.17803 • Published Nov 21, 2025 • 20

upvoted a collection about 1 month ago

Pillar-0

Collection

A New Frontier for Radiology Foundation Models • 5 items • Updated Nov 20, 2025 • 9

upvoted a paper 2 months ago

Are Large Reasoning Models Interruptible?

Paper • 2510.11713 • Published Oct 13, 2025 • 4

upvoted 3 papers 7 months ago

upvoted 3 papers 9 months ago

Learning Adaptive Parallel Reasoning with Language Models

Paper • 2504.15466 • Published Apr 21, 2025 • 44

Describe Anything: Detailed Localized Image and Video Captioning

Paper • 2504.16072 • Published Apr 22, 2025 • 63

Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling

Paper • 2504.13169 • Published Apr 17, 2025 • 39

upvoted a paper 10 months ago

TULIP: Towards Unified Language-Image Pretraining

Paper • 2503.15485 • Published Mar 19, 2025 • 49

upvoted a collection 11 months ago

Visual Haystacks

Collection

Official datasets and checkpoints of the paper -- Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark (ICLR 2025) • 4 items • Updated Apr 18, 2025 • 2

upvoted a paper about 1 year ago

VisionArena: 230K Real World User-VLM Conversations with Preference Labels

Paper • 2412.08687 • Published Dec 11, 2024 • 13

upvoted an article over 1 year ago

Article

Are We Ready for Multi-Image Reasoning? Launching VHs: The Visual Haystacks Benchmark!

Jul 23, 2024

•

upvoted a paper over 1 year ago

CLAIR-A: Leveraging Large Language Models to Judge Audio Captions

Paper • 2409.12962 • Published Sep 19, 2024 • 2

Patrick (Tsung-Han) Wu

AI & ML interests

Recent Activity

Organizations

tsunghanwu's activity

Are We Ready for Multi-Image Reasoning? Launching VHs: The Visual Haystacks Benchmark!