Haiwen Diao

Paranioar

https://Paranioar.github.io/

AI & ML interests

Vision-and-Language, Parameter-efficient Transfer Learning, Multi-modal Large Language Model

Recent Activity

authored a paper 1 day ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 5 days ago • 60

commented on a paper 4 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 5 days ago • 60

upvoted a paper 4 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 5 days ago • 60

upvoted 2 papers 24 days ago

MultiShotMaster: A Controllable Multi-Shot Video Generation Framework

Paper • 2512.03041 • Published 25 days ago • 62

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 25 days ago • 235

upvoted a paper 25 days ago

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

Paper • 2511.20785 • Published Nov 25 • 163

liked a dataset 26 days ago

VLMEval/OpenVLMRecords

Updated Apr 8 • 3.2k • 12

upvoted 5 papers about 1 month ago

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published Nov 20 • 121

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20 • 91

upvoted a paper about 2 months ago

Uniform Discrete Diffusion with Metric Path for Video Generation

Paper • 2510.24717 • Published Oct 28 • 40

updated 6 models 2 months ago

Paranioar/NEO1_0-2B-PT

Image-Text-to-Text • 3B • Updated Oct 21 • 11 • 1

Paranioar/NEO1_0-2B-MT

Image-Text-to-Text • 3B • Updated Oct 21 • 11

Paranioar/NEO1_0-2B-SFT

Image-Text-to-Text • 3B • Updated Oct 21 • 145 • 8

Paranioar/NEO1_0-9B-PT

Image-Text-to-Text • 10B • Updated Oct 21 • 3

Paranioar/NEO1_0-9B-MT

Image-Text-to-Text • 10B • Updated Oct 21 • 15

Paranioar/NEO1_0-9B-SFT

Image-Text-to-Text • 10B • Updated Oct 21 • 69 • 5

authored a paper 2 months ago

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Paper • 2510.14979 • Published Oct 16 • 66
