1 54 2

YanxingLiu

lyx98

YanxingLiu

AI & ML interests

Computer Vision

Recent Activity

upvoted a paper 3 days ago

Elucidating the SNR-t Bias of Diffusion Probabilistic Models

upvoted a paper 6 days ago

LeapAlign: Post-Training Flow Matching Models at Any Generation Step by Building Two-Step Trajectories

upvoted a paper 10 days ago

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

View all activity

Organizations

None yet

upvoted a paper 3 days ago

Elucidating the SNR-t Bias of Diffusion Probabilistic Models

Paper • 2604.16044 • Published 7 days ago • 72

upvoted a paper 6 days ago

LeapAlign: Post-Training Flow Matching Models at Any Generation Step by Building Two-Step Trajectories

Paper • 2604.15311 • Published 8 days ago • 12

upvoted a paper 10 days ago

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

Paper • 2604.11804 • Published 11 days ago • 70

upvoted 2 papers about 2 months ago

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Paper • 2602.24286 • Published Feb 27 • 98

DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference

Paper • 2602.21548 • Published Feb 25 • 50

upvoted a paper 2 months ago

Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception

Paper • 2602.11858 • Published Feb 12 • 63

upvoted 2 papers 3 months ago

ERNIE 5.0 Technical Report

Paper • 2602.04705 • Published Feb 4 • 268

CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding

Paper • 2602.01785 • Published Feb 2 • 96

updated a collection 4 months ago

InternVL3_5_Flash-HF

Collection

3 items • Updated Dec 14, 2025

updated 3 models 4 months ago

updated a collection 4 months ago

InternVL3_5_Flash-HF

Collection

3 items • Updated Dec 14, 2025

published 3 models 4 months ago

lyx98/InternVL3_5_Flash-4B-HF

Image-Text-to-Text • 5B • Updated Dec 14, 2025 • 5

lyx98/InternVL3_5_Flash-2B-HF

Image-Text-to-Text • 2B • Updated Dec 14, 2025 • 6

lyx98/InternVL3_5_Flash-1B-HF

Image-Text-to-Text • 1.0B • Updated Dec 14, 2025 • 27

upvoted 3 papers 5 months ago

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 245

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published Nov 12, 2025 • 215

DeepEyesV2: Toward Agentic Multimodal Model

Paper • 2511.05271 • Published Nov 7, 2025 • 46

upvoted a paper 7 months ago

Visual Programmability: A Guide for Code-as-Thought in Chart Understanding

Paper • 2509.09286 • Published Sep 11, 2025 • 11

YanxingLiu

AI & ML interests

Recent Activity

Organizations

lyx98's activity