Is There a Better Source Distribution than Gaussian? Exploring Source Distributions for Image Flow Matching Paper • 2512.18184 • Published 13 days ago • 20
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 6 items • Updated 2 days ago • 108
Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures Paper • 2505.09343 • Published May 14, 2025 • 73
Toward a Better Understanding of Fourier Neural Operators: Analysis and Improvement from a Spectral Perspective Paper • 2404.07200 • Published Apr 10, 2024 • 2
Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion Paper • 2410.19324 • Published Oct 25, 2024 • 3
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face +3 Jul 29, 2025 • 206
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference Paper • 2508.02193 • Published Aug 4, 2025 • 133
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training Paper • 2508.00414 • Published Aug 1, 2025 • 93
The Well Collection A 15TB collection of physics simulation datasets. • 18 items • Updated Mar 24, 2025 • 41
100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models Paper • 2505.00551 • Published May 1, 2025 • 36
REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers Paper • 2504.10483 • Published Apr 14, 2025 • 21
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model Paper • 2504.08685 • Published Apr 11, 2025 • 130
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Paper • 2403.03206 • Published Mar 5, 2024 • 71