Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper • 2512.04677 • Published 24 days ago • 168
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data Paper • 2505.18445 • Published May 24 • 63
Human101: Training 100+FPS Human Gaussians in 100s from 1 View Paper • 2312.15258 • Published Dec 23, 2023 • 10
One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications Paper • 2312.16145 • Published Dec 26, 2023 • 10
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos Paper • 2312.15770 • Published Dec 25, 2023 • 15
Audiobox: Unified Audio Generation with Natural Language Prompts Paper • 2312.15821 • Published Dec 25, 2023 • 17
Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4 Paper • 2312.16171 • Published Dec 26, 2023 • 37
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling Paper • 2312.15166 • Published Dec 23, 2023 • 60
Generative AI Beyond LLMs: System Implications of Multi-Modal Generation Paper • 2312.14385 • Published Dec 22, 2023 • 7
MACS: Mass Conditioned 3D Hand and Object Motion Synthesis Paper • 2312.14929 • Published Dec 22, 2023 • 6
ZeroShape: Regression-based Zero-shot Shape Reconstruction Paper • 2312.14198 • Published Dec 21, 2023 • 9
PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar Paper • 2312.14239 • Published Dec 21, 2023 • 12
Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning Paper • 2312.14878 • Published Dec 22, 2023 • 15