thedeoxen/FLUX.1-Kontext-dev-reference-depth-fusion-LORA Image-to-Image • Updated Aug 20, 2025 • 245 • 70
VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator Paper • 2510.13454 • Published Oct 15, 2025 • 8
Imaginarium: Vision-guided High-Quality 3D Scene Layout Generation Paper • 2510.15564 • Published Oct 17, 2025 • 10
Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery Paper • 2510.15869 • Published Oct 17, 2025 • 48
Kontinuous Kontext: Continuous Strength Control for Instruction-based Image Editing Paper • 2510.08532 • Published Oct 9, 2025 • 5
VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Models Paper • 2509.17985 • Published Sep 22, 2025 • 26 • 2
OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling Paper • 2509.12201 • Published Sep 15, 2025 • 106
SpatialVID: A Large-Scale Video Dataset with Spatial Annotations Paper • 2509.09676 • Published Sep 11, 2025 • 33
Matrix-3D: Omnidirectional Explorable 3D World Generation Paper • 2508.08086 • Published Aug 11, 2025 • 75
Omni-Effects: Unified and Spatially-Controllable Visual Effects Generation Paper • 2508.07981 • Published Aug 11, 2025 • 58
LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation Paper • 2508.03694 • Published Aug 5, 2025 • 51
Dens3R: A Foundation Model for 3D Geometry Prediction Paper • 2507.16290 • Published Jul 22, 2025 • 8