InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion Paper • 2512.17504 • Published 13 days ago • 93
Infinite-Homography as Robust Conditioning for Camera-Controlled Video Generation Paper • 2512.17040 • Published 14 days ago • 27
Vector Prism: Animating Vector Graphics by Stratifying Semantic Structure Paper • 2512.14336 • Published 16 days ago • 28
EgoX: Egocentric Video Generation from a Single Exocentric Video Paper • 2512.08269 • Published 23 days ago • 115
Qwen Image Edit Camera Control 🎬 • Running on Zero • MCP • Featured • 1.7k • Fast 4 step inference with Qwen Image Edit 2509
PHUMA: Physically-Grounded Humanoid Locomotion Dataset Paper • 2510.26236 • Published Oct 30, 2025 • 29
ACG: Action Coherence Guidance for Flow-based VLA models Paper • 2510.22201 • Published Oct 25, 2025 • 36
EcoTTA: Memory-Efficient Continual Test-time Adaptation via Self-distilled Regularization Paper • 2303.01904 • Published Mar 3, 2023
DesignLab: Designing Slides Through Iterative Detection and Correction Paper • 2507.17202 • Published Jul 23, 2025 • 50
Token Bottleneck: One Token to Remember Dynamics Paper • 2507.06543 • Published Jul 9, 2025 • 20
ProLIP Collection Official ProLIP weights, Probabilistic Language-Image Pre-Training (ICLR 2025) • 7 items • Updated Apr 18, 2025 • 10
deepseek-ai/DeepSeek-R1-Distill-Llama-8B Text Generation • 8B • Updated Feb 24, 2025 • 956k • 835