HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming Paper • 2512.21338 • Published 4 days ago • 16
HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming Paper • 2512.21338 • Published 4 days ago • 16
Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing Paper • 2512.17909 • Published 9 days ago • 36
Scaling Zero-Shot Reference-to-Video Generation Paper • 2512.06905 • Published 21 days ago • 28
AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition Paper • 2205.13535 • Published May 26, 2022
WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception Paper • 2508.15720 • Published Aug 21
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis Paper • 2402.16117 • Published Feb 25, 2024
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper • 2512.02014 • Published 27 days ago • 69
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper • 2512.02014 • Published 27 days ago • 69
PixelFlow: Pixel-Space Generative Models with Flow Paper • 2504.07963 • Published Apr 10 • 18 • 6