One-Minute Video Generation with Test-Time Training Paper • 2504.05298 • Published Apr 7, 2025 • 110 • 7
QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation Paper • 2502.05178 • Published Feb 7, 2025 • 10 • 2