JunhaSong

junha1125

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 2 days ago

InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion

Paper • 2512.17504 • Published 13 days ago • 93

upvoted a paper 10 days ago

Infinite-Homography as Robust Conditioning for Camera-Controlled Video Generation

Paper • 2512.17040 • Published 14 days ago • 27

upvoted a paper 15 days ago

Vector Prism: Animating Vector Graphics by Stratifying Semantic Structure

Paper • 2512.14336 • Published 16 days ago • 28

upvoted a paper 17 days ago

EgoX: Egocentric Video Generation from a Single Exocentric Video

Paper • 2512.08269 • Published 23 days ago • 115

liked a Space about 2 months ago

Qwen Image Edit Camera Control

🎬

1.7k

Fast 4 step inference with Qwen Image Edit 2509

upvoted a paper about 2 months ago

PHUMA: Physically-Grounded Humanoid Locomotion Dataset

Paper • 2510.26236 • Published Oct 30, 2025 • 29

upvoted a paper 2 months ago

ACG: Action Coherence Guidance for Flow-based VLA models

Paper • 2510.22201 • Published Oct 25, 2025 • 36

authored 2 papers 2 months ago

EcoTTA: Memory-Efficient Continual Test-time Adaptation via Self-distilled Regularization

Paper • 2303.01904 • Published Mar 3, 2023

RL makes MLLMs see better than SFT

Paper • 2510.16333 • Published Oct 18, 2025 • 48

upvoted a paper 2 months ago

RL makes MLLMs see better than SFT

Paper • 2510.16333 • Published Oct 18, 2025 • 48

upvoted a paper 5 months ago

DesignLab: Designing Slides Through Iterative Detection and Correction

Paper • 2507.17202 • Published Jul 23, 2025 • 50

upvoted a paper 6 months ago

Token Bottleneck: One Token to Remember Dynamics

Paper • 2507.06543 • Published Jul 9, 2025 • 20

liked a model 6 months ago

AILab-CVC/seed-x-17b-instruct

Updated Sep 21, 2024 • 17 • 1

liked a model 8 months ago

nvidia/DAM-3B

Image-Text-to-Text • Updated May 7, 2025 • 31.1k • 128

upvoted a collection 8 months ago

ProLIP

Collection

Official ProLIP weights, Probabilistic Language-Image Pre-Training (ICLR 2025) • 7 items • Updated Apr 18, 2025 • 10

liked 2 models 9 months ago

mistralai/Mistral-Small-3.1-24B-Instruct-2503

24B • Updated 10 days ago • 83.5k • 1.34k

LGAI-EXAONE/EXAONE-Deep-32B

Text Generation • 32B • Updated Mar 19, 2025 • 1.3k • 298

liked a model 11 months ago

deepseek-ai/DeepSeek-R1-Distill-Llama-8B

Text Generation • 8B • Updated Feb 24, 2025 • 956k • 835

liked 2 models over 1 year ago

lmms-lab/llama3-llava-next-8b

Text Generation • 8B • Updated Aug 17, 2024 • 1.67k • 103

xhyi/PT_GPTNEO350_ATG

Text Generation • Updated Jul 27, 2022 • 938 • 20
