Ha-Yeong Choi

Ha0

https://scholar.google.com/citations?user=Jw3X6UgAAAAJ&hl=ko

hayeong0

AI & ML interests

Speech Synthesis, Voice Conversion, Generative Models

Organizations

None yet

Recent Activity

upvoted a paper 17 days ago

StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation

Paper • 2512.09363 • Published 18 days ago • 71

upvoted a paper 22 days ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published 24 days ago • 168

liked a dataset about 1 month ago

yenopoya/thousand-voices-trauma

Updated Oct 24 • 465 • 2

upvoted a paper 2 months ago

FARMER: Flow AutoRegressive Transformer over Pixels

Paper • 2510.23588 • Published Oct 27 • 58

upvoted a paper 4 months ago

Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies

Paper • 2508.20072 • Published Aug 27 • 31

liked a model 5 months ago

LiquidAI/LFM2-350M

Text Generation • 0.4B • Updated 23 days ago • 20.3k • 199

upvoted 2 papers 5 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8 • 195

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29 • 136

liked a Space 5 months ago

Open ASR Leaderboard

🏆

1.18k

View and request speech models benchmark data

upvoted a paper 5 months ago

CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models

Paper • 2507.13984 • Published Jul 18 • 25

upvoted 3 papers 6 months ago

upvoted a paper 7 months ago

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Paper • 2506.05176 • Published Jun 5 • 76

liked a model 7 months ago

nvidia/parakeet-tdt-1.1b

Automatic Speech Recognition • Updated 24 days ago • 4.27k • 110

liked a dataset 7 months ago

yijingwu/HeySQuAD_human

Viewer • Updated Feb 26, 2024 • 76.1k • 432 • 4

liked a model 7 months ago

nvidia/canary-1b-flash

Automatic Speech Recognition • 0.8B • Updated 24 days ago • 217k • 261

upvoted a paper 7 months ago

CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models

Paper • 2505.12504 • Published May 18 • 24

liked a model 8 months ago

kyutai/moshika-pytorch-bf16

Updated Sep 18, 2024 • 803 • 58

liked a dataset 8 months ago

HKUSTAudio/Audio-FLAN-Dataset

Preview • Updated Oct 6 • 13.5k • 38
