victor wu

victor-wu

https://wutong4012.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper 2 days ago

AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation

authored a paper 2 days ago

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

authored a paper 2 days ago

Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise

View all activity

Organizations

authored 7 papers 2 days ago

AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation

Paper • 2305.09515 • Published May 16, 2023 • 3

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Paper • 2401.02954 • Published Jan 5, 2024 • 50

Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise

Paper • 2212.11685 • Published Dec 22, 2022 • 2

Never Miss A Beat: An Efficient Recipe for Context Window Extension of Large Language Models with Consistent "Middle" Enhancement

Paper • 2406.07138 • Published Jun 11, 2024 • 2

OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts

Paper • 2503.22952 • Published Mar 29 • 17

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 188

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

Paper • 2505.13308 • Published May 19 • 27

updated 2 models 2 days ago

bigai-NPR/NPR-4B

4B • Updated 2 days ago • 38 • 7

bigai-NPR/NPR-4B-non-thinking

4B • Updated 2 days ago • 30 • 3

authored a paper 2 days ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published 3 days ago • 67

upvoted a paper 2 days ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published 3 days ago • 67

published 2 models 8 days ago

bigai-NPR/NPR-4B-non-thinking

4B • Updated 2 days ago • 30 • 3

bigai-NPR/NPR-4B

4B • Updated 2 days ago • 38 • 7

upvoted a paper 6 months ago

RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling

Paper • 2506.08672 • Published Jun 10 • 30

upvoted 2 papers 7 months ago

Discrete Markov Bridge

Paper • 2505.19752 • Published May 26 • 17

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

Paper • 2505.13308 • Published May 19 • 27

updated a model 9 months ago

TokenSwift/TokenSwift-QwQ-32B

Text Generation • 33B • Updated Mar 19 • 8 • 1

published a model 9 months ago

TokenSwift/TokenSwift-QwQ-32B

Text Generation • 33B • Updated Mar 19 • 8 • 1

upvoted an article 9 months ago

Article

Open R1: Update #3

Mar 11

•

296

upvoted a paper 9 months ago

From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens

Paper • 2502.18890 • Published Feb 26 • 30

victor wu

AI & ML interests

Recent Activity

Organizations

victor-wu's activity

Open R1: Update #3