Qwen

Team

company

https://qwen.ai/

alibaba_qwen

QwenLM

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

akhaliq submitted a paper 1 day ago

ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models

alozowski authored a paper 2 days ago

YourBench: Easy Custom Evaluation Sets for Everyone

littlebird13 updated a Space 6 days ago

Qwen/Qwen3-Omni-Demo

View all activity

Papers

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Qwen3-VL Technical Report

View all Papers

akhaliq

submitted a paper to Daily Papers 1 day ago

ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models

Paper • 2512.07843 • Published 17 days ago • 18

alozowski

authored a paper 2 days ago

YourBench: Easy Custom Evaluation Sets for Everyone

Paper • 2504.01833 • Published Apr 2 • 22

littlebird13

updated a Space 6 days ago

Qwen3 Omni Demo

Generate audio responses from text and media inputs

wenhuach

authored a paper 6 days ago

SignRoundV2: Closing the Performance Gap in Extremely Low-Bit Post-Training Quantization for LLMs

Paper • 2512.04746 • Published 7 days ago • 12

wenhuach

posted an update 6 days ago

Post

2816

🚀 SignRoundV2 for LLM quantization: PTQ-level cost, QAT-level accuracy — yes, even at 2 bits.

SignRoundV2: Closing the Performance Gap in Extremely Low-Bit Post-Training Quantization for LLMs (2512.04746)

ShuaiBai623

authored a paper 7 days ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published 15 days ago • 119

littlebird13

published a model 7 days ago

Qwen/Qwen3-Next-80B-A3B-Instruct-GGUF

Text Generation • 80B • Updated 8 days ago • 2.41k • 9

bartowski

in Qwen/Qwen3-Coder-30B-A3B-Instruct 8 days ago

Does new update require GGUF update?

#34 opened 8 days ago by

littlebird13

updated 2 models 8 days ago

Qwen/Qwen3-Next-80B-A3B-Instruct-GGUF

Text Generation • 80B • Updated 8 days ago • 2.41k • 9

Qwen/Qwen3-Next-80B-A3B-Thinking-GGUF

Text Generation • 80B • Updated 8 days ago • 841 • 7

cyente

updated 2 models 8 days ago

Qwen/Qwen3-Coder-30B-A3B-Instruct-FP8

Text Generation • 31B • Updated 8 days ago • 291k • 118

Qwen/Qwen3-Coder-30B-A3B-Instruct

Text Generation • 31B • Updated 8 days ago • 1.18M • • 804

littlebird13

published a model 8 days ago

Qwen/Qwen3-Next-80B-A3B-Thinking-GGUF

Text Generation • 80B • Updated 8 days ago • 841 • 7

fernandofernandes

authored 4 papers 9 days ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7, 2024 • 15

Domain Adaptation of Llama3-70B-Instruct through Continual Pre-Training and Model Merging: A Comprehensive Evaluation

Paper • 2406.14971 • Published Jun 21, 2024

Training-Free Tokenizer Transplantation via Orthogonal Matching Pursuit

Paper • 2506.06607 • Published Jun 7 • 2

LFM2 Technical Report

Paper • 2511.23404 • Published 13 days ago • 34

yangapku

authored a paper 9 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 10 days ago • 85

littlebird13

updated a Space 13 days ago

Qwen TTS Clone Demo

Clone and synthesize voice from a sample

littlebird13

published a Space 13 days ago

Qwen TTS Clone Demo

Clone and synthesize voice from a sample