OpenMOSS-Team/SmolLM-135M-MLA-d_kv_32
Text Generation • 0.1B • Updated • 10
LLM
DiRL: An Efficient Post-Training Framework for Diffusion Language Models
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs