view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) +2 Dec 9, 2022 • 377
Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning Paper • 2508.08221 • Published Aug 11 • 49
Shorter but not Worse: Frugal Reasoning via Easy Samples as Length Regularizers in Math RLVR Paper • 2511.01937 • Published Nov 2 • 12
⚛️ Liquid Nanos Collection Library of task-specific models: https://www.liquid.ai/blog/introducing-liquid-nanos-frontier-grade-performance-on-everyday-devices • 22 items • Updated 9 days ago • 98
view article Article AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models Sep 16 • 19
Parakeet Collection NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 12 items • Updated 7 days ago • 47
view article Article Building Conversational AI: A Deep Dive into Voice Agent Architectures and Best Practices Sep 2 • 11
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper • 2508.18265 • Published Aug 25 • 208
view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels Aug 18 • 88