view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 10 days ago • 234
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face +3 Jul 29 • 203
EuroBERT Collection Scaling Multilingual Encoders for European Languages • 4 items • Updated Mar 10 • 13
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 175
view article Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub +2 Feb 12 • 79
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 249