Running 9 Frontier AI Cybersecurity Observatory 🌎 9 Cybersecurity Capability Evaluation Results Collection
AlicanKiraz0/Cybersecurity-BaronLLM_Offensive_Security_LLM_Q6_K_GGUF Text Generation • 8B • Updated Jun 4 • 783 • 124
Running on CPU Upgrade Featured 2.57k The Smol Training Playbook 📚 2.57k The secrets to building world-class LLMs
Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models Paper • 2508.21365 • Published Aug 29 • 29
Running 40 Leaderboard: Physical Reasoning from Video 🏃 40 Submit model evaluations and view leaderboard results
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face +3 Jul 29 • 203