Running 9 Frontier AI Cybersecurity Observatory 🌎 9 Cybersecurity Capability Evaluation Results Collection
AlicanKiraz0/Cybersecurity-BaronLLM_Offensive_Security_LLM_Q6_K_GGUF Text Generation • 8B • Updated Jun 4 • 777 • 124
Running on CPU Upgrade Featured 2.6k The Smol Training Playbook 📚 2.6k The secrets to building world-class LLMs
Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models Paper • 2508.21365 • Published Aug 29 • 29
Running 41 Leaderboard: Physical Reasoning from Video 🏃 41 Submit model evaluations and view leaderboard results
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face +3 Jul 29 • 203