Running 3.55k The Ultra-Scale Playbook 🌌 3.55k The ultimate guide to training LLM on large GPU Clusters
Running on CPU Upgrade Featured 2.55k The Smol Training Playbook 📚 2.55k The secrets to building world-class LLMs
ShinkaEvolve: Towards Open-Ended And Sample-Efficient Program Evolution Paper • 2509.19349 • Published Sep 17 • 2
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8 • 192
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving Paper • 2507.23726 • Published Jul 31 • 114
DepNeCT Collection This Hugging Face collection hosts models and datasets from DepNeCT — a dependency-based method for nested compound type identification in Sanskrit • 4 items • Updated Jul 29 • 2
REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback Paper • 2505.06548 • Published May 10 • 30
REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback Paper • 2505.06548 • Published May 10 • 30
Running on CPU Upgrade Featured 990 Model Memory Utility 🚀 990 Calculate vRAM needed for model training and inference
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA +3 May 24, 2023 • 171