The Ultra-Scale Playbook • 3.63k • The ultimate guide to training LLM on large GPU Clusters
unsloth/DeepSeek-R1-Distill-Llama-70B Text Generation • 71B • Updated May 10, 2025 • 149 • 10
unsloth/Llama-3.3-70B-Instruct-bnb-4bit Text Generation • 71B • Updated Nov 25, 2025 • 13.6k • 52
ibnzterrell/Nvidia-Llama-3.1-Nemotron-70B-Instruct-HF-AWQ-INT4 Text Generation • 71B • Updated Dec 7, 2024 • 25 • 6
hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4 Text Generation • 71B • Updated Aug 7, 2024 • 119k • 107
FineWeb: decanting the web for the finest text data at scale • 1.26k • Generate high-quality text data for LLMs using FineWeb