The Ultra-Scale Playbook 🌌 • 3.65k • The ultimate guide to training LLMs on large GPU clusters
MiniMax-01: Scaling Foundation Models with Lightning Attention • Paper • 2501.08313 • Published Jan 14, 2025 • 300