Collections
Collections including paper arxiv:2412.19437
- Cosmos World Foundation Model Platform for Physical AI
  Paper • 2501.03575 • Published • 81
- Phi-4 Technical Report
  Paper • 2412.08905 • Published • 122
- MiniMax-01: Scaling Foundation Models with Lightning Attention
  Paper • 2501.08313 • Published • 301
- DeepSeek-V3 Technical Report
  Paper • 2412.19437 • Published • 73
- deepseek-ai/DeepSeek-V3-Base
  685B • Updated • 10.9k • 1.68k
- TransMLA: Multi-head Latent Attention Is All You Need
  Paper • 2502.07864 • Published • 58
- Qwen2.5 Bakeneko 32b Instruct Awq
  ⚡2 Generate detailed responses to text prompts
- Deepseek R1 Distill Qwen2.5 Bakeneko 32b Awq
  ⚡3 Generate text responses to user messages in a chat interface
- Offline Reinforcement Learning for LLM Multi-Step Reasoning
  Paper • 2412.16145 • Published • 38
- DeepSeek-V3 Technical Report
  Paper • 2412.19437 • Published • 73
- Slamming: Training a Speech Language Model on One GPU in a Day
  Paper • 2502.15814 • Published • 69
- SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning
  Paper • 2506.19767 • Published • 15