Olmo 3 Pre-training • Collection • All artifacts related to Olmo 3 pre-training • 10 items • Updated 7 days ago • 26
Real-Time Reasoning Agents in Evolving Environments • Paper • 2511.04898 • Published about 1 month ago • 11
SynthesizeMe! Inducing Persona-Guided Prompts for Personalized Reward Models in LLMs • Paper • 2506.05598 • Published Jun 5 • 7
Gemstone Models • Collection • Our 22 open source Gemstone models for scaling laws range from 50M to 2B parameters, spanning 11 widths from 256 to 3072 and 18 depths from 3 to 80. • 69 items • Updated Jul 4 • 10
Nemotron-CC: Transforming Common Crawl into a Refined Long-Horizon Pretraining Dataset • Paper • 2412.02595 • Published Dec 3, 2024 • 5
Mind the Gap! Static and Interactive Evaluations of Large Audio Models • Paper • 2502.15919 • Published Feb 21 • 4
Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise • Paper • 2410.03017 • Published Oct 3, 2024 • 29
Distilling an End-to-End Voice Assistant Without Instruction Training Data • Paper • 2410.02678 • Published Oct 3, 2024 • 23