19 14 5

Will Held PRO

WillHeld

https://williamheld.com

AI & ML interests

Machine Learning and Natural Language Processing for low-resource languages and language variants

Recent Activity

liked a dataset 15 days ago

allenai/dolma3_pool

upvoted a collection 15 days ago

Olmo 3 Pre-training

updated a model 23 days ago

marin-community/marin-32b-base

View all activity

Organizations

upvoted a collection 15 days ago

Olmo 3 Pre-training

Collection

All artifacts related to Olmo 3 pre-training • 10 items • Updated 7 days ago • 26

upvoted a paper 25 days ago

Real-Time Reasoning Agents in Evolving Environments

Paper • 2511.04898 • Published about 1 month ago • 11

upvoted a paper 6 months ago

SynthesizeMe! Inducing Persona-Guided Prompts for Personalized Reward Models in LLMs

Paper • 2506.05598 • Published Jun 5 • 7

upvoted a collection 7 months ago

Gemstone Models

Collection

Our 22 open source Gemstone models for scaling laws range from 50M to 2B parameters, spanning 11 widths from 256 to 3072 and 18 depths from 3 to 80. • 69 items • Updated Jul 4 • 10

upvoted 3 papers 7 months ago

upvoted 2 papers 9 months ago

Nemotron-CC: Transforming Common Crawl into a Refined Long-Horizon Pretraining Dataset

Paper • 2412.02595 • Published Dec 3, 2024 • 5

Mind the Gap! Static and Interactive Evaluations of Large Audio Models

Paper • 2502.15919 • Published Feb 21 • 4

upvoted an article 11 months ago

Article

Optimizing Pretraining Data Mixes with LLM-Estimated Utility

Jan 22

•

upvoted 2 articles about 1 year ago

Article

Welcome, Gradio 5

Oct 9, 2024

•

130

Article

AI Apps in a Flash with Gradio's Reload Mode

Apr 16, 2024

•

upvoted 2 papers about 1 year ago

Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise

Paper • 2410.03017 • Published Oct 3, 2024 • 29

Distilling an End-to-End Voice Assistant Without Instruction Training Data

Paper • 2410.02678 • Published Oct 3, 2024 • 23

Will Held PRO

AI & ML interests

Recent Activity

Organizations

WillHeld's activity

Optimizing Pretraining Data Mixes with LLM-Estimated Utility

Welcome, Gradio 5

AI Apps in a Flash with Gradio's Reload Mode