t.d.a.g.'s picture

t.d.a.g. PRO

sequelbox

·

sequelbox.bsky.social

AI & ML interests

open source, infinite games. (they/them)

Recent Activity

replied to their post about 2 hours ago

Two new releases today! Firstly, our new Raiden-Mini dataset, powered by DeepSeek's newest https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Speciale model! - A V3.2-Speciale reasoning showcase: the Raiden prompts test the model's creative, analytic, and general reasoning skills! - HEAD TO HEAD: a comparison subset pits V3.2-Speciale against V3.2 with the same prompts, providing a direct look at each model's advantages! Get the new Raiden-Mini dataset: https://huggingface.co/datasets/sequelbox/Raiden-Mini-DeepSeek-V3.2-Speciale On the model side, we've also brought Shining Valiant 3 to Ministral 3! - Science-reasoning: https://huggingface.co/datasets/sequelbox/Celestia3-DeepSeek-R1-0528 for physics, biology, chemistry, compsci, astronomy, Earth science, and information theory. - AI to build AI: the https://huggingface.co/datasets/sequelbox/Mitakihara-DeepSeek-R1-0528 dataset for high-quality reasoning performance on AI, MLOps, math and CUDA, complex adaptive and agentic systems, cognition, logic, linguistics, simulation, knowledge management, and more! - Creative reasoning and general chat performance supplemented with https://huggingface.co/datasets/sequelbox/Raiden-DeepSeek-R1 Get the newest SV3: https://huggingface.co/ValiantLabs/Ministral-3-14B-Reasoning-2512-ShiningValiant3 Esper 3.1 is available for Ministral 3 as well: https://huggingface.co/ValiantLabs/Ministral-3-14B-Reasoning-2512-Esper3.1 We're working hard on our next Big New Release, coming out in the next few weeks :) Help support our releases, donations used for models and datasets: https://huggingface.co/spaces/sequelbox/SupportOpenSource Open source matters. Fight for it with us. with love and friendship, allegra

liked a model about 2 hours ago

sequelbox/Ministral-3-14B-Reasoning-2512-PlumEsper1.1

published a model about 2 hours ago

sequelbox/Ministral-3-14B-Reasoning-2512-PlumEsper1.1

View all activity

Organizations

upvoted a collection about 5 hours ago

Reasoning Datasets

Synthetic datasets generated using reasoning models, primarily the Deepseek-R1 and Deepseek-V3 series. • 12 items • Updated 1 day ago • 2

upvoted a collection 1 day ago

Shining Valiant 3

Shining Valiant 3 is a science-reasoning, LLMOps, AI architecture, and general reasoning finetune for Qwen, gpt-oss, and Ministral! • 5 items • Updated 1 day ago • 2

upvoted a collection 6 days ago

Esper 3.1

Esper 3.1 is a DevOps, architecture, code, and general reasoning finetune for Qwen, Ministral and gpt-oss! • 5 items • Updated 6 days ago • 1

upvoted a changelog 6 days ago

Changelog

Duplicate Datasets

6 days ago

• 69

upvoted a collection 9 days ago

Qwen3

84 items • Updated Aug 6 • 1.48k

upvoted a paper about 2 months ago

The Massive Legal Embedding Benchmark (MLEB)

Paper • 2510.19365 • Published Oct 22 • 17

upvoted a paper 3 months ago

Hermes 4 Technical Report

Paper • 2508.18255 • Published Aug 25 • 42

upvoted 2 articles 5 months ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Jul 9

•

722

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

Jul 8

•

735

upvoted 2 papers 5 months ago

CodeContests+: High-Quality Test Case Generation for Competitive Programming

Paper • 2506.05817 • Published Jun 6 • 9

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25 • 47

upvoted a collection 5 months ago

🐙 OctoThinker

Mid-training Incentivizes Reinforcement Learning Scaling • 18 items • Updated Jun 26 • 2

upvoted a changelog 6 months ago

Changelog

Organization and User profiles now include repository listing pages

Jun 20

• 131

upvoted a collection 7 months ago

Esper 3

Esper 3 is a DevOps, architecture, code, and general reasoning finetune for Qwen 3! • 4 items • Updated 6 days ago • 3

upvoted a paper 11 months ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 52

upvoted a collection 12 months ago

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated 6 days ago • 155

upvoted a paper about 1 year ago

HelpSteer2-Preference: Complementing Ratings with Preferences

Paper • 2410.01257 • Published Oct 2, 2024 • 24

upvoted an article about 1 year ago

Article

Introducing Community Tools on HuggingChat

Sep 16, 2024

•

37

upvoted an article over 1 year ago

Article

Synthetic dataset generation techniques: Self-Instruct

May 15, 2024

•

21

upvoted a collection over 1 year ago

Llama 3.x Models

Our models built with Llama 3, 3.1, and 3.2 • 10 items • Updated 6 days ago • 3