1 5 7

Mikhail Yurochkin

moonfolk

https://moonfolk.github.io/

moonfolk

AI & ML interests

None yet

Recent Activity

liked a Space 4 days ago

LLM360/k2v2-vibe

liked a dataset 5 days ago

LLM360/TxT360-3efforts

liked a dataset 5 days ago

LLM360/TxT360

View all activity

Organizations

liked a Space 4 days ago

Eval Dashboard

💻

Analyze model performance across training stages

liked 3 datasets 5 days ago

liked 2 models 6 days ago

LLM360/K2-V2-Instruct

Updated 4 days ago • 1.75k • 19

LLM360/K2-V2

Updated 6 days ago • 219 • 10

updated 2 datasets 6 days ago

LLM360/TxT360-3efforts

Viewer • Updated 5 days ago • 9.46M • 2.19k • 7

LLM360/TxT360-Midas

Viewer • Updated 6 days ago • 2.87B • 2.58k • 6

upvoted a collection 8 days ago

K2-V2

Collection

The collection for K2-V2 models. • 6 items • Updated 10 days ago • 15

upvoted a paper 29 days ago

PAN: A World Model for General, Interactable, and Long-Horizon World Simulation

Paper • 2511.09057 • Published Nov 12 • 75

liked a Space about 1 month ago

TxT360: Trillion Extracted Text

📖

130

Explore and analyze the TxT360 dataset for LLM pre-training

updated a Space 4 months ago

README

👀

upvoted a paper 4 months ago

Essential-Web v1.0: 24T tokens of organized web data

Paper • 2506.14111 • Published Jun 17 • 46

upvoted a paper 5 months ago

Critiques of World Models

Paper • 2507.05169 • Published Jul 7 • 25

authored a paper 6 months ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17 • 49

upvoted a paper 6 months ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17 • 49

authored 4 papers over 1 year ago

The Future of Open Human Feedback

Paper • 2408.16961 • Published Aug 15, 2024 • 22

Asymmetry in Low-Rank Adapters of Foundation Models

Paper • 2402.16842 • Published Feb 26, 2024 • 2

tinyBenchmarks: evaluating LLMs with fewer examples

Paper • 2402.14992 • Published Feb 22, 2024 • 17

Large Language Model Routing with Benchmark Datasets

Paper • 2309.15789 • Published Sep 27, 2023 • 1