PAN: A World Model for General, Interactable, and Long-Horizon World Simulation Paper β’ 2511.09057 β’ Published Nov 12 β’ 75
Running 130 TxT360: Trillion Extracted Text π 130 Explore and analyze the TxT360 dataset for LLM pre-training
Essential-Web v1.0: 24T tokens of organized web data Paper β’ 2506.14111 β’ Published Jun 17 β’ 46
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective Paper β’ 2506.14965 β’ Published Jun 17 β’ 49
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective Paper β’ 2506.14965 β’ Published Jun 17 β’ 49
Asymmetry in Low-Rank Adapters of Foundation Models Paper β’ 2402.16842 β’ Published Feb 26, 2024 β’ 2
tinyBenchmarks: evaluating LLMs with fewer examples Paper β’ 2402.14992 β’ Published Feb 22, 2024 β’ 17
Large Language Model Routing with Benchmark Datasets Paper β’ 2309.15789 β’ Published Sep 27, 2023 β’ 1