Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
yamatazen 's Collections
Genshin Impact
Optimizers
Autoregressive image generation
GGUF tools
AGI
Model merging
Multilingual LLMs
Japanese LLMs
AI censorship
LLM leaderboards
Grokking

Grokking

updated Jun 27, 2025
Upvote
2

  • Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

    Paper • 2405.15071 • Published May 23, 2024 • 41

  • Grokking at the Edge of Numerical Stability

    Paper • 2501.04697 • Published Jan 8, 2025 • 2

  • Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

    Paper • 2506.21551 • Published Jun 26, 2025 • 28
Upvote
2
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs