-
Let's Predict Sentence by Sentence
Paper • 2505.22202 • Published • 19 -
From Bytes to Ideas: Language Modeling with Autoregressive U-Nets
Paper • 2506.14761 • Published • 17 -
TokAlign: Efficient Vocabulary Adaptation via Token Alignment
Paper • 2506.03523 • Published -
zip2zip: Inference-Time Adaptive Vocabularies for Language Models via Token Compression
Paper • 2506.01084 • Published • 7
ssh
melebele
AI & ML interests
None yet
Recent Activity
liked
a dataset
about 2 months ago
bird-of-paradise/muon-tutorial
updated
a collection
5 months ago
nlp
updated
a collection
5 months ago
nlp
Organizations
None yet