Community Blog & Articles

Community Articles

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval

Norm-Preserving Biprojected Abliteration

Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement

An Edge-First Generalized LLM LoRA Fine-Tuning Framework for Heterogeneous GPUs

Building Jobly: Semantic Job Matching with RAG and Vector Embeddings

Uncensor any LLM with abliteration

Code a simple RAG from scratch

AI Energy Score v2: Refreshed Leaderboard, now with Reasoning 🧠

Building and evaluating Multimodal Rerankers

From GRPO to DAPO and GSPO: What, Why, and How

DeepFabric: Generate, Train and Evaluate with Datasets curated for Model Behavior Training.

Gemini-3 Benchmarkathon

Mastering Tensor Dimensions in Transformers

KV Caching Explained: Optimizing Transformer Inference Efficiency

Curating datasets directly on the Hub

Engineering Notes: Training a LoRA for Z-Image Turbo with the Ostris AI Toolkit

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

Small Language Models (SLM): A Comprehensive Overview

GSMA Open-Telco LLM Benchmarks 2.0: The first dedicated LLM Evaluation for Telecoms

audio, speech, leaderboard

Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks

November 21, 2025

leaderboard, evaluation, nlp

Arabic Leaderboards: Introducing Arabic Instruction Following, Updating AraGen, and More

math-verify, open-llm-leaderboard, leaderboard

Fixing Open LLM Leaderboard with Math-Verify

February 14, 2025

nlp, research, leaderboard

The Open Arabic LLM Leaderboard 2

February 10, 2025

open-llm-leaderboard, leaderboard, energy_efficiency

CO₂ Emissions and Models Performance: Insights from the Open LLM Leaderboard

January 9, 2025

leaderboard, research, collaboration

Evaluating Audio Reasoning with Big Bench Audio

December 20, 2024

leaderboard, evaluation, nlp

Rethinking LLM Evaluation with 3C3H: AraGen Benchmark and Leaderboard

December 4, 2024

community, research, nlp

Letting Large Models Debate: The First Multilingual LLM Debate Competition

November 20, 2024

community, research, nlp

Introducing the Open Leaderboard for Japanese LLMs!

November 20, 2024

leaderboard, arena, collaboration

Judge Arena: Benchmarking LLMs as Evaluators

November 19, 2024

leaderboard, collaboration, community

Introducing the Open FinLLM Leaderboard

October 4, 2024

nlp, research, leaderboard

🇨🇿 BenCzechMark - Can your LLM Understand Czech?

October 1, 2024

ai4math, nlp, community

How NuminaMath Won the 1st AIMO Progress Prize

agents, smolagents, nlp

Our Transformers Code Agent beats the GAIA benchmark 🏅
