2 11 14

Pretam Ray

Pretam

raypretam

AI & ML interests

NLP

Recent Activity

liked a dataset 14 days ago

arc-agi-community/arc-agi-2

liked a Space 28 days ago

nanotron/ultrascale-playbook

liked a Space 28 days ago

HuggingFaceTB/smol-training-playbook

View all activity

Organizations

liked a dataset 14 days ago

arc-agi-community/arc-agi-2

Viewer • Updated Apr 2 • 1.12k • 129 • 11

liked 2 Spaces 28 days ago

The Ultra-Scale Playbook

🌌

3.55k

The ultimate guide to training LLM on large GPU Clusters

The Smol Training Playbook

📚

2.55k

The secrets to building world-class LLMs

upvoted a paper 2 months ago

ShinkaEvolve: Towards Open-Ended And Sample-Efficient Program Evolution

Paper • 2509.19349 • Published Sep 17 • 2

upvoted a collection 3 months ago

Qwen3

Collection

84 items • Updated Aug 6 • 1.47k

upvoted 2 papers 4 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8 • 192

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Paper • 2507.23726 • Published Jul 31 • 114

upvoted a collection 4 months ago

DepNeCT

Collection

This Hugging Face collection hosts models and datasets from DepNeCT — a dependency-based method for nested compound type identification in Sanskrit • 4 items • Updated Jul 29 • 2

liked a model 5 months ago

nvidia/OpenReasoning-Nemotron-32B

Text Generation • 33B • Updated Sep 16 • 2.72k • • 119

updated a model 5 months ago

Pretam/hindi_sanskrit

0.6B • Updated Jul 3 • 5

published a model 5 months ago

Pretam/hindi_sanskrit

0.6B • Updated Jul 3 • 5

authored a paper 7 months ago

REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback

Paper • 2505.06548 • Published May 10 • 30

upvoted a paper 7 months ago

REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback

Paper • 2505.06548 • Published May 10 • 30

liked a model 8 months ago

google/gemma-3-4b-it-qat-int4-unquantized

Image-Text-to-Text • 4B • Updated Apr 15 • 545 • 8

updated a model 9 months ago

Pretam/lora_model_gemma-3-12b-it_anushtup_final

Updated Mar 27

published a model 9 months ago

Pretam/lora_model_gemma-3-12b-it_anushtup_final

Updated Mar 27

liked a Space 9 months ago

Scaling test-time compute

📈

587

Implement test-time compute scaling for math problems

liked a Space 10 months ago

Model Memory Utility

🚀

990

Calculate vRAM needed for model training and inference

published a model 10 months ago

Pretam/t5-small-finetuned-xsum

Updated Jan 30

upvoted an article over 1 year ago

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

•

171