- Article: NVIDIA Releases Nemotron-CC-Math Pre-Training Dataset: A High-Quality, Web-Scale Math Corpus for Pretraining Large Language Models (published Aug 18)
- Paper: NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model (arXiv:2508.14444, published Aug 20)
- Paper: Nemotron-CC-Math: A 133 Billion-Token-Scale High Quality Math Pretraining Dataset (arXiv:2508.15096, published Aug 20)
- Collection: Nemotron-Pre-Training-Datasets, large-scale pre-training datasets used in the Nemotron family of models (11 items)
- Paper: NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning (arXiv:2504.13941, published Apr 15)
- Paper: LLM Pruning and Distillation in Practice: The Minitron Approach (arXiv:2408.11796, published Aug 21, 2024)
- Collection: Canary, multilingual and multitask speech-to-text models from NVIDIA NeMo (5 items)