rom7 (romit) – Likes

liked 2 Spaces 7 months ago

Unlocking On-Policy Distillation for Any Model Family

📝

112

Explore on-policy distillation visualization for any model

The Smol Training Playbook

📚

3.2k

The secrets to building world-class LLMs

liked a model 11 months ago

ibm-granite/granite-3.1-8b-instruct

Text Generation • 8B • Updated Apr 16, 2025 • 99.2k • 167

liked a model about 1 year ago

11mlabs/indri-0.1-124m-tts

Text-to-Speech • 0.1B • Updated May 2, 2025 • 114 • 11

liked a Space over 1 year ago

The Ultra-Scale Playbook

🌌

3.88k

The ultimate guide to training LLM on large GPU Clusters

liked 4 datasets over 1 year ago

liked a dataset almost 2 years ago

speechcolab/gigaspeech

Viewer • Updated Feb 7 • 11.9M • 27k • 164

liked 2 models almost 2 years ago

facebook/w2v-bert-2.0

Feature Extraction • 0.6B • Updated Jan 25, 2024 • 4.02M • 217

stabilityai/cosxl

Text-to-Image • Updated Apr 13, 2024 • 244

liked 2 models about 2 years ago

NousResearch/Hermes-2-Pro-Mistral-7B-GGUF

7B • Updated Mar 28, 2024 • 4.9k • 250

urchade/gliner_large-v2

Token Classification • 0.5B • Updated Jul 12, 2024 • 1.01k • 56

liked a dataset about 2 years ago

CohereLabs/wikipedia-2023-11-embed-multilingual-v3

Viewer • Updated Mar 25 • 247M • 14.1k • 247

liked a dataset over 2 years ago

ivelin/ui_refexp_saved

Viewer • Updated Jan 8, 2023 • 16.7k • 102 • 17

liked 4 models over 2 years ago

TheBloke/OpenHermes-2.5-Mistral-7B-GPTQ

Text Generation • 7B • Updated Nov 2, 2023 • 126 • 29

LanguageBind/Video-LLaVA-Pretrain-7B

Text Generation • Updated Feb 1, 2024 • 13 • 10

vdo/Video-LLaMA-Series

Visual Question Answering • Updated Jun 14, 2023 • 10

AskUI/pta-text-0.1

Updated Mar 11, 2024 • 13

ScalingIntelligence/KernelBench

parler-tts/libritts_r_filtered

mythicinfinity/libritts_r

mythicinfinity/libriheavy
