26 42

Djellal Mohamed Aniss

dmaniss

djellalmohamedaniss

AI & ML interests

SLM, distillation, synthetic data and reasoning.

Recent Activity

liked a Space 1 day ago

aminediroHF/trainer-generator-bf16-mismatch

upvoted an article 4 days ago

⚡ nano-vLLM: Lightweight, Low-Latency LLM Inference from Scratch

liked a model about 2 months ago

jinaai/jina-embeddings-v5-text-small-text-matching

View all activity

Organizations

None yet

liked a Space 1 day ago

Defeating the trainer-generator precision mismatch in TRL

🎯

Download research PDF (Pro access required)

upvoted an article 4 days ago

Article

⚡ nano-vLLM: Lightweight, Low-Latency LLM Inference from Scratch

Jun 28, 2025

•

liked a model about 2 months ago

jinaai/jina-embeddings-v5-text-small-text-matching

upvoted an article about 2 months ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

•

740

upvoted an article 4 months ago

Article

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

Aug 8, 2025

•

liked a Space 6 months ago

The Smol Training Playbook

📚

3.14k

The secrets to building world-class LLMs

liked a dataset 6 months ago

LangAGI-Lab/magpie-reasoning-v1-100k-math-verifiable

Viewer • Updated Feb 9, 2025 • 21.7k • 22 • 1

upvoted a paper 6 months ago

Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning

Paper • 2509.24372 • Published Sep 29, 2025 • 14

liked a Space 6 months ago

Unlocking On-Policy Distillation for Any Model Family

📝

Visualize on-policy distillation for any model family

upvoted a paper 6 months ago

Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning

Paper • 2508.09726 • Published Aug 13, 2025 • 15

liked 2 datasets 6 months ago

MLCommons/peoples_speech

Viewer • Updated Nov 20, 2024 • 8.05M • 36.7k • 264

MrDragonFox/Elise

Viewer • Updated Mar 27, 2025 • 1.2k • 629 • 129

upvoted 3 papers 7 months ago

liked 2 models 8 months ago

Alibaba-NLP/Tongyi-DeepResearch-30B-A3B

Text Generation • 31B • Updated Oct 10, 2025 • 106k • 811

YannQi/R-4B

Image-Text-to-Text • 5B • Updated Sep 4, 2025 • 111k • 181

upvoted a paper 8 months ago

BOND: Aligning LLMs with Best-of-N Distillation

Paper • 2407.14622 • Published Jul 19, 2024 • 20

liked a model 8 months ago

AIDC-AI/Ovis2.5-9B

Image-Text-to-Text • 9B • Updated Feb 13 • 4.87k • 304

upvoted an article 9 months ago

Article

Training and Finetuning Reranker Models with Sentence Transformers

Mar 26, 2025

•

193

Djellal Mohamed Aniss

AI & ML interests

Recent Activity

Organizations

dmaniss's activity

Defeating the trainer-generator precision mismatch in TRL

⚡ nano-vLLM: Lightweight, Low-Latency LLM Inference from Scratch

Finally, a Replacement for BERT: Introducing ModernBERT

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

The Smol Training Playbook

Unlocking On-Policy Distillation for Any Model Family

Training and Finetuning Reranker Models with Sentence Transformers