Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
6
4
171
Batuhan S
Ba2han
Follow
champmit's profile picture
expronet's profile picture
mshojaei77's profile picture
24 followers
·
19 following
AI & ML interests
None yet
Recent Activity
updated
a model
2 days ago
Ba2han/experimental_auto
liked
a model
3 days ago
RunDiffusion/Juggernaut-Z-Image
reacted
to
SeaWolf-AI
's
post
with ❤️
4 days ago
🧬 Darwin Family: Zero Gradient Steps, GPQA Diamond 88.89% How far can we push LLM reasoning *without* training? Our team at VIDRAFT submitted this paper to Daily Papers yesterday, and it's currently #3. Huge thanks to everyone who upvoted — sharing the core ideas below. 🔗 Paper: https://huggingface.co/papers/2605.14386 🔗 arXiv: https://arxiv.org/abs/2605.14386 🔗 Model: https://huggingface.co/FINAL-Bench/Darwin-28B-REASON 🔗 Model: https://huggingface.co/FINAL-Bench/Darwin-28B-Opus --- TL;DR Darwin Family is a training-free evolutionary merging framework. By recombining the weight spaces of existing LLM checkpoints — with zero gradient-based training — it reaches frontier-level reasoning. - 🏆 Darwin-28B-Opus: GPQA Diamond 88.89% - 💸 Zero gradient steps — not a single B200 or H200 hour needed - 🧬 Consistent gains across 4B → 35B scale - 🔀 Cross-architecture breeding between Transformer and Mamba families - 🔁 Stable recursive multi-generation evolution #Three Core Mechanisms ① 14-dim Adaptive Merge Genome — fine-grained recombination at both component level (Attention / FFN / MLP / LayerNorm / Embedding) and block level, expanding the prior evolutionary-merge search space. ② MRI-Trust Fusion — we diagnose each layer's reasoning contribution via an **MRI (Model Reasoning Importance)** signal and fuse it with evolutionary search through a **learnable trust parameter**. Trust the diagnostic too much and search collapses; ignore it and search becomes inefficient — Darwin learns the balance from data. ③ Architecture Mapper — weight-space breeding across heterogeneous families. Attention × SSM crossover actually works. Why It Matters > Diagnose latent capabilities already encoded in open checkpoints, > and recombine them — no gradients required. Replies and critiques welcome 🙌
View all activity
Organizations
None yet
Ba2han
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
3 days ago
RunDiffusion/Juggernaut-Z-Image
Text-to-Image
•
6B
•
Updated
7 days ago
•
20.6k
•
83
liked
a model
5 days ago
HiDream-ai/HiDream-O1-Image
Image-Text-to-Image
•
9B
•
Updated
5 days ago
•
17.6k
•
407
liked
a model
10 days ago
HiDream-ai/HiDream-O1-Image-Dev
Image-Text-to-Image
•
9B
•
Updated
5 days ago
•
6.48k
•
104
liked
a dataset
13 days ago
amd/UltraChat200K-regenerated
Viewer
•
Updated
Mar 2
•
207k
•
107
•
1
liked
a model
15 days ago
HuggingFaceTB/nanowhale-100m
Text Generation
•
0.1B
•
Updated
16 days ago
•
4.12k
•
58
liked
a model
20 days ago
poolside/Laguna-XS.2
Text Generation
•
33B
•
Updated
1 day ago
•
50.2k
•
260
liked
a dataset
21 days ago
Kassadin88/Claude-Distills
Preview
•
Updated
8 days ago
•
873
•
28
liked
a dataset
25 days ago
Modotte/CodeX-2M-Thinking
Viewer
•
Updated
Feb 10
•
2.19M
•
5.92k
•
98
liked
a dataset
26 days ago
lambda/hermes-agent-reasoning-traces
Viewer
•
Updated
Apr 17
•
14.7k
•
4.78k
•
325
liked
a dataset
about 1 month ago
Jackrong/GLM-5.1-Reasoning-1M-Cleaned
Viewer
•
Updated
Apr 19
•
572k
•
11.5k
•
212
liked
3 models
about 1 month ago
AIDC-AI/Marco-Mini-Base
Text Generation
•
17B
•
Updated
Apr 3
•
95
•
8
LiquidAI/LFM2.5-VL-1.6B
Image-Text-to-Text
•
2B
•
Updated
Mar 30
•
76.4k
•
285
MiniMaxAI/MiniMax-M2.7
Text Generation
•
229B
•
Updated
about 1 month ago
•
519k
•
•
1.13k
liked
a model
about 2 months ago
netflix/void-model
Video-to-Video
•
Updated
Apr 6
•
930
liked
4 datasets
about 2 months ago
WiroAI/dolphin-r1-turkish
Viewer
•
Updated
Mar 7, 2025
•
108k
•
298
•
17
turkish-nlp-suite/InstrucTurca
Viewer
•
Updated
Aug 12, 2024
•
2.58M
•
860
•
39
boun-tabilab/Scifact-TR
Viewer
•
Updated
Dec 14, 2025
•
1.26k
•
73
•
3
boun-tabilab/turkish_parliamentary_data
Viewer
•
Updated
Mar 30
•
1.96M
•
1.37k
•
4
liked
2 models
about 2 months ago
tiiuae/Falcon-OCR
Image-to-Text
•
0.3B
•
Updated
7 days ago
•
13.2k
•
96
google/gemma-4-26B-A4B
Image-Text-to-Text
•
27B
•
Updated
Apr 2
•
222k
•
270
Load more