31 21 658

Yazan Agha-Schrader PRO

phi0112358

AI & ML interests

Brain, EEG, BCI, consciousness, autism, octopus, automation, a.i., etymology, numbers, spirituality, astronomy

Recent Activity

liked a model about 8 hours ago

guiferrarib/genesis-152m-instruct

liked a model 1 day ago

unsloth/Z-Image-Turbo-GGUF

liked a model 8 days ago

ggml-org/functiongemma-270m-it-GGUF

View all activity

Organizations

upvoted an article 22 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

23 days ago

•

535

upvoted an article 25 days ago

Article

Norm-Preserving Biprojected Abliteration

Nov 6

•

upvoted 3 collections 4 months ago

upvoted a paper 4 months ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2 • 147

upvoted a collection 6 months ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 177

upvoted 2 papers 6 months ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 138

Turning large language models into cognitive models

Paper • 2306.03917 • Published Jun 6, 2023 • 5

upvoted 3 collections 7 months ago

Unsloth Dynamic 2.0 Quants

Collection

New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 66 items • Updated 3 days ago • 283

Granite Quantized Models

Collection

Quantized versions of IBM Granite models. Licensed under the Apache 2.0 license. • 44 items • Updated Nov 21 • 29

Text-to-Speech (TTS) models

Collection

A collection of 4-bit, Dynamic 4-bit and 16-bit voice models including Sesame-CSM, OpenAI's Whisper, Orpheus. Fine-tune them with Unsloth now! • 16 items • Updated 3 days ago • 26

upvoted a collection 8 months ago

Qwen3

Collection

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 79 items • Updated 3 days ago • 250

upvoted a collection 9 months ago

Gemma 3 QAT

Collection

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated Jul 10 • 211

upvoted a collection 12 months ago

GGUF LoRA adapters

Collection

Adapters extracted from fine tuned models, using mergekit-extract-lora • 16 items • Updated 11 days ago • 4

upvoted a collection about 1 year ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 670

upvoted 2 collections over 1 year ago

Molmo

Collection

Artifacts for open multimodal language models. • 5 items • Updated 4 days ago • 309

✂️ Abliteration

Collection

Uncensored models using abliteration. See this article for more information: huggingface.co/blog/mlabonne/abliteration • 34 items • Updated Jul 25 • 139

upvoted a paper over 1 year ago

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Paper • 2405.04434 • Published May 7, 2024 • 24

upvoted an article over 1 year ago

Article

Uncensor any LLM with abliteration

Jun 13, 2024

•

745

Yazan Agha-Schrader PRO

AI & ML interests

Recent Activity

Organizations

phi0112358's activity

We Got Claude to Fine-Tune an Open Source LLM

Norm-Preserving Biprojected Abliteration

Uncensor any LLM with abliteration