I would suggest running 4-bit on an M1 with 8 GB of RAM, and bf16 on an M4 Pro with 64 GB. Interestingly, 4-bit holds up surprisingly well on our benchmark. We are considering larger models (14B and 30B MoE Qwen), since they handle more complex n-hop queries with less hallucination. We'll also be releasing the training and data-generation code next week.
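For a rough sense of why those configurations line up, here's a back-of-envelope sketch of weight memory (assuming a ~7B dense model for the 4-bit/bf16 case; the model sizes here are illustrative, not a statement about our exact checkpoint, and this ignores KV cache and runtime overhead):

```python
def weight_gb(params_billions: float, bits_per_weight: int) -> float:
    """Approximate weight memory in GiB: params * (bits / 8) bytes."""
    return params_billions * 1e9 * bits_per_weight / 8 / 1024**3

# Hypothetical 7B model: 4-bit quant vs bf16 (16-bit)
print(round(weight_gb(7, 4), 1))    # ~3.3 GiB -> leaves headroom on an 8 GB M1
print(round(weight_gb(7, 16), 1))   # ~13.0 GiB -> too tight for 8 GB, fine on 64 GB
print(round(weight_gb(30, 16), 1))  # a 30B model in bf16 is ~55.9 GiB, hence the 64 GB machine
```

Actual peak usage will be higher once you add the KV cache and activations, but the ratio between 4-bit and bf16 is the point.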