3 74 301

Kristoffer Rolf Deinoff

gatepoet

AI & ML interests

None yet

Recent Activity

liked a model about 10 hours ago

apple/Sharp

liked a model 2 days ago

unsloth/Qwen-Image-2512-GGUF

liked a model 2 days ago

Qwen/Qwen-Image-2512

View all activity

Organizations

None yet

upvoted a paper 20 days ago

Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed

Paper • 2512.14067 • Published 23 days ago • 13

upvoted an article about 1 month ago

Article

An Edge-First Generalized LLM LoRA Fine-Tuning Framework for Heterogeneous GPUs

Dec 1, 2025

•

upvoted 2 papers about 1 month ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 245

GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms

Paper • 2511.17592 • Published Nov 17, 2025 • 118

upvoted 2 papers about 2 months ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 132

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 211

upvoted a paper 2 months ago

Video Reasoning without Training

Paper • 2510.17045 • Published Oct 19, 2025 • 7

upvoted 2 papers 3 months ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26, 2025 • 139

Rolling Forcing: Autoregressive Long Video Diffusion in Real Time

Paper • 2509.25161 • Published Sep 29, 2025 • 25

upvoted 2 papers 5 months ago

SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens

Paper • 2508.05305 • Published Aug 7, 2025 • 46

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 316

upvoted 2 papers 6 months ago

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

Paper • 2507.07104 • Published Jul 9, 2025 • 45

High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning

Paper • 2507.05920 • Published Jul 8, 2025 • 11

upvoted a paper 7 months ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17, 2025 • 45

upvoted 2 papers 8 months ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12, 2025 • 82

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

Paper • 2505.02625 • Published May 5, 2025 • 22

upvoted a collection 8 months ago

DeepSeek-Prover

Collection

DeepSeek-Prover-Series • 10 items • Updated Nov 27, 2025 • 60

upvoted 2 papers 9 months ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published Apr 15, 2025 • 63

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Paper • 2504.08837 • Published Apr 10, 2025 • 43

upvoted an article 10 months ago

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

Mar 26, 2025

•

177