hesamation/Qwen3.6-35B-A3B-Claude-4.6-Opus-Reasoning-Distilled-GGUF Text Generation • 35B • Updated 6 days ago • 83.7k • 176
hesamation/Qwen3.6-35B-A3B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 36B • Updated 6 days ago • 4.45k • 63
PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning Paper • 2601.05593 • Published Jan 9 • 86
Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning Paper • 2601.06943 • Published Jan 11 • 216
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment Article • Feb 11, 2025 • 119
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published Dec 2, 2025 • 267