naver-hyperclovax/HyperCLOVAX-SEED-Think-32B Text Generation β’ 33B β’ Updated 3 days ago β’ 7.1k β’ 79
view article Article How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day 23 days ago β’ 46
Flash Sparse Attention: An Alternative Efficient Implementation of Native Sparse Attention Kernel Paper β’ 2508.18224 β’ Published Aug 25, 2025 β’ 1
MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation Paper β’ 2511.09611 β’ Published Nov 12, 2025 β’ 68
moonshotai/Kimi-Linear-48B-A3B-Instruct Text Generation β’ 49B β’ Updated 16 days ago β’ 91.9k β’ 514
KORMo: Korean Open Reasoning Model for Everyone Paper β’ 2510.09426 β’ Published Oct 10, 2025 β’ 83