TPTT: Transforming Pretrained Transformer into Titans Paper • 2506.17671 • Published Jun 21 • 5
Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated Feb 2 • 61
Cosmos Collection ⚠️ This collection is archived. See https://huggingface.co/collections/nvidia/cosmos-predict25 • 31 items • Updated 6 days ago • 299
Deepseek V3 (All Versions) Collection Deepseek-V3-0324 and V3, available in original and Dynamic GGUF formats, with support for 2-8-bit quantized versions. • 7 items • Updated 5 days ago • 39
LLM Reasoning Papers Collection Papers to improve reasoning capabilities of LLMs • 20 items • Updated Jan 15 • 123
Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs Paper • 2402.14740 • Published Feb 22, 2024 • 15
CodeFusion: A Pre-trained Diffusion Model for Code Generation Paper • 2310.17680 • Published Oct 26, 2023 • 73