darylmooreNC's Collections

Large Language Models

updated 1 day ago

  • Universal Deep Research: Bring Your Own Model and Strategy

    Paper • 2509.00244 • Published Aug 29, 2025 • 13

  • The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

    Paper • 2509.02547 • Published Sep 2, 2025 • 227

  • Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

    Paper • 2510.00515 • Published Oct 1, 2025 • 39

  • DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

    Paper • 2509.25454 • Published Sep 29, 2025 • 140

  • Demystifying Reinforcement Learning in Agentic Reasoning

    Paper • 2510.11701 • Published Oct 13, 2025 • 31

  • deepseek-ai/DeepSeek-Math-V2

    Text Generation • 685B • Updated Nov 27, 2025 • 5.43k • 674

  • T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground

    Paper • 2512.10430 • Published 21 days ago • 113

  • Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

    Paper • 2512.23447 • Published 3 days ago • 83