Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models Paper • 2512.13607 • Published 18 days ago • 27
IQuestLab/IQuest-Coder-V1-40B-Loop-Instruct Text Generation • 40B • Updated about 5 hours ago • 1.88k • 130
Nested Browser-Use Learning for Agentic Information Seeking Paper • 2512.23647 • Published 4 days ago • 17
Evaluating Gemini Robotics Policies in a Veo World Simulator Paper • 2512.10675 • Published 22 days ago • 16
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning Paper • 2512.07461 • Published 25 days ago • 74
Want to get started with fine-tuning but don’t know where to begin? 🤓☝️
We’re expanding our collection of beginner-friendly free Colab notebooks so you can learn and fine-tune models using TRL at no cost.
🔬 Check out the full list of free notebooks: https://huggingface.co/docs/trl/main/en/example_overview#notebooks
🔬 If you want more advanced content, we also have a lot to cover in the community tutorials: https://huggingface.co/docs/trl/community_tutorials
And now the obvious question: what would you like us to add next?