3 16 4

FlySugar

SugarVapeur

AI & ML interests

DeepLearning,NLP,RL

Recent Activity

liked a dataset about 2 months ago

ServiceNow/GroundCUA

upvoted a paper 3 months ago

SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models

upvoted a paper 3 months ago

GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts

View all activity

Organizations

None yet

upvoted 3 papers 3 months ago

SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models

Paper • 2510.08531 • Published Oct 9, 2025 • 12

GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts

Paper • 2509.25160 • Published Sep 29, 2025 • 30

EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering

Paper • 2509.25175 • Published Sep 29, 2025 • 30

upvoted a paper 4 months ago

UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning

Paper • 2509.11543 • Published Sep 15, 2025 • 47

upvoted 6 papers 5 months ago

Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models

Paper • 2508.05613 • Published Aug 7, 2025 • 17

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

Paper • 2508.05615 • Published Aug 7, 2025 • 22

OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks

Paper • 2508.05614 • Published Aug 7, 2025 • 20

Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR

Paper • 2507.15085 • Published Jul 20, 2025 • 6

GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

Paper • 2506.03143 • Published Jun 3, 2025 • 53

Hierarchical Budget Policy Optimization for Adaptive Reasoning

Paper • 2507.15844 • Published Jul 21, 2025 • 16

upvoted 6 papers 6 months ago

A Survey on (M)LLM-Based GUI Agents

Paper • 2504.13865 • Published Mar 27, 2025 • 5

Think Twice, Click Once: Enhancing GUI Grounding via Fast and Slow Systems

Paper • 2503.06470 • Published Mar 9, 2025 • 3

LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization

Paper • 2507.15758 • Published Jul 21, 2025 • 35

FlySugar

AI & ML interests

Recent Activity

Organizations

SugarVapeur's activity