Chan Hee Song

chanhee-luke

https://chanh.ee

AI & ML interests

Multimodal Agents (Robotics, Web, GUI)

Recent Activity

authored a paper 13 days ago

Watch and Learn: Learning to Use Computers from Online Videos

authored a paper 13 days ago

SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL

upvoted a paper 13 days ago

Watch and Learn: Learning to Use Computers from Online Videos

authored 2 papers 13 days ago

Watch and Learn: Learning to Use Computers from Online Videos

Paper • 2510.04673 • Published Oct 6 • 11

SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL

Paper • 2512.04069 • Published 14 days ago • 21

authored 2 papers 6 months ago

An Illusion of Progress? Assessing the Current State of Web Agents

Paper • 2504.01382 • Published Apr 2 • 4

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Paper • 2506.21506 • Published Jun 26 • 51

authored 4 papers 8 months ago

BIOCLIP: A Vision Foundation Model for the Tree of Life

Paper • 2311.18803 • Published Nov 30, 2023 • 1

Dual-View Visual Contextualization for Web Navigation

Paper • 2402.04476 • Published Feb 6, 2024

LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models

Paper • 2212.04088 • Published Dec 8, 2022

RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics

Paper • 2411.16537 • Published Nov 25, 2024

authored a paper over 1 year ago

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

Paper • 2408.06327 • Published Aug 12, 2024 • 17