Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Claw-Eval

non-profit
https://claw-eval.github.io/
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

lirang04  authored a paper about 20 hours ago
Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows
tobiaslee  authored a paper 4 days ago
MiMo-Audio: Audio Language Models are Few-Shot Learners
tobiaslee  authored a paper 4 days ago
Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows
View all activity

Papers

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

View all Papers

Rang Li's profile pictureYang Qibin's profile pictureLei Li's profile pictureYuanxin Liu's profile pictureBoWen Ye's profile picture

models 0

None public yet

datasets 1

claw-eval/Claw-Eval

Benchmark • Updated 13 days ago • 300 • 2.91k • 21
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs