Claw-Eval
non-profit
https://claw-eval.github.io/
Recent Activity
lirang04 authored a paper about 20 hours ago: Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows
tobiaslee authored a paper 4 days ago: MiMo-Audio: Audio Language Models are Few-Shot Learners
tobiaslee authored a paper 4 days ago: Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows
Papers
Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents
Team members: 5
Models: 0 (none public yet)
Datasets: 1
claw-eval/Claw-Eval • Benchmark • Updated 13 days ago • 300 • 2.91k • 21