Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Alex Shaw's picture
127 1 1

Alex Shaw

alexgshaw
ryanmarten's profile picture lincolnhuj's profile picture evalstate's profile picture
·
https://www.tbench.ai/
  • alexgshaw
  • alexgshaw
  • alexgshaw

AI & ML interests

None yet

Recent Activity

new activity 42 minutes ago
harborframework/terminal-bench-2-leaderboard:Fix: add missing task_checksum field to all 89 result.json files
new activity 44 minutes ago
harborframework/terminal-bench-2-leaderboard:Add 100xflux (Claude Sonnet 4.6) submission to Terminal-Bench 2.0 leaderboard
new activity about 8 hours ago
harborframework/terminal-bench-2-leaderboard:Add Simplai Agent (Claude Sonnet 4.6) submission - 52.13%
View all activity

Organizations

Perception, Control, and Cognition Lab's profile picture  ML Foundations Development's profile picture Laude Institute's profile picture DCAgent's profile picture Harbor's profile picture Terminal-Bench's profile picture Harbor Framework's profile picture

spaces 1

Running
1

Terminal Bench Importer

🚀

Validate and import Terminal‑Bench leaderboard submissions

18 days ago

models 1

alexgshaw/hyperpartisan-classifier

Text Classification • Updated Feb 23, 2023 • 5

datasets 4

alexgshaw/natural-instructions-context-dataset-text-embedding-3-large-40-dim

Updated Aug 23, 2024 • 4

alexgshaw/natural-instructions-prompt-rewards

Viewer • Updated May 3, 2024 • 7.14k • 13

alexgshaw/llama-65b-tokenized-wikitext-2-v1

Viewer • Updated Jul 7, 2023 • 36.7k • 4

alexgshaw/llama-13b-tokenized-wikitext-2-v1

Viewer • Updated Jul 7, 2023 • 36.7k • 3
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs