benchmark
lmlmcat/cmmlu • Updated Jul 13, 2023 • 10.7k • 73
nlp-waseda/JMMLU • Updated Feb 27, 2024 • 292 • 10
HAERAE-HUB/KMMLU • Updated Mar 5, 2024 • 244k • 14k • 95
openai/openai_humaneval • Updated Jan 4, 2024 • 164 • 126k • 360