Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Tong Zhu's picture
11 38 58

Tong Zhu

Spico
freedom2courage's profile picture telcom's profile picture ngsnethawarya's profile picture
·
https://Spico197.github.io
  • TongZhu197
  • Spico197

AI & ML interests

Information Extraction, Mixture-of-Experts, LLM

Recent Activity

upvoted an article about 22 hours ago
Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models
authored a paper 14 days ago
LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
authored a paper 14 days ago
Iterative Value Function Optimization for Guided Decoding
View all activity

Organizations

SUDA-HUAWEI Joint Project's profile picture REx Team in Soochow University's profile picture LLaMA-MoE's profile picture MoE-Dynamic-Routing's profile picture

Spico 's datasets 7

Spico/Mirror_ACE

Preview • Updated May 6, 2025 • 1 • 1

Spico/Mirror_woACE

Preview • Updated Oct 11, 2024 • 286 • 1

Spico/dynamic-moe-sft-instructions

Preview • Updated Jun 17, 2024 • 9 • 1

Spico/Mirror

Preview • Updated Dec 14, 2023 • 17 • 1

Spico/TaskLAMA

Viewer • Updated Sep 12, 2023 • 1.61k • 60 • 2

Spico/Humback

Preview • Updated Aug 22, 2023 • 36 • 3

Spico/ChCatExt

Updated Apr 21, 2023 • 4
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs