Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
SakanaAI 's Collections
Continuous Thought Machines
Reinforcement Learning Teachers
TinySwallow
CycleQD

TinySwallow

updated Jan 30, 2025

Compact Japanese models trained with "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"

Upvote
17

  • SakanaAI/TinySwallow-1.5B

    Text Generation • 2B • Updated Jan 30, 2025 • 34.4k • • 35

  • SakanaAI/TinySwallow-1.5B-Instruct

    Text Generation • 2B • Updated Jan 30, 2025 • 3.41k • • 56

  • SakanaAI/TinySwallow-1.5B-Instruct-q4f32_1-MLC

    Text Generation • Updated Jan 30, 2025 • 3

  • SakanaAI/TinySwallow-1.5B-Instruct-GGUF

    Text Generation • 2B • Updated Jan 30, 2025 • 539 • 26

  • TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

    Paper • 2501.16937 • Published Jan 28, 2025 • 7
Upvote
17
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs