Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
openai
/
whisper-large-v2
like
1.78k
Follow
OpenAI
29.8k
Automatic Speech Recognition
Transformers
PyTorch
google-tensorflow
TensorFlow
JAX
Safetensors
99 languages
whisper
audio
hf-asr-leaderboard
arxiv:
2212.04356
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
121
Deploy
Use this model
main
whisper-large-v2
24.7 GB
7 contributors
History:
31 commits
sanchit-gandhi
Add missing merge to tokenizer (
#100
)
ae46427
verified
almost 2 years ago
.gitattributes
1.48 kB
initial commit
about 3 years ago
README.md
19 kB
Update README.md
over 2 years ago
added_tokens.json
34.6 kB
add timestamp tokens (#64)
over 2 years ago
config.json
1.99 kB
Update config.json to suppress task tokens (#32)
almost 3 years ago
flax_model.msgpack
6.17 GB
xet
Add Flax weights
almost 3 years ago
generation_config.json
4.29 kB
Correct long-form generation config parameters 'max_initial_timestamp_index' and 'prev_sot_token_id'. (#95)
almost 2 years ago
merges.txt
494 kB
Add missing merge to tokenizer (#100)
almost 2 years ago
model.safetensors
6.17 GB
xet
Adding `safetensors` variant of this model (#57)
over 2 years ago
normalizer.json
52.7 kB
Upload processor
about 3 years ago
preprocessor_config.json
185 kB
Upload processor
about 3 years ago
pytorch_model.bin
6.17 GB
xet
Upload WhisperForConditionalGeneration
about 3 years ago
special_tokens_map.json
2.19 kB
Add missing merge to tokenizer (#100)
almost 2 years ago
tf_model.h5
6.17 GB
xet
Add TF weights (#10)
about 3 years ago
tokenizer.json
2.48 MB
Add missing merge to tokenizer (#100)
almost 2 years ago
tokenizer_config.json
283 kB
Add missing merge to tokenizer (#100)
almost 2 years ago
vocab.json
836 kB
Add missing merge to tokenizer (#100)
almost 2 years ago