Instructions to use nvidia/multitalker-parakeet-streaming-0.6b-v1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- NeMo
How to use nvidia/multitalker-parakeet-streaming-0.6b-v1 with NeMo:
import nemo.collections.asr as nemo_asr asr_model = nemo_asr.models.ASRModel.from_pretrained("nvidia/multitalker-parakeet-streaming-0.6b-v1") transcriptions = asr_model.transcribe(["file.wav"]) - Notebooks
- Google Colab
- Kaggle
Multilingual model planned?
Thanks for sharing! This is a really nice model - thanks for all the work which went into it ❤️
Is there a multilingual version planned (like parakeet-tdt-0.6b-v3)? I'm particularity interested in German, Spanish, Italian & French.
Best
fosple
Hi.
We will be looking into non-English ASR support in the near future.
Note that Sortformer speaker diarization model is not limited to English.
Only multitalker Parakeet ASR model is limited to English.
@taejinp Thanks for the info :) Do you know if a "parakeet-streaming-0.6b" model (w/o speaker diarization) is planned? So like nvidia/parakeet-tdt-0.6b-v3 but optimized for streaming?
so basically if one instance of this model accepts audio of 2 speakers, whoever speaks first will be transcribed only till the end of audio?