Text-To-Speech myshell-ai/OpenVoice Text-to-Speech • Updated Dec 24, 2024 • 486 coqui/XTTS-v2 Text-to-Speech • Updated Dec 11, 2023 • 4.81M • 3.29k suno/bark Text-to-Speech • Updated Oct 4, 2023 • 15.6k • 1.5k microsoft/speecht5_tts Text-to-Speech • Updated Nov 8, 2023 • 70.9k • 821
Speech-To-Text jonatasgrosman/wav2vec2-large-xlsr-53-english Automatic Speech Recognition • 0.3B • Updated Mar 25, 2023 • 90.9k • 475 nvidia/parakeet-rnnt-1.1b Automatic Speech Recognition • Updated Nov 27, 2025 • 750 • 163 facebook/seamless-m4t-v2-large Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 230k • 940
jonatasgrosman/wav2vec2-large-xlsr-53-english Automatic Speech Recognition • 0.3B • Updated Mar 25, 2023 • 90.9k • 475
Text-To-Speech myshell-ai/OpenVoice Text-to-Speech • Updated Dec 24, 2024 • 486 coqui/XTTS-v2 Text-to-Speech • Updated Dec 11, 2023 • 4.81M • 3.29k suno/bark Text-to-Speech • Updated Oct 4, 2023 • 15.6k • 1.5k microsoft/speecht5_tts Text-to-Speech • Updated Nov 8, 2023 • 70.9k • 821
Speech-To-Text jonatasgrosman/wav2vec2-large-xlsr-53-english Automatic Speech Recognition • 0.3B • Updated Mar 25, 2023 • 90.9k • 475 nvidia/parakeet-rnnt-1.1b Automatic Speech Recognition • Updated Nov 27, 2025 • 750 • 163 facebook/seamless-m4t-v2-large Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 230k • 940
jonatasgrosman/wav2vec2-large-xlsr-53-english Automatic Speech Recognition • 0.3B • Updated Mar 25, 2023 • 90.9k • 475
elmoghany/Videos-Dataset-For-LLMs-RAG-That-Require-Audio-Vidoes-And-Text Updated Sep 14, 2025 • 2.83k • 1