speech-synthesis

Here are 1,242 public repositories matching this topic...

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

python text-to-speech deep-learning speech pytorch tts speech-synthesis voice-conversion vocoder voice-synthesis tacotron voice-cloning speaker-encodings melgan speaker-encoder multi-speaker-tts glow-tts hifigan tts-model

Updated Aug 16, 2024
Python

leon-ai / leon

Star

🧠 Leon is your open-source personal assistant.

Updated Sep 5, 2024
TypeScript

NVIDIA / DeepLearningExamples

Star

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

nlp translation computer-vision deep-learning mxnet tensorflow pytorch speech-synthesis speech-recognition forecasting drug-discovery recommender-systems paddlepaddle tensorflow2 large-language-models

Updated Aug 12, 2024
Jupyter Notebook

NVIDIA / NeMo

Star

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translation tts speech-synthesis neural-networks deeplearning speaker-recognition asr multimodal speech-translation large-language-models speaker-diariazation generative-ai

Updated Nov 15, 2024
Python

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Updated Nov 14, 2024
Python

voicepaw / so-vits-svc-fork

Star

so-vits-svc fork with realtime support, improved interface and more features.

lightning deep-learning realtime pytorch speech-synthesis gan hacktoberfest voice-conversion voice-changer pytorch-lightning hubert vits sovits so-vits-svc softvc contentvec

Updated Nov 11, 2024
Python

espnet / espnet

Star

End-to-End Speech Processing Toolkit

text-to-speech deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated Nov 14, 2024
Python

open-mmlab / Amphion

Star

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

text-to-speech audit speech-synthesis audio-synthesis music-generation voice-conversion vocoder emilia text-to-audio fastspeech2 vits audio-generation singing-voice-conversion vall-e audioldm naturalspeech2 maskgct

Updated Nov 1, 2024
Jupyter Notebook

netease-youdao / EmotiVoice

Star

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

python text-to-speech ai deep-learning style prompt speech emotion pytorch tts speech-synthesis multi-speaker emotivoice

Updated Aug 13, 2024
Python

jaywalnut310 / vits

Star

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

text-to-speech deep-learning pytorch tts speech-synthesis

Updated Dec 6, 2023
Python

rhasspy / piper

Star

A fast, local neural text to speech system

text-to-speech tts speech-synthesis

Updated Oct 21, 2024
C++

rany2 / edge-tts

Star

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

text-to-speech tts speech-synthesis

Updated Nov 11, 2024
Python

snakers4 / silero-models

Star

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Updated Oct 18, 2023
Jupyter Notebook

yl4579 / StyleTTS2

Star

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

text-to-speech deep-learning pytorch tts speech-synthesis gan speaker-adaptation adversarial-training diffusion-models wavlm latent-diffusion latent-diffusion-models

Updated Aug 10, 2024
Python

MoonInTheRiver / DiffSinger

Star

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

text-to-speech midi tts speech-synthesis diffusion-model singing-voice singing-synthesis singing-voice-synthesis singing-voice-database aaai2022 diffusion-speedup

Updated May 2, 2023
Python

espeak-ng / espeak-ng

Star

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

android text-to-speech speech-synthesis espeak espeak-ng

Updated Oct 30, 2024
C

collabora / WhisperSpeech

Star

An Open Source text-to-speech system built by inverting Whisper.

pytorch tts speech-synthesis

Updated Jun 18, 2024
Jupyter Notebook

metavoiceio / metavoice-src

Star

Foundational model for human-like, expressive TTS

text-to-speech ai deep-learning speech pytorch tts speech-synthesis voice-clone zero-shot-tts

Updated Jul 30, 2024
Python

TensorSpeech / TensorFlowTTS

Star

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Updated Jul 5, 2024
Python

huggingface / speech-to-speech

Star

Speech To Speech: an effort for an open-sourced and modular GPT4-o

python machine-learning ai speech speech-synthesis assistant speech-to-text language-model speech-translation

Updated Oct 31, 2024
Python

Improve this page

Add a description, image, and links to the speech-synthesis topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-synthesis topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-synthesis

Here are 1,242 public repositories matching this topic...

coqui-ai / TTS

leon-ai / leon

NVIDIA / DeepLearningExamples

NVIDIA / NeMo

PaddlePaddle / PaddleSpeech

voicepaw / so-vits-svc-fork

espnet / espnet

open-mmlab / Amphion

netease-youdao / EmotiVoice

jaywalnut310 / vits

rhasspy / piper

rany2 / edge-tts

snakers4 / silero-models

yl4579 / StyleTTS2

MoonInTheRiver / DiffSinger

espeak-ng / espeak-ng

collabora / WhisperSpeech

metavoiceio / metavoice-src

TensorSpeech / TensorFlowTTS

huggingface / speech-to-speech

Improve this page

Add this topic to your repo