# text-to-speech

Here are 3,060 public repositories matching this topic...

babysor / MockingBird

🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time

text-to-speech ai deep-learning speech pytorch tts

Updated Jul 6, 2024
Python

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

python text-to-speech deep-learning speech pytorch tts speech-synthesis voice-conversion vocoder voice-synthesis tacotron voice-cloning speaker-encodings melgan speaker-encoder multi-speaker-tts glow-tts hifigan tts-model

Updated Jul 25, 2024
Python

RVC-Boss / GPT-SoVITS

Just 1 minute of voice data is enough to train a good TTS model (few-shot voice cloning).

text-to-speech tts voice-cloning vits voice-clone voice-cloneai

Updated Aug 3, 2024
Python

2noise / ChatTTS

A generative speech model for daily dialogue.

python chat agent text-to-speech torch tts english chinese gpt natural-language-inference english-language chinese-language torchaudio llm chatgpt llm-agent chattts

Updated Aug 3, 2024
Python

myshell-ai / OpenVoice

Instant voice cloning by MyShell.

text-to-speech tts voice-clone zero-shot-tts

Updated Jul 23, 2024
Python

leon-ai / leon

🧠 Leon is your open-source personal assistant.

Updated Aug 1, 2024
TypeScript

mozilla / TTS

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

python text-to-speech deep-learning speech pytorch tts vocoder tacotron tensorflow2 tacotron2 melgan speaker-encoder dataset-analysis glow-tts multiband-melgan gantts

Updated Nov 9, 2023
Jupyter Notebook

jianchang512 / pyvideotrans

Translate a video from one language to another and add dubbing.

text-to-speech speech-to-text video-transition

Updated Aug 4, 2024
Python

espnet / espnet

End-to-End Speech Processing Toolkit

text-to-speech deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated Aug 3, 2024
Python

Plachtaa / VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io

text-to-speech tts gpt transformer-architecture emotional-speech voice-clone vall-e

Updated Feb 11, 2024
Python

netease-youdao / EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

python text-to-speech ai deep-learning style prompt speech emotion pytorch tts speech-synthesis multi-speaker emotivoice

Updated Jul 12, 2024
Python

jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

text-to-speech deep-learning pytorch tts speech-synthesis

Updated Dec 6, 2023
Python

rhasspy / piper

A fast, local neural text to speech system

text-to-speech tts speech-synthesis

Updated Jul 31, 2024
C++

rany2 / edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

text-to-speech tts speech-synthesis

Updated Jul 3, 2024
Python

snakers4 / silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Updated Oct 18, 2023
Jupyter Notebook

yl4579 / StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

text-to-speech deep-learning pytorch tts speech-synthesis gan speaker-adaptation adversarial-training diffusion-models wavlm latent-diffusion latent-diffusion-models

Updated Jul 31, 2024
Python

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

text-to-speech audit speech-synthesis audio-synthesis music-generation voice-conversion text-to-audio fastspeech2 vits hifi-gan audio-generation singing-voice-conversion vall-e audioldm naturalspeech2

Updated Jul 26, 2024
Python

MoonInTheRiver / DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

text-to-speech midi tts speech-synthesis diffusion-model singing-voice singing-synthesis singing-voice-synthesis singing-voice-database aaai2022 diffusion-speedup

Updated May 2, 2023
Python

myshell-ai / MeloTTS

High-quality multilingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese, and Korean.

multilingual text-to-speech japanese tts english spanish chinese korean french

Updated Aug 3, 2024
Python

espeak-ng / espeak-ng

eSpeak NG is an open-source speech synthesizer that supports more than a hundred languages and accents.

android text-to-speech speech-synthesis espeak espeak-ng

Updated Aug 3, 2024
C
