πŸ—£οΈ Open TTS Tracker

A one stop shop to track all open-access/ source TTS models as they come out. Feel free to make a PR for all those that aren’t linked here.

This is aimed as a resource to increase awareness for these models and to make it easier for researchers, developers, and enthusiasts to stay informed about the latest advancements in the field.

[!NOTE]
This repo will only track open source/access codebase TTS models. More motivation for everyone to open-source! πŸ€—











Name ↕ GitHub
πŸ’»
Weights
βš–
License
🧾
Fine-tune
πŸ‘€
Languages Paper
πŸ“„
Demo
πŸ—£οΈ
Issues
πŸ“š
Processor
⚑
Word pronunciation adjustment
πŸ‘„
Insta-clone
πŸ‘₯
Emotional control
🎭
Prompting
πŸ“–
Streaming support
🌊
Audio control
🎚
S2S support
🦜
XTTS Repo πŸ€— Hub CPML Yes Multilingual Technical notes πŸ€— Space                  
TorToiSe TTS Repo πŸ€— Hub Apache 2.0 Yes English Technical report πŸ€— Space                  
VITS/ MMS-TTS Repo πŸ€— Hub / MMS Apache 2.0 Yes English Paper πŸ€— Space                  
Pheme Repo πŸ€— Hub CC-BY Yes English Paper πŸ€— Space                  
OpenVoice Repo πŸ€— Hub CC-BY-NC 4.0 No ZH + EN Paper πŸ€— Space                  
IMS-Toucan Repo GH release Apache 2.0 Yes Multilingual Paper πŸ€— Space                  
Matcha-TTS Repo GDrive MIT Yes English Paper πŸ€— Space GPL-licensed phonemizer                
pflowTTS Unofficial Repo GDrive MIT Yes English Paper Not Available GPL-licensed phonemizer                
StyleTTS 2 Repo πŸ€— Hub MIT Yes English Paper πŸ€— Space GPL-licensed phonemizer                
VALL-E Unofficial Repo Not Available MIT Yes NA Paper Not Available                  
HierSpeech++ Repo GDrive CC-BY-NC-SA 4.0 No KR + EN Paper πŸ€— Space                  
Bark Repo πŸ€— Hub MIT No Multilingual Paper πŸ€— Space                  
EmotiVoice Repo GDrive Apache 2.0 Yes ZH + EN Not Available Not Available Separate GUI agreement                
Amphion Repo πŸ€— Hub MIT No Multilingual Paper πŸ€— Space                  
xVASynth Repo GH commit GPL-3.0 Yes Multilingual Paper Not Available Copyright materials used for training. CPU / CUDA ARPAbet   4-type
πŸ˜‘πŸ˜ƒ
😭😯 per-phoneme
    speed / pitch / energy
🎚
per-phoneme
🦜
OverFlow TTS Repo GitHub MIT Yes English Paper GH Pages                  
Neural-HMM TTS Repo GitHub MIT Yes English Paper GH Pages                  
Tacotron 2 Unofficial Repo GDrive BSD-3 Yes English Paper Webpage                  
Glow-TTS Repo GDrive MIT Yes English Paper GH Pages                  
Silero Repo GH links CC BY-NC-SA No EM + DE + ES + EA Not Available Not Available Non Commercial                
MahaTTS Repo πŸ€— Hub Apache 2.0 No English, Hindi, Indian English, Bengali, Tamil, Telugu, Punjabi, Marathi, Gujarati, Assamese Not Available Recordings, Colab