Voice Cloning App

A Python/Pytorch app for easily synthesising human voices

Documentation

Discord Server

Video guide

Voice Sharing Hub

FAQ's

System Requirements

Windows 10 or Ubuntu 20.04 operating system
5GB Disk space
NVIDIA GPU with at least 4GB of memory & driver version 456.38 (optional)

Key features

Automatic dataset generation (with support for subtitles and audiobooks)
Additional language support
Local & remote training
Easy train start/stop
Data importing/exporting
Multi GPU support

Manual Guides

Future Improvements

Add support for Talknet
Add GTA alignment for Hifi-gan
Improved batch size estimation
AMD GPU support

Other resources

Remote training notebook
Try out existing voices at uberduck.ai and Vocodes
Youtube data fetching (created by Diskr33t#5880)
Synthesize in Colab (created by mega b#6696)
Generate youtube transcription (created by mega b#6696)
Wit.ai transcription

Acknowledgements

This project uses a reworked version of Tacotron2. All rights for belong to NVIDIA and follow the requirements of their BSD-3 licence.

Additionally, the project uses DSAlign, Silero, DeepSpeech & hifi-gan.

Thank you to Dr. John Bustard at Queen's University Belfast for his support throughout the project.

Supported by uberduck.ai, reach out to them for live model hosting.

Also a big thanks to the members of the VocalSynthesis subreddit for their feedback.

Finally thank you to everyone raising issues and contributing to the project.

Name		Name	Last commit message	Last commit date
Latest commit History 572 Commits
.circleci		.circleci
alphabets		alphabets
application		application
dataset		dataset
docs		docs
extra-hooks		extra-hooks
synthesis		synthesis
tests		tests
training		training
.coveragerc		.coveragerc
.gitignore		.gitignore
.readthedocs.yml		.readthedocs.yml
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
build_exe.py		build_exe.py
faqs.md		faqs.md
install.md		install.md
install.sh		install.sh
main.py		main.py
maintenance.md		maintenance.md
mkdocs.yml		mkdocs.yml
preview.png		preview.png
pyproject.toml		pyproject.toml
requirements-cpu.txt		requirements-cpu.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Voice Cloning App

Documentation

Discord Server

Video guide

Voice Sharing Hub

FAQ's

System Requirements

Key features

Manual Guides

Future Improvements

Other resources

Acknowledgements

About

Releases 41

Packages

Contributors 11

Languages

License

voice-cloning-app/Voice-Cloning-App

Folders and files

Latest commit

History

Repository files navigation

Voice Cloning App

System Requirements

Key features

Manual Guides

Future Improvements

Other resources

Acknowledgements

About

Topics

Resources

License

Stars

Watchers

Forks

Languages