🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Updated Aug 16, 2024 - Python
🔊 A comprehensive list of open-source datasets for voice and sound computing (95 datasets).
Machine learning based speech synthesis Electron app, with voices from specific characters from video games
This repository has an implementation of "Neural Voice Cloning With Few Samples"
A full reproduction of the StarGAN-VC paper, with stable training and better audio quality.
Desktop application for neural speech synthesis written in C
Manim plugin for all things voiceover
This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that makes it possible to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to…
A programmable version of Neil Thapen's Pink Trombone
Automatic video translation: a video translator that can translate hard-coded video subtitles, translate and dub videos, and remove subtitles or any other on-screen text. It also translates video subtitles and backfills their original styling.
Welcome to the Microsoft Voice Assistant samples repository! Here you will find samples to help you get started building a client application for your bot or Custom Command service. You will also be able to easily deploy a working Custom Command-based Voice Assistant to your own Azure subscription.
Voice stress analysis (VSA) aims to differentiate between stressed and non-stressed outputs in response to stimuli (e.g., questions posed), with high stress seen as an indication of deception. In this work, we propose a deep learning-based psychological stress detection model using speech signals. With increasing demands for communication betwee…
TTS models for Arabic (Tacotron2, FastPitch)
A non-official Eleven Labs voice synthesis client for Unity (UPM)
lessampler is a Singing Voice Synthesizer
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
A Non-Official ElevenLabs RESTful API Client for dotnet
💬 "Realtime" voice transcription and cloning using ElevenLabs's API.
Klatt formant synthesizer
Text prompt steered synthetic audio generators