Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to perform diarization and transcription or aeneas to force align text to audio.
-
Updated
Apr 17, 2022 - Python
Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to perform diarization and transcription or aeneas to force align text to audio.
Automates the creation of full-text (sound and text) ebooks in epub/epub3/daisy format, the webserver/client creates smil files to sync audio with text using aeneas and converts daisy ebooks to epub3 ebooks (supporting media overlays).
This repository is purposed to track the changes towards the development of the Accessible Digital Textbook (ADT) in its initial prototype, beta protototype and final versions.
Add a description, image, and links to the aeneas topic page so that developers can more easily learn about it.
To associate your repository with the aeneas topic, visit your repo's landing page and select "manage topics."