- Tianjin University
- Tianjin, China
- (UTC -12:00)
- https://scholar.google.com.hk/citations?user=9BVJbdsAAAAJ&hl=zh-CN
Stars
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Diffusion-based singing voice pitch correction
[Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
LlamaVoice is a llama-based large voice generation model, providing inference and training ability.
Official data preparation scripts for the URGENT 2024 Challenge
SALMONN: Speech Audio Language Music Open Neural Network
Speech, Language, Audio, Music Processing with Large Language Model
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
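LaTeX template for Tianjin University doctoral/master's theses, revised to meet the 2021 requirements; runs directly on Overleaf. :star: A thesis written with it was successfully submitted to the Tianjin University Library for archiving! (2021.12.24)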
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
Project of Singing Voice Conversion.
The official implementation of GTCRN, an ultra-lite speech enhancement model.
VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement
ModelScope: bring the notion of Model-as-a-Service to life.
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation