Skip to content
View linan2's full-sized avatar

Organizations

@group122

Block or report linan2

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

GLM-4-Voice | 端到端中英语音对话模型

Python 2,003 152 Updated Oct 31, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 6,262 703 Updated Nov 1, 2024

田柯宇 (Tian Keyu)恶意攻击集群事件的证据揭露

572 39 Updated Oct 20, 2024

Diffusion-based singing voice pitch correction

Python 91 14 Updated Sep 20, 2024

[Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement

Python 33 1 Updated Oct 17, 2024

Target Speaker Extraction Toolkit

Python 96 12 Updated Oct 31, 2024

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Python 677 117 Updated Mar 8, 2024

LlamaVoice is a llama-based large voice generation model, providing inference and training ability.

Python 217 11 Updated Aug 26, 2024
Python 90 15 Updated Apr 24, 2023

Official data preparation scripts for the URGENT 2024 Challenge

Python 63 5 Updated Aug 12, 2024
Python 4 Updated Feb 7, 2024

SALMONN: Speech Audio Language Music Open Neural Network

Python 1,036 80 Updated Oct 31, 2024

Speech, Language, Audio, Music Processing with Large Language Model

Python 562 52 Updated Nov 1, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Jupyter Notebook 6,976 513 Updated Nov 1, 2024

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 712 117 Updated Oct 22, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 14,974 1,378 Updated Oct 15, 2024

PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio

Python 148 14 Updated Jul 14, 2023
1 Updated Apr 26, 2024

TDOA based on GCC-PHAT

Python 169 65 Updated Apr 16, 2024

DBPNet model

Python 32 3 Updated Jun 5, 2024

天大博士/硕士学位论文Latex模板,根据2021年版要求修改,可直接在Overleaf上运行。:star:所写的论文成功提交天津大学图书馆存档!(2021.12.24)

TeX 316 60 Updated Aug 26, 2022

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations

C 120 9 Updated Mar 6, 2024
Python 147 15 Updated Nov 1, 2024

Project of Singing Voice Conversion.

Python 14 3 Updated Oct 27, 2023

TensorFlow 1.4.0 installed version.

2 Updated Feb 2, 2024

The official implementation of GTCRN, an ultra-lite speech enhancement model.

Python 204 34 Updated Aug 9, 2024

VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement

Python 37 8 Updated Sep 12, 2024

ModelScope: bring the notion of Model-as-a-Service to life.

Python 6,971 718 Updated Nov 1, 2024

OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation

C 31,150 7,857 Updated Aug 3, 2024
Next