Skip to content
View cloudchenl's full-sized avatar
Block or Report

Block or report cloudchenl

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏

Python 232 38 Updated Sep 10, 2023

🐙 关于提示词工程(prompt)的指南、论文、讲座、笔记本和资源大全(自动持续更新)

Jupyter Notebook 424 38 Updated Oct 25, 2023

Prompt工程师指南,源自英文版,但增加了AIGC的prompt部分,为了降低同学们的学习门槛,翻译更新

Jupyter Notebook 1,018 113 Updated Mar 29, 2023

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 27,393 3,438 Updated Aug 6, 2024

High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN

Python 337 77 Updated Mar 27, 2024

High quality Lip sync

Python 980 253 Updated Jul 30, 2024

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 1,637 301 Updated Jul 14, 2024

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Python 550 79 Updated Dec 27, 2023

http://www.facegood.cc

Python 1,781 356 Updated Feb 8, 2023

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 46,256 5,481 Updated Jun 24, 2024

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,349 251 Updated Jan 27, 2024

Orchestra is a sheet music reader (optical music recognition (OMR) system) that converts sheet music to a machine-readable version.

Python 103 22 Updated Jun 20, 2023

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Python 448 67 Updated Aug 1, 2024

This toolbox aims to unify audio generation model evaluation for easier comparison.

Python 283 30 Updated Jun 2, 2024

Deep Neural Pitch Extractor for Voice Conversion and TTS Training

Python 112 27 Updated Aug 22, 2022

Open source voice labeling application

Kotlin 143 20 Updated Jul 31, 2024

Audio generation using diffusion models, in PyTorch.

Python 1,896 165 Updated Jun 12, 2023

A playbook for systematically maximizing the performance of deep learning models.

26,156 2,181 Updated Jun 18, 2024

A timeline of the latest AI models for audio generation, starting in 2023!

1,875 67 Updated Jan 4, 2024

AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库

Python 8,884 1,828 Updated Aug 18, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,362 301 Updated Jan 4, 2024

vits chinese, tts chinese, tts mandarin 史上训练最简单,音质最好的语音合成系统

Python 209 72 Updated Sep 27, 2021

Text Normalization & Inverse Text Normalization

Python 439 66 Updated Aug 1, 2024

雪球股票数据接口 python edition

Python 1,069 268 Updated Jul 15, 2024

雪球股票信息超级爬虫

Java 2,156 820 Updated Mar 27, 2024

This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.

Python 126 21 Updated May 21, 2024

Official PyTorch Implementation of CleanUNet (ICASSP 2022)

Python 281 46 Updated Oct 11, 2023

Official implementation of SawSing (ISMIR'22)

Python 249 35 Updated Aug 28, 2022
Python 167 23 Updated Dec 4, 2023

fetch snowball portfolio

Python 54 38 Updated Mar 27, 2016
Next