The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 46,256 5,481 Updated Jun 24, 2024

lucidrains / audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,349 251 Updated Jan 27, 2024

AbdallahHemdan / Orchestra

Orchestra is a sheet music reader (optical music recognition (OMR) system) that converts sheet music to a machine-readable version.

Python 103 22 Updated Jun 20, 2023

sp-uhh / sgmse

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Python 448 67 Updated Aug 1, 2024

haoheliu / audioldm_eval

This toolbox aims to unify audio generation model evaluation for easier comparison.

Python 283 30 Updated Jun 2, 2024

yl4579 / PitchExtractor

Deep Neural Pitch Extractor for Voice Conversion and TTS Training

Python 112 27 Updated Aug 22, 2022

sdercolin / vlabeler

Open source voice labeling application

Kotlin 143 20 Updated Jul 31, 2024

archinetai / audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Python 1,896 165 Updated Jun 12, 2023

google-research / tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

26,156 2,181 Updated Jun 18, 2024

archinetai / audio-ai-timeline

A timeline of the latest AI models for audio generation, starting in 2023!

1,875 67 Updated Jan 4, 2024

akfamily / akshare

AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.

Python 8,884 1,828 Updated Aug 18, 2024

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,362 301 Updated Jan 4, 2024

UEhQZXI / vits_chinese

VITS Chinese, TTS Chinese, TTS Mandarin: the easiest-to-train, best-sounding speech synthesis system ever

Python 209 72 Updated Sep 27, 2021

wenet-e2e / WeTextProcessing

Text Normalization & Inverse Text Normalization

Python 439 66 Updated Aug 1, 2024

uname-yang / pysnowball

Xueqiu (Snowball) stock data interface, Python edition

Python 1,069 268 Updated Jul 15, 2024

decaywood / XueQiuSuperSpider

Xueqiu (Snowball) stock information super crawler

Java 2,156 820 Updated Mar 27, 2024

zhenghuatan / rVADfast

This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.

Python 126 21 Updated May 21, 2024

NVIDIA / CleanUNet

Official PyTorch Implementation of CleanUNet (ICASSP 2022)

Python 281 46 Updated Oct 11, 2023

YatingMusic / ddsp-singing-vocoders

Official implementation of SawSing (ISMIR'22)

Python 249 35 Updated Aug 28, 2022

tencent-ailab / FRA-RIR

Python 167 23 Updated Dec 4, 2023

crossin / avalanche

Fetch Snowball (Xueqiu) portfolios

Python 54 38 Updated Mar 27, 2016

cloudchenl

Stars

Executedone / Chinese-FastSpeech2

yunwei37 / Prompt-Engineering-Guide-zh-CN

wangxuqi / Prompt-Engineering-Guide-Chinese

xinntao / Real-ESRGAN

saifhassan / Wav2Lip-HD

ajay-sainy / Wav2Lip-GFPGAN

yang-song / score_sde_pytorch

yangdongchao / AcademiCodec

FACEGOOD / FACEGOOD-Audio2Face

facebookresearch / segment-anything