✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM

418 15 Updated Aug 12, 2024
Python 167 23 Updated Dec 4, 2023

Blind Source Separation and Dereverberation

Python 19 6 Updated Mar 26, 2021

Complex Neural Beamformer

Python 25 12 Updated Oct 15, 2020

Easy to use Beamformers for multi-channel speech separation/enhancement

Python 174 48 Updated Jan 26, 2021

Simple delay-and-sum, MVDR and CGMM-MVDR

Python 227 72 Updated Jan 19, 2019

Baseline code for deep-learning-based acoustic echo cancellation

Python 123 37 Updated May 21, 2021
Python 52 10 Updated Apr 11, 2022

Audio annotation tool

JavaScript 69 30 Updated Nov 2, 2021

End-To-End Deep Learning-based Adaptation Control for Linear Acoustic Echo Cancellation

Python 21 8 Updated Nov 17, 2023
Jupyter Notebook 17 4 Updated Aug 11, 2024

Python library for Room Impulse Response (RIR) simulation with GPU acceleration

Cuda 468 91 Updated Feb 27, 2024

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 823 95 Updated Aug 13, 2024

Chinese natural language processing datasets, material for everyday experiments. Additions and pull requests are welcome.

Python 4,202 779 Updated Nov 21, 2023

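A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and trained at low cost, including base models, vertical-domain fine-tuning and applications, datasets, and tutorials.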

14,232 1,301 Updated Jul 21, 2024

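The first open-source, commercially usable dialogue model to support bilingual Chinese-English speech-text multimodal conversation. Convenient speech input greatly improves the experience of using text-input large models, while avoiding the cumbersome pipeline of ASR-based solutions and the errors it may introduce.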

Python 503 53 Updated Sep 11, 2023

Faster Whisper transcription with CTranslate2

Python 10,921 914 Updated Aug 18, 2024

A high-resolution direction-of-arrival finding algorithm relying on finite rate of innovation sampling with a robust reconstruction algorithm.

Python 89 46 Updated Oct 16, 2018

Unofficial vits2-TTS implementation in PyTorch

Python 468 84 Updated Mar 28, 2024

Partitioned-block-based frequency-domain Kalman filter

Python 36 8 Updated Jan 14, 2023
Jupyter Notebook 22 4 Updated Apr 10, 2023

Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva

Python 78 23 Updated Aug 14, 2024

This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.

Python 148 27 Updated Jul 24, 2024

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Python 670 113 Updated Oct 23, 2023

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Python 927 174 Updated Dec 22, 2023

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Python 2,483 294 Updated Jul 8, 2024

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,152 315 Updated Jan 22, 2024

Acoustic Echo Cancellation with Neural Kalman Filtering

HTML 214 57 Updated Feb 21, 2023

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,391 422 Updated Aug 8, 2024