Stars
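A lightweight, high-performance Kalman Filter library in C, C++, and MATLAB, offering superior numerical stability and efficiency with minimal dependencies.
Hello 算法 (Hello Algo): a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports code in Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese editions are updated in sync; the English version is in progress.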
Production first, nn-based on-device signal processing toolkit.
MLNLP: Notes for MIT-Linear-Algebra
Acoustic Echo Cancellation with Neural Kalman Filtering
Think DSP: Digital Signal Processing in Python, by Allen B. Downey.
Notebooks for "Python for Signal Processing" book
Impulse response generation based on state-of-the-art geometric sound propagation engine.
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
The Easy Communications (EasyCom) dataset is a world-first dataset designed to help mitigate the *cocktail party effect* from an augmented-reality (AR)-motivated multi-sensor egocentric world view.
Production First and Production Ready End-to-End Keyword Spotting Toolkit
Measuring room impulse responses with Python and sounddevice
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech
Time delay neural network (TDNN) implementation in PyTorch using unfold method
A PyTorch implementation of DNN-based source separation.
🔊 A comprehensive list of open-source datasets for voice and sound computing (95 datasets).
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with PyTorch, the well-known deep learning toolkit.
Conferencing Speech Challenge
Production First and Production Ready End-to-End Speech Recognition Toolkit
A higher-level Neural Network library for microcontrollers.
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Analyze, visualize, and process sound field data recorded by spherical microphone arrays.
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation