Stars
Use PEFT or Full-parameter to finetune 350 LLMs or 90 MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vi…
Open Source framework for voice and multimodal conversational AI
A fast inference library for running LLMs locally on modern consumer-class GPUs
Free and Open Source, Distributed, RESTful Search Engine
Maix Speech AI lib, a fast and small speech lib running on embedded devices, including ASR, chat, TTS etc.
A 10000 hours dataset for Chinese speech recognition
LvHang / aps
Forked from funcwj/apsA workspace for single/multi-channel speech recognition & enhancement & separation.
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi
Production First and Production Ready End-to-End Speech Recognition Toolkit
⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
FSA/FST algorithms, differentiable, with PyTorch compatibility.
kaldi-asr/kaldi is the official location of the Kaldi project.