Code for paper "Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models"

Python 239 3 Updated May 24, 2024

Charsiu: A neural phonetic aligner.

Jupyter Notebook 265 34 Updated Sep 19, 2022

Keyword spotting and forced alignment in any language

Python 29 2 Updated Jun 29, 2024

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python 2,442 642 Updated Jul 25, 2024

Phonetisaurus G2P

Shell 441 123 Updated Jun 1, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 10,224 1,072 Updated Jul 11, 2024

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Python 304 24 Updated Feb 21, 2024
Python 1,346 180 Updated Feb 11, 2024
Python 228 15 Updated Jun 14, 2024

Official code for Wav2Seq

Python 93 11 Updated Jul 19, 2022

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Python 1,740 143 Updated Jul 22, 2024

Segment an audio file and obtain utterance alignments. (Python package)

Python 309 28 Updated May 15, 2024

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 13,108 861 Updated Jul 25, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,196 4,016 Updated Jul 17, 2024

BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing

Python 40 8 Updated Mar 11, 2024

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Python 27,582 3,311 Updated Jul 25, 2024

The official Meta Llama 3 GitHub site

Python 24,326 2,640 Updated Jul 25, 2024

Awesome speech/audio LLMs, representation learning, and codec models

515 26 Updated May 29, 2024

This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent speech tool development, and speech applications.

72 6 Updated Jun 7, 2024

EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction

Jupyter Notebook 209 15 Updated May 19, 2024
2 Updated Nov 5, 2023

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 23,649 3,388 Updated Jul 26, 2024

Inference code for Llama models

Python 54,575 9,359 Updated Jul 25, 2024

SpeechGPT Series: Speech Large Language Models

Python 1,093 69 Updated Jul 22, 2024

Single-blind supplementary materials for NeurIPS 2023 submission

Python 52 4 Updated Jun 6, 2024

Code for paper "Large Language Models are Efficient Learners of Noise-Robust Speech Recognition"

Python 113 2 Updated May 8, 2024

An Audio Language model for Audio Tasks

Python 269 15 Updated Apr 19, 2024

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

Python 340 27 Updated Apr 24, 2024

A tree explorer plugin for vim.

Vim Script 19,432 1,435 Updated Jul 20, 2024