This is the official implementation of "K-Wav2vec 2.0: Automatic Speech Recognition based on Joint Decoding of Graphemes and Syllables".
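The model jointly decodes at two unit levels: graphemes (jamo) and syllables. As background on what that distinction means for Korean text, here is a minimal sketch of splitting precomposed Hangul syllables into their graphemes using the standard Unicode arithmetic. This is illustrative only; it is not the repo's actual tokenizer.

```python
# Decompose precomposed Hangul syllables (U+AC00..U+D7A3) into
# compatibility jamo using the standard Unicode formula:
#   syllable = 0xAC00 + (lead * 21 + vowel) * 28 + tail
CHOSEONG = list("ㄱㄲㄴㄷㄸㄹㅁㅂㅃㅅㅆㅇㅈㅉㅊㅋㅌㅍㅎ")              # 19 leads
JUNGSEONG = list("ㅏㅐㅑㅒㅓㅔㅕㅖㅗㅘㅙㅚㅛㅜㅝㅞㅟㅠㅡㅢㅣ")          # 21 vowels
JONGSEONG = [""] + list("ㄱㄲㄳㄴㄵㄶㄷㄹㄺㄻㄼㄽㄾㄿㅀㅁㅂㅄㅅㅆㅇㅈㅊㅋㅌㅍㅎ")  # 28 tails (incl. none)

def to_graphemes(text):
    """Split each Hangul syllable into its jamo; pass other characters through."""
    out = []
    for ch in text:
        code = ord(ch) - 0xAC00
        if 0 <= code < 11172:              # 19 * 21 * 28 precomposed syllables
            lead, rest = divmod(code, 21 * 28)
            vowel, tail = divmod(rest, 28)
            out.extend([CHOSEONG[lead], JUNGSEONG[vowel]])
            if tail:                       # tail index 0 means no final consonant
                out.append(JONGSEONG[tail])
        else:
            out.append(ch)
    return out
```

For example, `to_graphemes("한")` yields `["ㅎ", "ㅏ", "ㄴ"]`: a syllable-level model predicts one unit where a grapheme-level model predicts three.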
- PyTorch version >= 1.7.1
- Python version >= 3.6
- To install K-wav2vec and develop locally:

```bash
git clone https://github.com/JoungheeKim/K-wav2vec.git
cd K-wav2vec

## install essential libraries
pip install -r requirements.txt

## install locally
python setup.py develop
```
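To sanity-check an environment against the version requirements above, a simple comparison on parsed version tuples is enough. The helper name `meets_minimum` is ours for illustration, not part of the repo.

```python
def version_tuple(v):
    """Parse 'major.minor.patch', tolerating local suffixes like '1.7.1+cu110'."""
    return tuple(int(x) for x in v.split("+")[0].split(".")[:3])

def meets_minimum(installed, required):
    """True if the installed version is at least the required one."""
    return version_tuple(installed) >= version_tuple(required)

# Example usage for the PyTorch requirement:
#   import torch
#   assert meets_minimum(torch.__version__, "1.7.1")
```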
- We have only tested this implementation on Ubuntu 18.04.
- A Dockerfile is also provided in this repo.
- We provide script examples to run the code easily (see the `script` folder). Following these instructions reproduces our results exactly.
```bash
# Guide to building the multi-model with KsponSpeech (orthographic transcription)

# [1] preprocess the dataset & make manifests
bash script/preprocess/make_ksponspeech_script_for_mulitmodel.sh

# [2] further pre-train the model
bash script/pretrain/run_further_pretrain.sh

# [3] fine-tune the model
bash script/finetune/run_ksponspeech_multimodel.sh

# [4] run inference with the model
bash script/inference/evaluate_multimodel.sh
```
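Step [1] produces fairseq-style manifests: a TSV whose first line is the audio root directory, followed by one `relative_path<TAB>num_frames` line per file. As an illustration of that format, here is a minimal stdlib-only sketch; the helper name and the use of Python's `wave` module are our assumptions, not the repo's actual preprocessing code.

```python
import os
import wave

def write_manifest(root_dir, manifest_path):
    """Write a fairseq-style wav2vec manifest: first line is the root
    directory, each following line is '<relative path>\t<frames>'."""
    with open(manifest_path, "w", encoding="utf-8") as out:
        out.write(os.path.abspath(root_dir) + "\n")
        for dirpath, _, filenames in os.walk(root_dir):
            for name in sorted(filenames):
                if not name.endswith(".wav"):
                    continue
                path = os.path.join(dirpath, name)
                with wave.open(path, "rb") as wav:
                    frames = wav.getnframes()   # sample count per file
                rel = os.path.relpath(path, root_dir)
                out.write(f"{rel}\t{frames}\n")
```

At 16 kHz, a one-second utterance would appear in the manifest as `foo.wav\t16000`.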
- E-Wav2vec 2.0: Wav2vec 2.0 pretrained on an English dataset, released by Fairseq(-py)
- K-Wav2vec 2.0: the English model further pretrained on KsponSpeech
- KsponSpeech: an open-domain dialogue corpus
- ClovaCall: a call-based speech corpus for reservations
- Our code is modified from the fairseq codebase, and we use the same license as fairseq.
- The preprocessing code was developed with reference to Kospeech.
- Our implementation code is MIT-licensed. The license applies to the pre-trained models as well.