This is the PyTorch implementation for the paper "FCCL: Fine- and Coarse-Granularity Contrastive Learning for Speech Translation".
Our code is based on Espnet and uses PyTorch-Lightning to organize the training code. Please install Espnet and PyTorch-Lightning following their official guides.
- Download the wav2vec 2.0 model published on Hugging Face.
- We extract features with wav2vec 2.0 before training. The scripts are saved in ./scripts/.
- Save the extracted features to a json file. This format is consistent with Espnet. We upload dev.json and the corresponding features for reference, so you can quickly debug the code.
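As a rough illustration of the Espnet-style json mentioned above, the sketch below writes one utterance entry. The utterance id, feature path, and field values here are hypothetical placeholders; the repo's own dev.json defines the actual schema to follow.

```python
import json

# Hypothetical entry in the Espnet-style data json. "feat" points to the
# pre-extracted wav2vec 2.0 feature file; "shape" is [frames, feature_dim]
# for the input and [token_count, vocab_size] for the output.
data = {
    "utts": {
        "utt_0001": {
            "input": [
                {"name": "input1", "feat": "feats/utt_0001.npy", "shape": [312, 768]}
            ],
            "output": [
                {"name": "target1", "text": "hello world", "shape": [2, 5000]}
            ],
        }
    }
}

with open("dev.json", "w") as f:
    json.dump(data, f, indent=2)
```

Compare the generated file against the uploaded dev.json to confirm the exact field names your setup expects.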
```shell
. ./run.sh
```
The training process is defined in ./src/bins/plModule.py. The contrastive learning module is defined in ./src/bins/cl_loss.py.
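For readers browsing cl_loss.py, the snippet below is a minimal InfoNCE-style contrastive loss sketch in PyTorch, not the paper's actual fine/coarse-granularity formulation: it contrasts paired speech and text embeddings, treating the matched pair in each batch as the positive and all other pairs as negatives. The function name and the temperature value are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def info_nce_loss(speech_emb, text_emb, temperature=0.07):
    """Generic InfoNCE contrastive loss over a batch of paired embeddings.

    speech_emb, text_emb: tensors of shape (batch, dim), where row i of each
    tensor comes from the same utterance (the positive pair).
    """
    # Normalize so the dot product is cosine similarity.
    speech_emb = F.normalize(speech_emb, dim=-1)
    text_emb = F.normalize(text_emb, dim=-1)
    # Pairwise similarity matrix: entry (i, j) compares speech i to text j.
    logits = speech_emb @ text_emb.t() / temperature
    # Matched pairs lie on the diagonal, so target class for row i is i.
    targets = torch.arange(speech_emb.size(0), device=speech_emb.device)
    return F.cross_entropy(logits, targets)
```

The actual FCCL losses operate at both fine (token/frame) and coarse (sentence) granularity; see cl_loss.py for the real implementation.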