NU-Wave — Official PyTorch Implementation

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling
Junhyeok Lee, Seungu Han @ MINDsLab Inc., SNU

Paper(arXiv): https://arxiv.org/abs/2104.02321 (Accepted to INTERSPEECH 2021)
Audio Samples: https://mindslab-ai.github.io/nuwave

Official PyTorch Lightning implementation of NU-Wave.

Update: CODE RELEASED! The README is still being updated.
TODO: instructions for preprocessing, training, and evaluation.

Preprocessing

TODO
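
Until this section is filled in, here is a minimal sketch of what utils/wav2pt.py (listed as "Preprocessing" in the repository structure below) presumably does: converting .wav files into raw waveform tensors saved as .pt files. The function name, paths, and mono conversion are assumptions, not the script's documented interface.

```python
# Hypothetical wav -> .pt preprocessing sketch; see utils/wav2pt.py for the real script.
import glob
import os

import torch
import torchaudio


def wav_to_pt(wav_dir: str, out_dir: str) -> None:
    """Load every .wav file in wav_dir and save its waveform tensor as .pt in out_dir."""
    os.makedirs(out_dir, exist_ok=True)
    for wav_path in sorted(glob.glob(os.path.join(wav_dir, "*.wav"))):
        wav, sr = torchaudio.load(wav_path)      # wav: (channels, samples), sr: sample rate
        wav = wav.mean(dim=0)                    # assumption: collapse to mono
        name = os.path.splitext(os.path.basename(wav_path))[0]
        torch.save(wav, os.path.join(out_dir, name + ".pt"))


if __name__ == "__main__":
    wav_to_pt("VCTK-Corpus/wav48/p225", "pt")    # placeholder paths
```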

Training

TODO: run trainer.py.
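
Until the training instructions are written, the snippet below sketches how a PyTorch Lightning 1.1.x run is typically wired together with the files in this repository. The class name NuWave and the assumption that the module reads its dataloaders and hyperparameters from hparameter.yaml are guesses; trainer.py is the authoritative entry point.

```python
# Hypothetical training skeleton; run trainer.py for the actual script.
from omegaconf import OmegaConf          # assumption: config is YAML loaded via OmegaConf
from pytorch_lightning import Trainer

from lightning_model import NuWave       # assumption: name of the exported LightningModule

hparams = OmegaConf.load("hparameter.yaml")
model = NuWave(hparams)                  # assumption: module is constructed from the config
trainer = Trainer(gpus=1)                # Lightning 1.1.x-style arguments
trainer.fit(model)                       # dataloaders assumed to be defined inside the module
```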

Evaluation

TODO: run for_test.py or test.py.
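
In the meantime, this is a hedged sketch of what evaluation/sampling amounts to: restoring a trained checkpoint and running the model on a low-resolution waveform. The checkpoint path, the sample() method, and the 16 kHz to 48 kHz setting are assumptions; prefer for_test.py, test.py, or sampling.py once the instructions land.

```python
# Hypothetical inference sketch; see for_test.py / test.py / sampling.py for the real scripts.
import torch
import torchaudio
from omegaconf import OmegaConf

from lightning_model import NuWave                 # assumption: name of the exported LightningModule

hparams = OmegaConf.load("hparameter.yaml")
model = NuWave(hparams)
ckpt = torch.load("nuwave.ckpt", map_location="cpu")   # placeholder checkpoint path
model.load_state_dict(ckpt["state_dict"])          # Lightning checkpoints store weights under this key
model.eval()

wav_lr, sr = torchaudio.load("input_16k.wav")      # placeholder low-resolution input
with torch.no_grad():
    wav_hr = model.sample(wav_lr)                  # assumption: a reverse-diffusion sampling method
torchaudio.save("output_48k.wav", wav_hr.reshape(1, -1).cpu(), 48000)
```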

Repository Structure

.
├── Dockerfile
├── dataloader.py           # Dataloader for train/val(=test)
├── filters.py              # Filter implementation
├── test.py                 # Test with lightning_loop.
├── for_test.py             # Test with for_loop. Recommended due to the device dependency of Lightning
├── hparameter.yaml         # Config
├── lightning_model.py      # NU-Wave implementation. DDPM is based on ivanvovk's WaveGrad implementation
├── model.py                # NU-Wave model based on lmnt-com's DiffWave implementation
├── requirement.txt         # Required libraries
├── sampling.py             # Sampling a file
├── trainer.py              # Lightning trainer
├── README.md           
├── utils
│  ├── stft.py              # STFT layer
│  ├── tblogger.py          # Tensorboard Logger for lightning
│  └── wav2pt.py            # Preprocessing
└── docs                    # For github.io
    └─ ...

Requirements

PyTorch >= 1.7.0 (for nn.SiLU, i.e. swish) and PyTorch Lightning == 1.1.6 are required. The full list of dependencies is in requirements.txt. We also provide a Docker setup via the Dockerfile.
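
A quick sanity check (not part of the repository) that the installed PyTorch is new enough for nn.SiLU:

```python
# nn.SiLU (swish) was added in PyTorch 1.7.0; this fails with an AttributeError on older versions.
import torch
import torch.nn as nn

print(torch.__version__)
swish = nn.SiLU()
print(swish(torch.tensor([1.0])))   # tensor([0.7311])
```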

References

This implementation uses code from the following repositories:

  • ivanvovk's WaveGrad: https://github.com/ivanvovk/WaveGrad
  • lmnt-com's DiffWave: https://github.com/lmnt-com/diffwave

This README and the webpage for the audio samples are inspired by:

The audio samples on our webpage are partially derived from:

  • VCTK: 46 hours of English speech from 108 speakers.

Citation & Contact

If you find this repository useful for your research, please consider citing! The BibTeX entry will be updated after the INTERSPEECH 2021 conference.

@article{lee2021nuwave,
  title={NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling},
  author={Lee, Junhyeok and Han, Seungu},
  journal={arXiv preprint arXiv:2104.02321},
  year={2021}
}

If you have a question or any kind of inquiries, please contact Junhyeok Lee at [email protected]
