Pur1zumu/HarmoF0

 
 


HarmoF0 Pitch Tracker

This repo is the PyTorch implementation of "HarmoF0: Logarithmic Scale Dilated Convolution For Pitch Estimation".

HarmoF0 is a lightweight, high-performance pitch tracking model based on multi-rate dilated convolution. Evaluation results are reported with a threshold of 50 cents.
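As background on the 50-cent threshold: a cent is 1/100 of a semitone, so the distance between two frequencies in cents is 1200 * log2(f_est / f_ref), and a frame counts as correct when that distance is at most 50 cents (half a semitone). The sketch below illustrates this computation. The function names are hypothetical and are not part of harmof0, and it assumes all frames are voiced with positive frequencies; the exact metric used for the reported results is not stated here.

```python
import numpy as np

def cents_error(f_est, f_ref):
    """Absolute pitch error in cents between estimated and reference frequencies (Hz).

    Assumes both inputs contain only positive (voiced) frequencies.
    """
    f_est = np.asarray(f_est, dtype=float)
    f_ref = np.asarray(f_ref, dtype=float)
    return np.abs(1200.0 * np.log2(f_est / f_ref))

def accuracy_within(f_est, f_ref, threshold_cents=50.0):
    """Fraction of frames whose estimate lies within threshold_cents of the reference."""
    return float(np.mean(cents_error(f_est, f_ref) <= threshold_cents))
```

For example, an estimate one semitone above the reference is 100 cents off and would not count as correct at this threshold.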

Installation and usage

Install the required packages and harmof0 with:

$ pip install -r requirements.txt
$ python setup.py install

Estimate the pitch of a single audio file, or of the audio files in a folder, using the pretrained HarmoF0 model. HarmoF0 supports wav, mp3 and flac files.

$ harmof0 test/a.mp3 
$ harmof0 test

The results are saved in test/a.f0.txt and test/a.activation.png by default.

a.f0.txt

# time frequency activation
0.000 27.500 0.000
0.010 27.500 0.000
0.020 27.500 0.000
0.030 27.500 0.000
0.040 27.500 0.000
0.050 27.500 0.000
0.060 27.500 0.000
0.070 27.500 0.000
0.080 55.000 0.047
0.090 55.000 0.032
0.100 59.118 0.062
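The text output above can be read back for further analysis. The following is a minimal sketch, assuming the file keeps the three whitespace-separated columns shown (time, frequency, activation) with a `#` header line. `load_f0` and its `min_activation` parameter are hypothetical helpers, not part of harmof0, and the idea of discarding low-activation frames as unvoiced is an assumption about how the activation column is meant to be used.

```python
import numpy as np

def load_f0(path, min_activation=0.0):
    """Load a .f0.txt file and keep frames whose activation exceeds min_activation.

    Returns three 1-D arrays: time (presumably seconds), frequency (presumably Hz),
    and activation.
    """
    # Lines starting with '#' (the header) are skipped by np.loadtxt.
    data = np.loadtxt(path, comments="#", ndmin=2)
    time, freq, activation = data[:, 0], data[:, 1], data[:, 2]
    # Assumption: frames with activation at or below the threshold are unvoiced.
    keep = activation > min_activation
    return time[keep], freq[keep], activation[keep]
```

Calling `load_f0("test/a.f0.txt")` on the sample above would drop the leading frames whose activation is 0.000 and keep the frames from 0.080 onward.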

a.activation.png

Enable post-processing:

$ harmof0 test --post-processing=True

a.activation.post.png

Specify the output directory and device:

$ harmof0 test/a.mp3 --output-dir=output --device=cuda

For more information:

$ harmof0 --help

Usage inside Python

Import harmof0 as a module:

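import harmof0
import torchaudio

# Create the pitch tracker
pit = harmof0.PitchTracker()
# Load the audio; torchaudio returns the waveform tensor and its sample rate
waveform, sr = torchaudio.load('test/a.mp3')
# Predict the time, frequency and activation of each frame, plus the activation map
time, freq, activation, activation_map = pit.pred(waveform, sr)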
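A common next step is to map the predicted frequencies to musical notes. The sketch below uses the standard conversion midi = 69 + 12 * log2(f / 440), with A4 = 440 Hz as MIDI note 69. `hz_to_midi` and `midi_to_name` are hypothetical helpers, not part of harmof0, and the sketch assumes the `freq` values are in Hz and can be converted to a NumPy array.

```python
import numpy as np

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]

def hz_to_midi(freq_hz):
    """Convert frequencies in Hz to (fractional) MIDI note numbers, with A4 = 440 Hz = 69."""
    freq_hz = np.asarray(freq_hz, dtype=float)
    return 69.0 + 12.0 * np.log2(freq_hz / 440.0)

def midi_to_name(midi):
    """Name of the nearest equal-tempered note, e.g. 69 -> 'A4'."""
    n = int(round(float(midi)))
    return f"{NOTE_NAMES[n % 12]}{n // 12 - 1}"
```

With the sample output shown earlier, 55.000 Hz maps to MIDI note 33 (A1) and 27.500 Hz to MIDI note 21 (A0).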
