This repository contains the code used to reproduce the paper "h-detach: Modifying the LSTM Gradient Towards Better Optimization". The final report can be found here: https://github.com/dido1998/iclr_2019/tree/master/papers/TeamYakataka-ryf7ioRqFX
Python 3, PyTorch 0.4, TensorBoard, tqdm
First, change the directory path for saving results on line 27 of copying.py.
To run the copying task with a time delay of 300 steps, using h-detach with probability 0.5 of blocking gradients through the h-state and seed value 3, execute the following command:

```
python copying.py --save-dir=enter_experiment_dir_name_here --p-detach=0.5 --T=300 --seed=3
```
To run the copying task without h-detach, execute the following command:

```
python copying.py --save-dir=enter_experiment_dir_name_here --T=300 --seed=3
```
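As a rough illustration of what `--p-detach` controls, here is a minimal sketch of the h-detach idea (the function name and loop structure are ours, not taken from copying.py, and a standard `nn.LSTMCell` is assumed): at each time step, the gradient path through the hidden state h is stochastically blocked with the given probability, while the cell-state path c is left intact.

```python
import torch
import torch.nn as nn

def lstm_rollout_with_h_detach(cell, inputs, p_detach=0.5):
    """Run an LSTMCell over a sequence, stochastically blocking the
    gradient through the hidden state h with probability p_detach.
    The cell-state path c is left untouched, as in h-detach.
    inputs: tensor of shape (T, batch, input_size)."""
    batch = inputs.size(1)
    h = inputs.new_zeros(batch, cell.hidden_size)
    c = inputs.new_zeros(batch, cell.hidden_size)
    outputs = []
    for x_t in inputs:
        if torch.rand(1).item() < p_detach:
            h = h.detach()  # block the gradient through h at this step
        h, c = cell(x_t, (h, c))
        outputs.append(h)
    return torch.stack(outputs)  # (T, batch, hidden_size)
```

Setting `p_detach=0` recovers ordinary backpropagation through time; only the h-state path is ever cut, so gradients can still flow through the cell state across the full sequence.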
To run the transfer copying task using a previously trained model for T=200 (or any other value), run:

```
python transfer_copy.py --T=200 --model-dir=enter_full_model_path
```
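For reference, the copying memory task that both scripts train and evaluate on can be sketched as follows. This is an illustrative data generator under our own naming and conventions (not code from copying.py): the model sees a few random symbols, waits through T blank steps, and must reproduce the symbols after a delimiter marker. Transfer copying simply evaluates a trained model on a larger T than it was trained with.

```python
import torch

def copying_batch(batch_size, T, seq_len=10, n_symbols=8):
    """Generate one batch of the copying memory task.
    Input: seq_len random symbols, then blanks, with a delimiter
    just before the output phase. Target: echo the symbols at the
    last seq_len positions."""
    blank, delim = 0, n_symbols + 1
    length = T + 2 * seq_len
    symbols = torch.randint(1, n_symbols + 1, (batch_size, seq_len))
    x = torch.full((batch_size, length), blank, dtype=torch.long)
    x[:, :seq_len] = symbols
    x[:, T + seq_len - 1] = delim      # marker: start reproducing
    y = torch.full((batch_size, length), blank, dtype=torch.long)
    y[:, -seq_len:] = symbols          # target: echo the symbols
    return x, y
```

Because the symbols must be carried across T blank steps, increasing T directly increases the gradient path length, which is what makes this task a standard probe for long-term credit assignment.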
First, uncomment and change the directory path for saving results on line 30 of pixelmnist.py.
To run pixel MNIST, execute:

```
python pixelmnist.py --p-detach=0.25 --seed=250 --save-dir=enter_experiment_dir_name_here
```
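In the pixel MNIST setup, each 28x28 image is read one pixel at a time as a length-784 sequence. A minimal sketch of that flattening (the helper name is ours, not from pixelmnist.py):

```python
import torch

def to_pixel_sequence(images):
    """Flatten a batch of MNIST images of shape (batch, 1, 28, 28)
    into pixel-by-pixel sequences of shape (784, batch, 1), the
    time-major layout expected by a step-wise LSTM."""
    batch = images.size(0)
    seq = images.view(batch, -1, 1)          # (batch, 784, 1)
    return seq.transpose(0, 1).contiguous()  # (784, batch, 1)
```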
We experimented with multiple seeds (around 5) and picked the model with the best validation accuracy to report the final test performance. On our machine, a seed of 400 gives the best validation model, with a test accuracy of 98.62%.
First, uncomment and change the directory path for saving results on line 30 of pixelmnist.py.
To run permuted pixel MNIST, execute:

```
python pixelmnist.py --p-detach=0.25 --seed=150 --permute --save-dir=enter_experiment_dir_name_here
```
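What `--permute` adds, conceptually, is a single fixed random permutation applied to the pixel order of every image, which destroys local pixel correlations and makes the sequence task harder. A minimal sketch under our own naming (not code from pixelmnist.py):

```python
import torch

def permute_pixels(seq, seed=0):
    """Apply a fixed random permutation along the time axis of a
    (784, batch, 1) pixel sequence, as in permuted pixel MNIST.
    Seeding a dedicated generator ensures the SAME permutation is
    reused for every batch, at train and test time alike."""
    g = torch.Generator().manual_seed(seed)
    perm = torch.randperm(seq.size(0), generator=g)
    return seq[perm]
```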
We experimented with multiple seeds (around 5) and picked the model with the best validation accuracy to report the final test performance. On our machine, a seed of 400 gives the best validation model, with a test accuracy of 91.92%.
The code for the image captioning experiments can be found in the separate captioning branch.