Skip to content

Releases: NickleDave/thrillington

Ramón

04 Feb 12:58
Compare
Choose a tag to compare
Ramón Pre-release
Pre-release

Added

  • ability to use only a subset of MNIST training data and get a validation set from it
  • ability to shuffle dataset on each epoch
  • ability to train replicates (experiment repeated n times with same training data,
    only random initialization / shuffling different)
  • ability to specify other modules to use to load other datasets
  • logging of runs/experiments, with option to dump to a text file
  • tests for MNIST module in datasets
  • a CHANGELOG (this one)

Changed

  • change argparser to use positional arguments command and config
    • before all arguments were "optional" (although the program would crash without them)
  • many changes to training, in attempt to reproduce original paper reconcile different versions
    • currently: use pdf of Gaussian for policy gradient of location network, and
      normalize both baseline, target of baseline, and advantage to decrease variance and
      to keep gradient from exploding

Fixed

  • fix action network and glimpse network, did not have correct number of layers

Ramone

04 Feb 12:57
Compare
Choose a tag to compare

Added

  • original version, probably closest to the one from Kevin Zakka
  • but in Tensorflow (Eager) instead of PyTorch