Training loop cleaning and commenting for better understanding and further improvements #90

ngarneau · 2018-12-30T14:32:54Z

Hey guys,

I want to add some specific behaviors (like conditional language modeling) to the AWD-LSTM model and to do so I needed to fully understand the training loop as well as the model.

To this end, I re-arranged a couple of parts within the code, simplifying the training loop with callbacks provided by an open source lib that makes this easily affordable.

I also plan to add detailed logging using a specific lib for this (I had Sacred in mind, maybe you would suggest other?) as suggested by the team of AllenNLP (see this). This is mainly for proper debugging and reproducibility.

If you think there should be things done differently please let me know and I will change accordingly.

Here is a list of the callbacks implemented:

Initialize the hidden state on_epoch_begin.
Repackage the hidden state on_batch_begin.
Adaptative LR on_batch_begin depending on the sequence length.
Evaluation callback when switched to ASGD (did not quite fully understood this one, I need more explanation of the why we need this) that clones the parameters on_epoch_begin/end. This callback may not be working as expected for the moment.
Switch to ASGD using the non-mono trigger on_epoch_begin.

I also did a coupled of changes within the model:

Moved the hidden state carrying and handling within the model.
Moved the criterion within the model as well as the computation of the Activation regularization and the temporal activation regularization for the final loss.
Added accuracy of the model for information purpose.

Feel free to integrate these changes into your codebase. I think it simplifies a lot the understanding of the training loop and the code that will support further development.

Many thanks for the implementation!

…awd-lstm-lm into training-loop-cleaning

…-lm into training-loop-cleaning

…awd-lstm-lm into training-loop-cleaning

salesforce-cla · 2018-12-30T14:32:58Z

Thanks for the contribution! Before we can merge this, we need @ngarneau to sign the Salesforce.com Contributor License Agreement.

ngarneau added 30 commits December 19, 2018 19:58

Add data and corpuses in the gitignore

4f3898d

Re organize main file

8b12a9a

Pytoune experiments import

c4872fc

Removing prints

57e3b8c

Adding alpha and betas into the model for the criterion

a183e58

Cleaning the model and adding the criterion within it

a21cbc7

Changing default dir name

d9a4d81

Using pytoune for the training loop and several callbacks

6929f6b

Creating a sentence loader for pytoune

c39ba2b

Slight change for parameter passing

5d214c3

Remove print and useless methods

5dbace3

Adding some metrics during training

baf7bf5

Fixing the way we compute the loss on validation and the accuracy

e4d541c

Fixing tmp variable name

7c3704c

Properly compute number of batches

fd6061f

Fixing the dataset loader and print when switching to ASGD

428f18c

Using same batch size for valid and test, quick fix

6375cc9

Merge branch 'training-loop-cleaning' of https://github.com/ngarneau/…

d6e9f29

…awd-lstm-lm into training-loop-cleaning

Check if parameter exist in tmp var before reading it

f8f81d0

Merge branch 'training-loop-cleaning' of github.com:ngarneau/awd-lstm…

c490940

…-lm into training-loop-cleaning

Fix the evaluation callback

7d4ff11

make old main work again

f5b5bb6

Reduce LR on plateau callback

d6407b2

Merge branch 'training-loop-cleaning' of github.com:ngarneau/awd-lstm…

001f9cc

…-lm into training-loop-cleaning

Moving callbacks into seperate module

7f48b6a

Monitor validation loss

f121619

Document callbacks

01ac056

Missing imports

6ccae4e

Merge branch 'training-loop-cleaning' of github.com:ngarneau/awd-lstm…

a715d47

…-lm into training-loop-cleaning

Properly name experiment with dataset and model name

2c5ffe6

ngarneau added 15 commits December 27, 2018 11:59

Temp fix for the batch size

c8a6074

Change eval batch size

1010528

Do not randomize when in eval mode

6fb9872

Dynamic handling of batch size on eval does not work

e68066d

Merge branch 'training-loop-cleaning' of https://github.com/ngarneau/…

371a6f2

…awd-lstm-lm into training-loop-cleaning

Wrangling batch size and randomness to reproduce exact results...

2e3cc6c

Make main work back again on branch

e2c2ed3

Make sure we restart at same seq len

55a7c2a

Properly set seed in training loop

6ecc6c0

Patch to reset hidden state on evaluation

3ef6b35

Fix evaluation and model saving

a35243b

Allow keyboard interrupt then evaluate on test

a569717

Provide batch size to init hidden state

bd1ad2b

Add requirements file

51e8098

Merge branch 'training-loop-cleaning'

480319e

salesforce-cla bot added the cla:missing label Dec 30, 2018

salesforce-cla bot added cla:signed and removed cla:missing labels Dec 30, 2018

ngarneau added 2 commits December 30, 2018 18:18

Use pytoune main and model as the real main and model now

7175311

Use model instead of pytoune model for the RNNModel

4ffb50c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Training loop cleaning and commenting for better understanding and further improvements #90

Training loop cleaning and commenting for better understanding and further improvements #90

ngarneau commented Dec 30, 2018

salesforce-cla bot commented Dec 30, 2018

Training loop cleaning and commenting for better understanding and further improvements #90

Are you sure you want to change the base?

Training loop cleaning and commenting for better understanding and further improvements #90

Conversation

ngarneau commented Dec 30, 2018

salesforce-cla bot commented Dec 30, 2018