Copy update by pskrunner14 · Pull Request #172 · IBM/pytorch-seq2seq

pskrunner14 · 2018-10-12T19:59:59Z

Fixed copy_decoder def and minor changes.
Latest changes from develop.

TODO: fix dimensions for compatibility with top_k_decoder.

* Fixed topk decoder.

* Use torchtext from pipe. * Fixed torch text sorting order.

…BM#90) * attention is not required when only using teacher forcing in decoder

Fixed field arguments validation.

* 0.1.5 (IBM#91) * Modified parameter order of DecoderRNN.forward (IBM#85) * Updated TopKDecoder (IBM#86) * Fixed topk decoder. * Use torchtext from pipy (IBM#87) * Use torchtext from pipe. * Fixed torch text sorting order. * attention is not required when only using teacher forcing in decoder (IBM#90) * attention is not required when only using teacher forcing in decoder * Updated docs and version. * Fixed code style. * shuffle the training data

* fix example of inflate function in TopKDecoer.py

* Fix hidden_layer size for one-directional decoder Hidden layer size of the decoder was given `hidden_size * 2 if bidirectional else 1`, resulting in a dimensionality error for non-bidirectional decoders. Changed `1` to `hidden_size`.

* Adapt load to allow CPU loading of GPU models Add storage parameter to torch.load to allow loading models on a CPU that are trained on the GPU, depending on availability of cuda.

* Fix wrong parameter use on DecoderRNN

# Conflicts: # seq2seq/models/TopKDecoder.py # seq2seq/trainer/supervised_trainer.py

* Modified parameter order of DecoderRNN.forward (IBM#85) * Updated TopKDecoder (IBM#86) * Fixed topk decoder. * Use torchtext from pipy (IBM#87) * Use torchtext from pipe. * Fixed torch text sorting order. * attention is not required when only using teacher forcing in decoder (IBM#90) * attention is not required when only using teacher forcing in decoder * Updated docs and version. * Fixed code style. * bugfix (IBM#92) Fixed field arguments validation. * Removed `initial_lr` when resuming optimizer with scheduler. (IBM#95) * shuffle the training data (IBM#97) * 0.1.5 (IBM#91) * Modified parameter order of DecoderRNN.forward (IBM#85) * Updated TopKDecoder (IBM#86) * Fixed topk decoder. * Use torchtext from pipy (IBM#87) * Use torchtext from pipe. * Fixed torch text sorting order. * attention is not required when only using teacher forcing in decoder (IBM#90) * attention is not required when only using teacher forcing in decoder * Updated docs and version. * Fixed code style. * shuffle the training data * fix example of inflate function in TopKDecoer.py (IBM#98) * fix example of inflate function in TopKDecoer.py * Fix hidden_layer size for one-directional decoder (IBM#99) * Fix hidden_layer size for one-directional decoder Hidden layer size of the decoder was given `hidden_size * 2 if bidirectional else 1`, resulting in a dimensionality error for non-bidirectional decoders. Changed `1` to `hidden_size`. * Adapt load to allow CPU loading of GPU models (IBM#100) * Adapt load to allow CPU loading of GPU models Add storage parameter to torch.load to allow loading models on a CPU that are trained on the GPU, depending on availability of cuda. * Fix wrong parameter use on DecoderRNN (IBM#103) * Fix wrong parameter use on DecoderRNN

* Upgrade to pytorch-0.3.0 * Use pytorch 3.0 in travis env.

…eturns several seqs for a given seq (IBM#116) * Adding a predictor method to return n predicted seqs for a src_seq input (intended to be used along to Beam Search using TopKDecoder)

when attention is turned off, pytorch (well, 0.4 at least) gets angry about calling view on a non-contiguous tensor

* add contiguous call to tensor (IBM#127) when attention is turned off, pytorch (well, 0.4 at least) gets angry about calling view on a non-contiguous tensor * Fixed shape documentation (IBM#131) * Update to pytorch-0.4 * Remove pytorch manual install in travis.

* Modified parameter order of DecoderRNN.forward (IBM#85) * Updated TopKDecoder (IBM#86) * Fixed topk decoder. * Use torchtext from pipy (IBM#87) * Use torchtext from pipe. * Fixed torch text sorting order. * attention is not required when only using teacher forcing in decoder (IBM#90) * attention is not required when only using teacher forcing in decoder * Updated docs and version. * Fixed code style. * bugfix (IBM#92) Fixed field arguments validation. * Removed `initial_lr` when resuming optimizer with scheduler. (IBM#95) * shuffle the training data (IBM#97) * 0.1.5 (IBM#91) * Modified parameter order of DecoderRNN.forward (IBM#85) * Updated TopKDecoder (IBM#86) * Fixed topk decoder. * Use torchtext from pipy (IBM#87) * Use torchtext from pipe. * Fixed torch text sorting order. * attention is not required when only using teacher forcing in decoder (IBM#90) * attention is not required when only using teacher forcing in decoder * Updated docs and version. * Fixed code style. * shuffle the training data * fix example of inflate function in TopKDecoer.py (IBM#98) * fix example of inflate function in TopKDecoer.py * Fix hidden_layer size for one-directional decoder (IBM#99) * Fix hidden_layer size for one-directional decoder Hidden layer size of the decoder was given `hidden_size * 2 if bidirectional else 1`, resulting in a dimensionality error for non-bidirectional decoders. Changed `1` to `hidden_size`. * Adapt load to allow CPU loading of GPU models (IBM#100) * Adapt load to allow CPU loading of GPU models Add storage parameter to torch.load to allow loading models on a CPU that are trained on the GPU, depending on availability of cuda. * Fix wrong parameter use on DecoderRNN (IBM#103) * Fix wrong parameter use on DecoderRNN * Upgrade to pytorch-0.3.0 (IBM#111) * Upgrade to pytorch-0.3.0 * Use pytorch 3.0 in travis env. * Make sure tensor contiguous when attention's not used. (IBM#112) * Implementing the predict_n method. Using the beam search outputs it returns several seqs for a given seq (IBM#116) * Adding a predictor method to return n predicted seqs for a src_seq input (intended to be used along to Beam Search using TopKDecoder) * Checkpoint after batches not epochs (IBM#119) * Pytorch 0.4 (IBM#134) * add contiguous call to tensor (IBM#127) when attention is turned off, pytorch (well, 0.4 at least) gets angry about calling view on a non-contiguous tensor * Fixed shape documentation (IBM#131) * Update to pytorch-0.4 * Remove pytorch manual install in travis. * Allow using pre-trained embedding (IBM#135) * updated docs

kylegao91 and others added 30 commits October 24, 2017 09:46

Updated TopKDecoder (IBM#86)

842d8aa

* Fixed topk decoder.

Use torchtext from pipy (IBM#87)

a32999e

* Use torchtext from pipe. * Fixed torch text sorting order.

attention is not required when only using teacher forcing in decoder (I…

3f201b8

…BM#90) * attention is not required when only using teacher forcing in decoder

Updated docs and version.

96af89d

Fixed code style.

1574e1c

bugfix (IBM#92)

feabc36

Fixed field arguments validation.

Removed initial_lr when resuming optimizer with scheduler. (IBM#95)

ed8b90c

fix example of inflate function in TopKDecoer.py (IBM#98)

bd3537e

* fix example of inflate function in TopKDecoer.py

Adapt load to allow CPU loading of GPU models (IBM#100)

97aca03

* Adapt load to allow CPU loading of GPU models Add storage parameter to torch.load to allow loading models on a CPU that are trained on the GPU, depending on availability of cuda.

Fix wrong parameter use on DecoderRNN (IBM#103)

a3232b0

* Fix wrong parameter use on DecoderRNN

Merge branch 'master' into develop

fc584e9

# Conflicts: # seq2seq/models/TopKDecoder.py # seq2seq/trainer/supervised_trainer.py

Merge branch 'master' into develop

f86e8c7

Upgrade to pytorch-0.3.0 (IBM#111)

fdbc4a7

* Upgrade to pytorch-0.3.0 * Use pytorch 3.0 in travis env.

Make sure tensor contiguous when attention's not used. (IBM#112)

75a8b76

Implementing the predict_n method. Using the beam search outputs it r…

38e7e21

…eturns several seqs for a given seq (IBM#116) * Adding a predictor method to return n predicted seqs for a src_seq input (intended to be used along to Beam Search using TopKDecoder)

Checkpoint after batches not epochs (IBM#119)

cbd8d8b

add contiguous call to tensor (IBM#127)

aef9b9f

when attention is turned off, pytorch (well, 0.4 at least) gets angry about calling view on a non-contiguous tensor

Fixed shape documentation (IBM#131)

4c661ca

Merge branch 'master' into develop

302c1e8

Pytorch 0.4 (IBM#134)

8995987

* add contiguous call to tensor (IBM#127) when attention is turned off, pytorch (well, 0.4 at least) gets angry about calling view on a non-contiguous tensor * Fixed shape documentation (IBM#131) * Update to pytorch-0.4 * Remove pytorch manual install in travis.

Merge remote-tracking branch 'origin/develop' into develop

519db31

Allow using pre-trained embedding (IBM#135)

0e1d875

updated docs

ee7d6bd

updated README

4799b7c

Merge branch 'master' into develop

1d6c6e2

updated README

17fe235

pskrunner14 added 29 commits September 2, 2018 06:48

fixes test loss concat list issue

cb5be01

fixes sample.py accuracy issue

04acfb4

fixes integration test python3 test fail bug

ac21962

update docs and README

eb21a48

update README

c2cbd45

update docs and README

7ce607c

modified topkdecoder test for more batch sizes

ca80052

changes topkdecoder test

a2d9a20

changes topkdecoder test

2322c5b

changes topkdecoder test

51dc6ea

changes topkdecoder test

20cff5d

changes topkdecoder test

7993fea

debugging topkdecoder test

2e82975

debugging topkdecoder test

a38254b

debugging topkdecoder test

993de89

debugging topkdecoder test

5b54ad1

debugging topkdecoder test

1df8903

beam search fails with batch>2

a148c49

removes deprecated warnings

7aa9330

fixes deprecated warnings

8229fbe

fixes typo

5f61757

fixes typo

539fbc1

fixes typo

59eb3f0

refactors modules and makes init changes for copy mech

c346d33

updated docs and refactored modules

1d6d251

branches copy and coverage api to copy

b733442

debugging copy decoder dims mismatch

104e2b9

backtracked to prev copy dec def due to dimensional errors

4fcad39

fix copy loss

bf05795

diegoantognini mentioned this pull request Nov 6, 2018

Copy attention based decoder #153

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Copy update#172

Copy update#172
pskrunner14 wants to merge 80 commits intoIBM:copyfrom
pskrunner14:copy

pskrunner14 commented Oct 12, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

11 participants

Conversation

pskrunner14 commented Oct 12, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

11 participants