Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

how to reproduce the scores in paper? #75

Open
ivy94419 opened this issue Jul 12, 2018 · 12 comments
Open

how to reproduce the scores in paper? #75

ivy94419 opened this issue Jul 12, 2018 · 12 comments

Comments

@ivy94419
Copy link

I have train a model and the parameters are set similar with yours:
20 epochs
~80,000 train set
~4000 val set
but I only got 19.3 bleu-4 in epoch 8, but the paper has achieved 24.3 bleu-4
image

When continue training, the scores decrease as follows:
Epoch bleu4
-------------
15 18.28
16 17.71
17 17.79
18 17.72
19 17.43
20 17.35

I am a beginner in deep learning, and how should I adjust parameters or other things to get higher scores?

@mukeshmithrakumar
Copy link

Are you using the same code from the notebook? I ask since most couldn't even get it to run

@ivy94419
Copy link
Author

@mukeshmithrakumar Yes I run the same code, and I used Python 3.6, Tensorflow 1.4, it can run on both Windows10 and ubuntu, I only change some trivial code for adaptation.

@mukeshmithrakumar
Copy link

Hi @ivy94419 I will test the code and will get back to you

@rijuldhir
Copy link

rijuldhir commented Aug 17, 2018

Hi @ivy94419 @mukeshmithrakumar I am able to train the model (python 2.7 tensorflow = 1 4)but could not evaluate them.
Any help regarding it would be helpful

@mukeshmithrakumar
Copy link

mukeshmithrakumar commented Aug 17, 2018

Hi @rijuldhir, I wanted this to be a benchmark and it turned out to be a lot of hustle to get it to predict something so moved on to a different model. Will let you know if I come back to this but I highly doubt anytime soon. You will have better luck with asking ivy

@ivy94419
Copy link
Author

@rijuldhir what errors have you faced during evaluating?

@rijuldhir
Copy link

rijuldhir commented Aug 17, 2018

When i run evaluate_model.ipynb
I get the following error
InvalidArgumentError (see above for traceback): Assign requires shapes of both tensors to match. lhs shape= [1500] rhs shape= [1024]
[[Node: save/Assign_7 = Assign[T=DT_FLOAT, _class=["loc:@initial_lstm/b_c"], use_locking=true, validate_shape=true, _device="/job:localhost/replica:0/task:0/device:GPU:0"](initial_lstm/b_c, save/RestoreV2_7/_29)]]
[[Node: save/RestoreV2_19/_14 = _SendT=DT_FLOAT, client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device_incarnation=1, tensor_name="edge_62_save/RestoreV2_19", _device="/job:localhost/replica:0/task:0/device:CPU:0"]]

@ivy94419 In case you know it shapes are not equal?

@ivy94419
Copy link
Author

@rijuldhir Sorry for late reply, try to change this to 1024
image

@rijuldhir
Copy link

Thanks for the reply @ivy94419 .
I actually got it while checking the code.
Did you got the BLEU scores as given in the paper as I got the max of BLEU-4 19.2 only?
Do I need to make any changes to the code?

@ivy94419
Copy link
Author

ivy94419 commented Aug 19, 2018

@rijuldhir I only achieved 19.3 BLEU-4 as mentioned above ...

@rijuldhir
Copy link

@ivy94419 I have been trying to change vgg network from vgg19 to vgg16 in the code but I am getting some errors.
Any chance you know what's the problem?

@gorgeousyouth
Copy link

do you get the same score in the paper?thanks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants