Discrepancy Between Graphs #6

iPhone-Dev · 2019-05-02T17:45:11Z

Hey there,

I tried to regenerate the graphs depicted in the article, Tetris is showing significantly lower performance than what is expected based on these graphs.
Another issue is that Packer is absent from all generated graphs ( why? ).
DeepRM's (or PG's in this case) average job slowdown is asymptotically around 2 in the article graphs while in mine is around 4.
All trainings and tests were done using the default commands described in the README.md file.
Is anyone here who could reproduce the exact results described in the article? I'd be thankful if you could help me with this issue.

pg_re_lr_curve.pdf

Regards

hongzimao · 2019-05-26T13:00:55Z

Packer is implemented here https://github.com/hongzimao/deeprm/blob/master/other_agents.py#L4

Random seed might affect the result. You can try running multiple experiments with different seed to see the variance.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Discrepancy Between Graphs #6

Discrepancy Between Graphs #6

iPhone-Dev commented May 2, 2019 •

edited

Loading

hongzimao commented May 26, 2019

Discrepancy Between Graphs #6

Discrepancy Between Graphs #6

Comments

iPhone-Dev commented May 2, 2019 • edited Loading

hongzimao commented May 26, 2019

iPhone-Dev commented May 2, 2019 •

edited

Loading