Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Discrepancy Between Graphs #6

Open
iPhone-Dev opened this issue May 2, 2019 · 1 comment
Open

Discrepancy Between Graphs #6

iPhone-Dev opened this issue May 2, 2019 · 1 comment

Comments

@iPhone-Dev
Copy link

iPhone-Dev commented May 2, 2019

Hey there,

I tried to regenerate the graphs depicted in the article, Tetris is showing significantly lower performance than what is expected based on these graphs.
Another issue is that Packer is absent from all generated graphs ( why? ).
DeepRM's (or PG's in this case) average job slowdown is asymptotically around 2 in the article graphs while in mine is around 4.
All trainings and tests were done using the default commands described in the README.md file.
Is anyone here who could reproduce the exact results described in the article? I'd be thankful if you could help me with this issue.

pg_re_lr_curve.pdf

Regards

@hongzimao
Copy link
Owner

Packer is implemented here https://github.com/hongzimao/deeprm/blob/master/other_agents.py#L4

Random seed might affect the result. You can try running multiple experiments with different seed to see the variance.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants