New dataset and metadata integration #292
Conversation
Looks good to me
This PR introduces breaking changes to the datasets. Please update the dataset before merging this PR. Also, can you change the file in
Yes, we will do that this week. Since this PR cannot be merged until we fix the data, I have temporarily taken your code and applied it to our integrations/vbd branch.
@nadarenator Let's see if we can update the dataset links in the README today so we can merge this PR.
I think we should hold off on regenerating the dataset until we finish vbd alignment, unless you're confident that the current processing script extracts all the required info.
Is regenerating that expensive? We can just regenerate a small number of files for testing.
Yes, that's what I meant: using a small set for our purposes. But before merging this PR we'd need updated links, which would involve regenerating the whole dataset. And if we find out during vbd integration that we need yet another piece of info from the raw tfrecords, we'd have to regenerate the full dataset again.
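Regenerating only a small sample for testing, as suggested above, could be as simple as picking a reproducible subset of the raw TFRecord shards before running the processing script. The sketch below is hypothetical: the directory layout, glob pattern, and function name are assumptions for illustration, not the repo's actual API.

```python
import random
from pathlib import Path

def sample_tfrecords(raw_dir: str, k: int = 10, seed: int = 0) -> list[Path]:
    """Return k randomly chosen raw shard paths so only a small slice
    of the dataset needs to be regenerated for testing.

    A fixed seed keeps the test subset reproducible across runs.
    """
    files = sorted(Path(raw_dir).glob("*.tfrecord*"))  # hypothetical naming
    rng = random.Random(seed)
    return rng.sample(files, min(k, len(files)))
```

Only the selected shards would then be fed to the processing script, leaving the full regeneration for after the vbd alignment settles.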
Force-pushed from 79f8128 to 46f9df4.
The only thing remaining: regenerate the dataset, upload it to HF, and add the link to the README.
Can you send me a link to the new data once it's up? I'd like to do a test PPO run.
It should be available now in my public directory on greene (the same one I shared before).
It would be great if we could redo some of the tutorial notebooks; most of the indexing is outdated, and there are also datatypes now that handle the indexing internally.
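The kind of datatype referred to above, one that hides raw index arithmetic behind named accessors, might look like the following. This is a hedged sketch only: the class name, field layout, and slice positions are invented for illustration and do not reflect GPUDrive's actual observation schema.

```python
from dataclasses import dataclass

import numpy as np

@dataclass
class AgentObservation:
    """Wraps a flat per-agent feature vector so notebooks never
    hard-code magic indices that break when the layout changes."""

    data: np.ndarray  # flat observation vector for one agent

    # Hypothetical slice layout -- illustration only
    _POS = slice(0, 2)
    _VEL = slice(2, 4)
    _HEADING = 4

    @property
    def position(self) -> np.ndarray:
        return self.data[self._POS]

    @property
    def velocity(self) -> np.ndarray:
        return self.data[self._VEL]

    @property
    def heading(self) -> float:
        return float(self.data[self._HEADING])
```

If the tutorial notebooks index through accessors like these instead of raw offsets, a future change to the observation layout only touches the datatype, not every notebook.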
Okay, so 3 issues to fix:
- Revert the headless paths
- Remove metadata.id unless it is absolutely required
- Use a hashmap in JSON serialization to make the code cleaner
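The hashmap-based serialization suggested in the last point might look like the sketch below: a single mapping from output keys to extractor functions replaces a chain of ad-hoc branches. The field names and scenario structure here are placeholders, not the actual metadata schema.

```python
import json

# Map each output JSON key to a function that extracts it from a scenario.
# Adding a new metadata field is one line in this dict, instead of another
# if/else branch inside the serializer.
FIELD_EXTRACTORS = {
    "scenario_id": lambda s: s["id"],
    "num_agents": lambda s: len(s["agents"]),
    "duration": lambda s: s["end_ts"] - s["start_ts"],
}

def serialize_metadata(scenario: dict) -> str:
    """Serialize a scenario dict to JSON by walking the extractor map."""
    return json.dumps({key: fn(scenario) for key, fn in FIELD_EXTRACTORS.items()})
```

The same map can also drive validation or documentation of the metadata fields, since the set of serialized keys now lives in one place.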
I am approving this because I am unable to fix the tests. However, I have done a smell test, visualized the scenarios, and re-verified the code; I am confident that everything should be working fine. And since this feature is blocking work on vbd integration, I think it's fine to merge it in. I will work on getting the tests to pass in a separate PR. Also, @daphnecor, since many people are using GPUDrive, I would recommend bumping the version number of our current release and putting a warning/info card in the README to notify people that this new version has breaking changes relative to the previous dataset and that they should download the new version.
Great suggestion. I will add the release and description.
Description
This PR integrates a new version of the dataset with GPUDrive.
TODOs