Remove padding agents using interface entities #103

aaravpandya · 2024-05-05T22:32:26Z

This is a big PR, since removing the padding agents impacts almost everything.

We introduce new archetypes, AgentInterface and RoadInterface, containing the components that we want to export. To fill in the observations, we use new structs AgentInterfaceEntity and RoadInterfaceEntity that store a reference to these interface entities. While collecting observations, we can simply get the corresponding arrays and fill in the values while ignoring the padded entries.

While we make as many entities from these new archetypes as specified by consts, they do not have physics components and are not registered with the BVH. These entities are also not iterated over in the task graph. The effect is that the task graph only runs for the actual number of agents in the Sim, and we simply reference the "export components" by id and fill in the observations.
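A minimal, engine-free sketch of the pattern described above, for readers who want to see the data flow. Everything here is a hypothetical stand-in: `World`, `makeWorld`, `collectObservations`, the `speed` and `done` fields, the index-based reference and the value of `kMaxAgentCount` are illustrative assumptions, not the types in types.hpp or the Madrona ECS API.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Illustrative value only; the real constant lives in consts.
constexpr std::size_t kMaxAgentCount = 8;

// Exported components only: no physics state, never visited by the step loop.
struct AgentInterface {
    float speed = 0.0f;
    int32_t done = 1;  // assumed padding default: unused slots read as "done"
};

// Simulated entity. interfaceIdx plays the role of AgentInterfaceEntity.
struct Agent {
    float speed;
    bool done;
    std::size_t interfaceIdx;
};

struct World {
    std::vector<Agent> agents;                         // only the real agents
    std::array<AgentInterface, kMaxAgentCount> iface;  // fixed-size export buffer
};

// Create exactly numAgents agents, each bound to its own interface slot.
inline World makeWorld(std::size_t numAgents) {
    assert(numAgents <= kMaxAgentCount);
    World w{};
    for (std::size_t i = 0; i < numAgents; ++i) {
        w.agents.push_back(Agent{0.0f, false, i});
    }
    return w;
}

// The per-step loop visits real agents only and writes through the stored
// reference; padded interface slots keep their defaults.
inline void collectObservations(World &w) {
    for (const Agent &a : w.agents) {
        AgentInterface &out = w.iface[a.interfaceIdx];
        out.speed = a.speed;
        out.done = a.done ? 1 : 0;
    }
}
```

In this sketch the exported buffer always has `kMaxAgentCount` entries per world, while the work done per step scales with the real agent count.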

aaravpandya · 2024-09-05T18:41:06Z

I am getting incorrect values in this branch while executing on CUDA. The values are exactly correct in every index for CPU exec mode. I am unsure of how to debug this.

What this PR tries to do: Before this change, we simply had Agent and PhysicsEntity. We had to create padding agents and physics entities to make up for the irregular number of entities per world in the dataset. To remove the padding agents, I introduced the AgentInterface and RoadInterface archetypes. Now we only create as many agents as are needed per world, but always have consts::kMaxAgentCount AgentInterface entities. Each agent stores a reference to an AgentInterface using an AgentInterfaceEntity struct. We fill in the observations in the AgentInterface, which is then exported, while keeping zeros/ones in the padded AgentInterface entities depending on semantics. This is designed similarly to gpu_hideseek, but in reverse: here the Agent stores the reference to the AgentInterface.
The relevant code for entity creation is in level_gen.cpp::343 and the Archetypes are defined in types.hpp::241.
After the Agents are created, we make sure we have consts::kMaxAgentCount number of AgentInterface here - level_gen.cpp::299

How to reproduce the error: Run headless on CPU and then on CUDA for 1 step, using ./headless CPU 1 and ./headless CUDA 1. Print any tensor (I am printing the Done tensor here, since it is a scalar value and very easy to interpret). The values output by the CPU exec mode are correct. The values output by CUDA are not.

Also, ctest can be run by changing exec modes. The tests pass on CPU but fail on CUDA.
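One way to narrow down a CPU/CUDA divergence like this is to dump the same tensor from both runs and locate the first element that differs. The sketch below assumes both dumps have already been flattened into integer vectors; `firstMismatch`, `Slot` and `toSlot` are hypothetical helpers written for this note, not part of the repository or the test suite.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Returns the first flat index at which the two dumps differ, or -1 if they
// are identical. A length mismatch counts as a difference at the end of the
// shorter buffer.
inline long long firstMismatch(const std::vector<int32_t> &cpu,
                               const std::vector<int32_t> &gpu) {
    const std::size_t n = cpu.size() < gpu.size() ? cpu.size() : gpu.size();
    for (std::size_t i = 0; i < n; ++i) {
        if (cpu[i] != gpu[i]) return static_cast<long long>(i);
    }
    return cpu.size() == gpu.size() ? -1 : static_cast<long long>(n);
}

struct Slot {
    std::size_t world;
    std::size_t agent;
};

// Converts a flat index into (world, agent), assuming the tensor is laid out
// as contiguous per-world rows of agentsPerWorld entries.
inline Slot toSlot(std::size_t flat, std::size_t agentsPerWorld) {
    assert(agentsPerWorld > 0);
    return Slot{flat / agentsPerWorld, flat % agentsPerWorld};
}
```

Knowing whether the first mismatch falls inside a world's real agents or only in the padded region helps separate a data bug from a layout or ordering bug.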

I am not sure how to debug this or where the error could even reside.

@eugenevinitsky

aaravpandya · 2024-09-09T19:10:18Z

@shacklettbp Hi Brennan, would you mind taking a look? I am not sure how to proceed with this issue.

For reference, this is what the output looks like. I think the main issue is that the CUDA exec mode returns different values than CPU. The CPU values are correct.

(madrona) (base) aarav@emerge2-desktop:~/gpudrive/build$ ./headless CPU 1
Done
[ 0 0 1 1 1 0 0 0 1 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ]
[ 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ]
[ 1 0 0 1 1 1 1 1 0 1 1 1 1 0 0 1 0 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ]
FPS 3377.298340
Agent-Normalized FPS 178996.812500
(madrona) (base) aarav@emerge2-desktop:~/gpudrive/build$ ./headless CUDA 1
Initialization finished
Done
[ 0 0 0 0 1 0 1 0 1 0 1 0 0 0 0 0 1 0 1 0 1 0 1 0 1 0 0 0 0 0 1 0 1 1 1 1 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ]
[ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ]
[ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ]
FPS 1700.869263
Agent-Normalized FPS 0.000000

shacklettbp · 2024-09-10T01:50:21Z

You need to sort the new interface entities when using the GPU backend. It doesn't look like you're doing that currently. I assume this could be done once during the initialization task graph.
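To illustrate why ordering matters here, the sketch below assumes (as a simplification) that on the GPU backend the rows of an archetype from all worlds share one flat buffer, and that the exported tensor is read as contiguous per-world blocks. `Row`, `sortByWorld` and `doneAt` are hypothetical names for this example; this is not Madrona's sort implementation or its task graph API.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// One interface row in a buffer shared by all worlds.
struct Row {
    int32_t worldID;
    int32_t slot;
    int32_t done;
};

// Order rows by world, then by slot, so the flat buffer can be indexed as
// [world][slot].
inline void sortByWorld(std::vector<Row> &rows) {
    std::stable_sort(rows.begin(), rows.end(), [](const Row &a, const Row &b) {
        if (a.worldID != b.worldID) return a.worldID < b.worldID;
        return a.slot < b.slot;
    });
}

// Reads the buffer the way a tensor of shape [numWorlds, slotsPerWorld]
// would be read: purely by position, without looking at worldID.
inline int32_t doneAt(const std::vector<Row> &rows, std::size_t world,
                      std::size_t slot, std::size_t slotsPerWorld) {
    const std::size_t idx = world * slotsPerWorld + slot;
    assert(idx < rows.size());
    return rows[idx].done;
}
```

If rows from different worlds are interleaved, a positional read returns another world's values, which would be consistent with correct CPU output and scrambled CUDA output. Sorting once after the interface entities are created restores the expected layout in this simplified model.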

eugenevinitsky · 2024-09-10T02:09:36Z

Well that was quick, thanks!

* Init remove padding agents
* Testing
* Refactor out the controlled state
* Fix the tests
* Use by reference
* Temp fix for map export
* cleanup
* Fix merge issues
* Fix merge issues
* make consts 6000 to pass tests
* Use init counts for BVH
* Remove unused function
* Pass all tests
* Set controlledstate
* Zero out things
* Zero padded agents out
* Add a test.py for easy debugging
* Better check for map drawing.
* Set agents to 128
* Cycle through files in test.py
* Rename InterfaceEntity to AgentInterfaceEntity
* Only print dones
* Remove debug checks
* Separete out the observation systems
* Sort interfaces
* remove debug script
* Remove debug statements
* Minor improvements
* Info tensor interface matches main
* Correctly export response type and trajectory

aaravpandya added 11 commits May 5, 2024 00:15

Init remove padding agents

6b0aff7

Merge branch 'main' into ap_removePaddingAgents

6e37a75

Testing

f70350b

Merge branch 'main' into ap_removePaddingAgents

8904310

Merge branch 'main' into ap_removePaddingAgents

6265e41

Refactor out the controlled state

63fa549

Merge branch 'main' into ap_removePaddingAgents

e287a63

Fix the tests

55e411f

Use by reference

293f953

Temp fix for map export

2b5384c

cleanup

754f04c

aaravpandya marked this pull request as ready for review May 11, 2024 17:30

aaravpandya requested review from eugenevinitsky, SamanKazemkhani and daphne-cornelisse May 11, 2024 17:30

aaravpandya added 15 commits May 14, 2024 17:59

Merge branch 'main' into ap_removePaddingAgents

da59cb1

Fix merge issues

adbb5f9

Merge branch 'main' into ap_removePaddingAgents

9328909

Fix merge issues

50ca28b

Merge branch 'main' into ap_removePaddingAgents

3c1afdd

Merge branch 'main' into ap_removePaddingAgents

835e8fe

Merge branch 'main' into ap_removePaddingAgents

79f3a41

make consts 6000 to pass tests

1e46bbb

Merge branch 'main' into ap_removePaddingAgents

0e5c70b

Use init counts for BVH

8d38136

Merge branch 'main' into ap_removePaddingAgents

7ab4983

Merge branch 'main' into ap_removePaddingAgents

5917867

Merge branch 'main' into ap_removePaddingAgents

2139693

Remove unused function

6421861

Merge branch 'main' into ap_removePaddingAgents

e88af6c

aaravpandya added 7 commits August 27, 2024 22:14

Better check for map drawing.

e104d3d

Set agents to 128

ab3f3f7

Cycle through files in test.py

4e3a427

Merge branch 'main' into ap_removePaddingAgents

a1e19bb

Merge branch 'main' into ap_removePaddingAgents

5e7e162

Rename InterfaceEntity to AgentInterfaceEntity

1eab6d5

Only print dones

e10e2b3

aaravpandya requested a review from shacklettbp September 9, 2024 19:17

aaravpandya added 6 commits September 13, 2024 21:21

Remove debug checks

67021b7

Separete out the observation systems

11cb74a

Merge branch 'main' into ap_removePaddingAgents

78bb617

Sort interfaces

60cc090

remove debug script

5979e50

Remove debug statements

b353f8d

aaravpandya requested review from eugenevinitsky and daphne-cornelisse September 16, 2024 18:16

aaravpandya added 4 commits September 17, 2024 14:04

Merge branch 'main' into ap_removePaddingAgents

e9f4c83

Minor improvements

476e691

Info tensor interface matches main

d21a7e4

Correctly export response type and trajectory

7685237

daphne-cornelisse approved these changes Sep 20, 2024

aaravpandya requested a review from SamanKazemkhani September 20, 2024 18:25

aaravpandya merged commit dc160be into main Sep 21, 2024
1 check passed

aaravpandya deleted the ap_removePaddingAgents branch October 4, 2024 03:21

aaravpandya linked an issue Oct 9, 2024 that may be closed by this pull request

Run exps and merge LIDAR + padding change #223

Closed
