-
I built from the latest master with PCM_USE_PERF disabled and see this error on two different platforms so far (Broadwell and Haswell).
-
Does it work if you don't use the PCM_USE_PERF variable? Could you share the output when PCM_USE_PERF is not set?
-
Thank you for the explanation. Haswell, Broadwell, and Skylake do not support the same set of metrics. If I remember correctly, Haswell does not support the LMB and RMB memory bandwidth metrics, and Skylake does not support L2HIT by default. You can check by running the main pcm binary (not pcm-sensor-server) and seeing which metrics it prints on each platform. When you use PCM_NO_PERF=1, the perf_event API is not used and PCM programs the PMU directly. Other clients that use the PMU through the perf kernel API then do not know that PCM is already using it, and they corrupt the PMU counter programming (the errors you see). If everyone uses the perf_event API, access is coordinated and no conflicts occur.
I decoded the value 1114304 from your log: someone is programming the PMU to collect the INST_RETIRED.ANY_P event on core 22 while you are running PCM. Do you have any idea what that could be?
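For anyone who wants to repeat that decoding, here is a minimal sketch in Python. It assumes the logged value is the content of an IA32_PERFEVTSELx register and splits it using the architectural field layout from the Intel SDM. The function name `decode_perfevtsel` and the one-entry `KNOWN_EVENTS` table are made up for illustration and are not part of PCM.

```python
def decode_perfevtsel(value: int) -> dict:
    """Split an IA32_PERFEVTSELx-style value into its architectural fields."""
    return {
        "event_select": value & 0xFF,          # bits 0-7
        "umask": (value >> 8) & 0xFF,          # bits 8-15
        "usr": bool((value >> 16) & 1),        # count in user mode
        "os": bool((value >> 17) & 1),         # count in kernel mode
        "edge": bool((value >> 18) & 1),       # edge detect
        "pc": bool((value >> 19) & 1),         # pin control
        "int": bool((value >> 20) & 1),        # interrupt on overflow
        "any_thread": bool((value >> 21) & 1), # any-thread counting
        "en": bool((value >> 22) & 1),         # counter enable
        "inv": bool((value >> 23) & 1),        # invert counter mask
        "cmask": (value >> 24) & 0xFF,         # bits 24-31
    }


# Hypothetical lookup with a single entry; a real tool would use the
# full per-microarchitecture event list.
KNOWN_EVENTS = {(0xC0, 0x00): "INST_RETIRED.ANY_P"}

fields = decode_perfevtsel(1114304)  # 1114304 == 0x1100C0
name = KNOWN_EVENTS.get((fields["event_select"], fields["umask"]), "unknown")
print(hex(1114304), name, fields)
```

Event select 0xC0 with umask 0x00 is the INST_RETIRED.ANY_P encoding, which matches the event named above. The remaining set bits are only flags around that event, so they may help narrow down which tool programmed the counter.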
-
There's an entire development team behind the OS I am testing on. I will pursue an answer to the question of what else might be programming the PMU. If I cannot resolve the PMU sharing issue, then it seems the perf_event API is the way to go. This is what I see from pcm on the Haswell system with a master build using the perf_event API:
-
I also don't have access to a Skylake system right now, so it is easier to debug on Haswell. I can get one if we need it to move forward on this issue. Thanks.