-
Notifications
You must be signed in to change notification settings - Fork 554
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
portsorch: don't call updateDbPortOperStatus on all port types #3505
portsorch: don't call updateDbPortOperStatus on all port types #3505
Conversation
/azp run |
Azure Pipelines successfully started running 1 pipeline(s). |
PORT_TABLE contains PortChannel oper_status entries which are not expected by portsorch which leads to warm/fastreboot failures like: ``` 2025 Feb 10 09:33:07.111055 sonic NOTICE swss#orchagent: :- bake: foundPortConfigDone = 1 2025 Feb 10 09:33:07.111080 sonic NOTICE swss#orchagent: :- bake: foundPortInitDone = 1 2025 Feb 10 09:33:07.111395 sonic NOTICE swss#orchagent: :- bake: m_portTable->getKeys 263 2025 Feb 10 09:33:07.111403 sonic NOTICE swss#orchagent: :- bake: portCount = 257, m_portCount = 0 2025 Feb 10 09:33:07.111403 sonic ERR swss#orchagent: :- bake: Invalid port table: portCount, expecting 257, got 261 ``` Fixes sonic-net/sonic-buildimage#21688 Signed-off-by: Brad House (@bradh352)
5172cec
to
6480e04
Compare
/azp run |
Azure Pipelines successfully started running 1 pipeline(s). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me, can you cover this with a VS test?
I'm not at all familiar with the vs testing framework. It would be a lot easier in test_portchannel.py to check to make sure there are no PortChannel entries in the port table. Would that be acceptable since that would have caught this regression? |
Sure |
/azp run |
Azure Pipelines successfully started running 1 pipeline(s). |
Test added, I haven't tested locally but CI here should validate I didn't make a typo or something. |
Looks like all tests passed. Can you approve? |
Co-authored-by: Stepan Blyshchak <[email protected]>
/azp run |
Azure Pipelines successfully started running 1 pipeline(s). |
@stepanblyschak maybe now approve it? :) |
@prsunny Can you help review? |
@prgeor, @vaibhavhd for viz |
Has someone verified that the warm-reboot issue does not exist with the TUNNEL oper status still present in the DB? The failure was seen due to portchannel in the DB and that I believe is fixed. But, we may still have a problem with TUNNEL status in DB. That test_wr_arp validates that part by running warm-reboot with VXLAN tunnel programmed. Please check. |
The tunnel port type gets routed to a different table so there's no way it could cause the same issue: sonic-swss/orchagent/portsorch.cpp Lines 3427 to 3442 in d3d95ab
|
What I did
Only call
updateDbPortOperStatus()
on PHY and TUNNEL ports.Why I did it
PORT_TABLE contains PortChannel oper_status entries which are not expected by portsorch which leads to warm/fastreboot failures like:
Regression caused by #3383
How I verified it
Asked @stepanblyschak to verify who reported issue.
Details if related
Fixes sonic-net/sonic-buildimage#21688
Signed-off-by: Brad House (@bradh352)