Skip to content

[Master] Bug: [Chassis] Orchagent crashes in SUPERVISOR #22478

@saksarav-nokia

Description

@saksarav-nokia

Is it platform specific

generic

Importance or Severity

Critical

Description of the bug

With the latest master with sonic-swss PR sonic-net/sonic-swss#3504, the orchagent continually crashes for all the swss dockers in Superviosr. The IMMs seem to be staying up.

Reverting this PR fixed the crash

Steps to Reproduce

Build the image with latest github master and load the image in SUP

Actual Behavior and Expected Behavior

The box should not crash

orchagent.1745939790.40.13.core.gz

Relevant log output

Attached the docker image and core file, also the back trace

[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".
Core was generated by `/usr/bin/orchagent -d /var/log/swss -b 1024 -s -i nokia-bdb:7:1 -f swss.asic13.'.
Program terminated with signal SIGSEGV, Segmentation fault.
#0  0x00005633a1e79716 in __gnu_cxx::__normal_iterator<std::reference_wrapper<FlexCounterCachedManager>*, std::vector<std::reference_wrapper<FlexCounterCachedManager>, std::allocator<std::reference_wrapper<FlexCounterCachedManager> > > >::__normal_iterator (__i=<error reading variable: Cannot access memory at address 0xbb8>, this=<optimized out>) at /usr/include/c++/12/bits/stl_iterator.h:1073
1073	/usr/include/c++/12/bits/stl_iterator.h: No such file or directory.
[Current thread is 1 (Thread 0x7f7a7483e480 (LWP 40))]
(gdb) bt
#0  0x00005633a1e79716 in __gnu_cxx::__normal_iterator<std::reference_wrapper<FlexCounterCachedManager>*, std::vector<std::reference_wrapper<FlexCounterCachedManager>, std::allocator<std::reference_wrapper<FlexCounterCachedManager> > > >::__normal_iterator (__i=<error reading variable: Cannot access memory at address 0xbb8>, this=<optimized out>) at /usr/include/c++/12/bits/stl_iterator.h:1073
#1  std::vector<std::reference_wrapper<FlexCounterCachedManager>, std::allocator<std::reference_wrapper<FlexCounterCachedManager> > >::begin (this=0xbb8) at /usr/include/c++/12/bits/stl_vector.h:869
#2  PortsOrch::flushCounters (this=0x0) at ./orchagent/portsorch.cpp:8384
#3  0x00005633a206f5c3 in FlexCounterOrch::doTask (this=0x7f7a7444e900, consumer=...) at ./orchagent/flexcounterorch.cpp:274
#4  0x00005633a1dded53 in Consumer::execute (this=0x7f7a745fe600) at ./orchagent/orch.cpp:338
#5  0x00005633a1dcacb0 in OrchDaemon::start (this=this@entry=0x7f7a7443e510, heartBeatInterval=heartBeatInterval@entry=10000) at ./orchagent/orchdaemon.cpp:978
#6  0x00005633a1d30b10 in main (argc=<optimized out>, argv=<optimized out>) at ./orchagent/main.cpp:860
(gdb) frame 2
#2  PortsOrch::flushCounters (this=0x0) at ./orchagent/portsorch.cpp:8384
8384	./orchagent/portsorch.cpp: No such file or directory.
(gdb)

Output of show version, show techsupport

Attach files (if any)

No response

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

Status

Done

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions