Address spurious LB+RB log flood on APC BXnnnnMI devices by jimklimov · Pull Request #2565 · networkupstools/nut

jimklimov · 2024-07-29T09:40:12Z

Closes: #2347
Also note for #2533 question

It also adds some visibility around calibration status setting, extends "dstate" API with a status_get() method, and this helps avoid setting duplicate states (roughly like "OB LB OB") seen in some drivers earlier.

I hope this toggle allows to fix the problem in the field by optionally delaying spurious status propagation from the driver by lbrb_log_delay_sec at most, and if the device is otherwise "online" and is calibrating (unless lbrb_log_delay_without_calibrating flag was also set).

The fix goes to some lengths to try detecting the device model during init to default the setting to 3 sec for this line-up, otherwise defaults to 0 (immediate status propagation).

@desertwitch @grifferz @ShiroDN @PilaScat @bitmario @marcgarciamarti @KillianMelsen @gerben838665 @mauro-dasilva @tsopokis @statte @s7uben @Sanderluc5 @ivanjx @gabrieleancora @JoshNansoz1 @rioachim @owenperkins111 : Better late than never: would you be able to try a custom build of NUT following https://github.com/networkupstools/nut/wiki/Building-NUT-for-in%E2%80%90place-upgrades-or-non%E2%80%90disruptive-tests to see if it handles the devices better?

For the git checkout, use this PR's source branch:

:; git clone https://github.com/jimklimov/nut -b issue-2347 nut
:; cd nut
...

If you run the built driver with debug verbosity of 2 or greater, it should log that it saw these calibration-like, LB and RB states, and chose to suppress them for a while according to settings. Checking that the numbers from CLI/ups.conf settings are propagated and considered correctly would also be helpful :)

Maybe these messages should be sunk to a less visible debug verbosity, eventually.

Also of interest is if the impacted devices report frequent calibration messages by default (without debug) and if that should be addressed additionally or if onlinedischarge_calibration and/or onlinedischarge_log_throttle_sec and related existing settings address it and the logs can be made peaceful and quiet already.

That "\n" gets printed as "networkupstools#12" Signed-off-by: Jim Klimov <[email protected]>

…us_get() Signed-off-by: Jim Klimov <[email protected]>

…etworkupstools#2347] Signed-off-by: Jim Klimov <[email protected]>

…ay_sec et al [networkupstools#2347] Signed-off-by: Jim Klimov <[email protected]>

jimklimov · 2024-07-30T07:23:22Z

Converting to draft while this is being tested, so NUT CI won't rebuild it in vain against newer target branch as it evolves.

jimklimov · 2024-08-05T08:17:04Z

Gentle bump. So many people complained about the issue, is anyone still interested in testing a prospective fix? :)

ivanjx · 2024-08-05T08:20:43Z

im new to this so please bear with me

so to test just need to clone this branch, compile, and sudo make install on the drivers folder?

the way i currently install nut is installing via apt first then overwrite it with manual compile and sudo make install

desertwitch · 2024-08-05T08:21:37Z

Gentle bump. So many people complained about the issue, is anyone still interested in testing a prospective fix? :)

Hi Jim, thanks a lot for the effort put into making this work for everyone.
Unfortunately not able to test due to lack of affected hardware, but will do a code review / sanity-check tonight.

jimklimov · 2024-08-05T08:42:26Z

im new to this so please bear with me

so to test just need to clone this branch, compile, and sudo make install on the drivers folder?

the way i currently install nut is installing via apt first then overwrite it with manual compile and sudo make install

Generally, yes. A finer approach is presented at https://github.com/networkupstools/nut/wiki/Building-NUT-for-in%E2%80%90place-upgrades-or-non%E2%80%90disruptive-tests which refers to the list of dependencies per platform, configure the new build similarly to what your packages (or older custom builds) delivered, and describes how to test a new driver from the build workspace before installing it over your older build for "production" use (or not, if the test is unsuccessful). Surely it is not the only way to skin a cat, but one best streamlined to exploratory custom builds.

desertwitch · 2024-08-06T12:37:06Z

Code looking good as usual, Jim, one thing we might want to consider is if we should default to lbrb_log_delay_without_calibrating = 1 for the affected APC series as well. I'm thinking this, as these spurious and seemingly random events might not always be preceded by an assumed or actual calibration. 3 seconds delay to registering an actual LB status will probably not make a difference in real life, as opposed to the very real annoyance of false statuses being reported and users then having to go search for the lbrb_log_delay_without_calibrating toggle in the manuals. Thanks for your efforts!

jimklimov · 2024-08-06T19:25:23Z

Well, given lbrb_log_delay_without_calibrating is a flag, users of those models would not be able to disable it. Can at least suggest it in autodetection message, though.

Signed-off-by: Jim Klimov <[email protected]>

…etworkupstools#2437] Signed-off-by: Jim Klimov <[email protected]>

… tweaks since 2023)" [networkupstools#2347] Signed-off-by: Jim Klimov <[email protected]>

jimklimov · 2024-08-12T06:10:58Z

The CI faults are due to a change with an agent after an upgrade (lacked 32-bit libs for some dependencies now).

jimklimov · 2024-08-12T07:13:04Z

Tested the monster message printer, works well but relies on math a bit (that the testvar findings are exactly 0 or 1) so will poke that a bit later.

Not sure why CI builds that code path where it failed due to missing libs, by config it should not have.

…actly 0/1 [networkupstools#2347] When building a complex text expression, we rely on maths in some spots. Signed-off-by: Jim Klimov <[email protected]>

…etection Signed-off-by: Jim Klimov <[email protected]>

Signed-off-by: Jim Klimov <[email protected]>

…ctually build a graphical program Namely, that further third-party libs are available for the chosen architecture, not only the headers. Had a problem with 32/64-bit build agent that only had a binary lib*.so set for 64-bit after an update. Signed-off-by: Jim Klimov <[email protected]>

desertwitch · 2024-08-12T09:19:40Z

Tested the monster message printer, works well but relies on math a bit (that the testvar findings are exactly 0 or 1) so will poke that a bit later.

Not sure why CI builds that code path where it failed due to missing libs, by config it should not have.

Thanks for the additions, is there a process for testing drivers and such hardware-specific code without actually having the affected hardware (which I assume you don't either)? Would love to help out with testing such code, but not sure how to go about that with the drivers (dummy-ups wouldn't work for a specific driver, right?). That message printer, as an example.

jimklimov · 2024-08-12T10:27:12Z

In this case, I copy-pasted the block as a C program, replacing `upsdebugx` with `printf` and fiddling with the `got_*` flags that already were `int`s cached to avoid many `testvar()` calls :) So not much of an established process yet, although there are some precedents in `tests/*.c{,pp}` files about poking into e.g. `driver/main.c` code for a semblance of unit tests...

…

On Mon, Aug 12, 2024, 11:20 Rysz ***@***.***> wrote: Tested the monster message printer, works well but relies on math a bit (that the testvar findings are exactly 0 or 1) so will poke that a bit later. Not sure why CI builds that code path where it failed due to missing libs, by config it should not have. Thanks for the additions, is there a process for testing drivers and such hardware-specific code without actually having the affected hardware (which I assume you don't either)? Would love to help out with testing such code, but not sure how to go about that with the drivers (dummy-ups wouldn't work for a specific driver, right?). — Reply to this email directly, view it on GitHub <#2565 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAMPTFDW46BKV6MPTBETODLZRB44HAVCNFSM6AAAAABLT4WXO6VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDEOBTGQ3TQNBYGE> . You are receiving this because you authored the thread.Message ID: ***@***.***>

…tworkupstools#2565, networkupstools#2708] Signed-off-by: Jim Klimov <[email protected]>

…rings of another [networkupstools#2708, networkupstools#2565] Signed-off-by: Jim Klimov <[email protected]>

invario · 2025-05-06T12:52:00Z

Hi, just wanted to note that the LB/RB problems occurs for my new APC BVK750M2 also. Going to try this fix and see how it goes. Thanks for your work.

jimklimov added 4 commits July 29, 2024 11:27

drivers/usbhid-ups.c: avoid "\n" starting an upsdebugx() message

5a52815

That "\n" gets printed as "networkupstools#12" Signed-off-by: Jim Klimov <[email protected]>

drivers/dstate.{c,h}, docs/new-drivers.txt, NEWS.adoc: introduce stat…

67a3bc1

…us_get() Signed-off-by: Jim Klimov <[email protected]>

drivers/usbhid-ups.c: track start and end timestamps of calibration [n…

1f5b686

…etworkupstools#2347] Signed-off-by: Jim Klimov <[email protected]>

drivers/usbhid-ups.c, docs/man/usbhid-ups.txt: introduce lbrb_log_del…

d15ab31

…ay_sec et al [networkupstools#2347] Signed-off-by: Jim Klimov <[email protected]>

jimklimov added this to the 2.8.3 milestone Jul 29, 2024

jimklimov requested review from aquette and clepple July 29, 2024 09:40

jimklimov mentioned this pull request Jul 29, 2024

APC Back-UPS BX1600MI spurious LOWBATT/REPLACEBATT events #2347

Closed

Merge branch 'master' into issue-2347

82abc3d

jimklimov marked this pull request as draft July 30, 2024 07:23

jimklimov added 2 commits August 1, 2024 08:49

Merge branch 'master' into issue-2347

5f7630c

Merge branch 'master' into issue-2347

fd39200

jimklimov marked this pull request as ready for review August 5, 2024 08:17

Merge branch 'master' into issue-2347

6401b70

jimklimov marked this pull request as draft August 5, 2024 17:34

desertwitch approved these changes Aug 6, 2024

View reviewed changes

Merge remote-tracking branch 'upstream/master' into issue-2347

a27d0ab

Signed-off-by: Jim Klimov <[email protected]>

drivers/usbhid-ups.c: clarify suggested settings for LB+RB log flood [n…

2986cc2

…etworkupstools#2437] Signed-off-by: Jim Klimov <[email protected]>

jimklimov force-pushed the issue-2347 branch from 117eaa9 to 2986cc2 Compare August 11, 2024 19:30

data/driver.list.in: update about "Back-UPS BX****MI Series (may need…

61574c2

… tweaks since 2023)" [networkupstools#2347] Signed-off-by: Jim Klimov <[email protected]>

jimklimov added 4 commits August 12, 2024 07:44

drivers/usbhid-ups.c: upsdrv_initups(): cache testvar() outcome as ex…

120cd2a

…actly 0/1 [networkupstools#2347] When building a complex text expression, we rely on maths in some spots. Signed-off-by: Jim Klimov <[email protected]>

ci_build.sh: fix shell syntax that could confuse CANBUILD_LIBGD_CGI d…

1476ad7

…etection Signed-off-by: Jim Klimov <[email protected]>

ci_build.sh: actually honour CANBUILD_LIBGD_CGI=no

015e238

Signed-off-by: Jim Klimov <[email protected]>

jimklimov merged commit 443ba6a into networkupstools:master Aug 12, 2024

jimklimov deleted the issue-2347 branch August 12, 2024 21:21

jimklimov mentioned this pull request Oct 7, 2024

APC Back-UPS BX1200MI connection problems nut_libusb_get_report: Input/Output Error #2653

Closed

This was referenced Nov 6, 2024

NUT 2.8.1-3 "Can't claim USB device [051d:0002]@0/0: Entity not found" using usbhid-ups #2666

Closed

nut_libusb_get_interrupt: Connection timed out on debug #2644

Open

jimklimov mentioned this pull request Dec 30, 2024

[HCL] APC BX950MI supported by usbhid-ups #2741

Closed

jimklimov mentioned this pull request Feb 10, 2025

Clean-up or legalise: revise status_set() values vs. the NUT standard dictionary #2708

Open

jimklimov added a commit to jimklimov/nut that referenced this pull request Feb 10, 2025

drivers/dstate.c: fix status_get(), got inverted maths in a check [ne…

33948aa

…tworkupstools#2565, networkupstools#2708] Signed-off-by: Jim Klimov <[email protected]>

jimklimov added a commit to jimklimov/nut that referenced this pull request Feb 11, 2025

drivers/dstate.c: fix status_get(), got inverted maths in a check [ne…

d43377b

…tworkupstools#2565, networkupstools#2708] Signed-off-by: Jim Klimov <[email protected]>

jimklimov mentioned this pull request Feb 11, 2025

Fix handling of spaces in driver status_set() args #2801

Merged

jimklimov added a commit to jimklimov/nut that referenced this pull request Feb 11, 2025

drivers/dstate.c: status_get(): properly handle tokens that are subst…

0df4947

…rings of another [networkupstools#2708, networkupstools#2565] Signed-off-by: Jim Klimov <[email protected]>

jimklimov added a commit to jimklimov/nut that referenced this pull request Feb 11, 2025

drivers/dstate.c: status_get(): properly handle tokens that are subst…

332cd9e

…rings of another [networkupstools#2708, networkupstools#2565] Signed-off-by: Jim Klimov <[email protected]>

jimklimov restored the issue-2347 branch April 15, 2025 09:55

jimklimov deleted the issue-2347 branch April 15, 2025 10:41

This was referenced May 7, 2025

APC model BVK750M2 false indications of LB/RB #2942

Closed

Add APC BVKnnnnM2 to list of devices affected by spurious LB/RB messages #2944

Merged

jimklimov mentioned this pull request Jul 3, 2025

drivers/pijuice.c, NEWS.adoc: revise use of status_set() [#2708] #3005

Merged

andyegg mentioned this pull request Jul 4, 2025

False LB+RB indication on APC BK650M2-CH #3006

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Address spurious LB+RB log flood on APC BXnnnnMI devices#2565

Address spurious LB+RB log flood on APC BXnnnnMI devices#2565
jimklimov merged 18 commits intonetworkupstools:masterfrom
jimklimov:issue-2347

jimklimov commented Jul 29, 2024 •

edited

Loading

Uh oh!

jimklimov commented Jul 30, 2024

Uh oh!

jimklimov commented Aug 5, 2024

Uh oh!

ivanjx commented Aug 5, 2024 •

edited

Loading

Uh oh!

desertwitch commented Aug 5, 2024

Uh oh!

jimklimov commented Aug 5, 2024

Uh oh!

desertwitch commented Aug 6, 2024

Uh oh!

jimklimov commented Aug 6, 2024

Uh oh!

jimklimov commented Aug 12, 2024

Uh oh!

jimklimov commented Aug 12, 2024

Uh oh!

desertwitch commented Aug 12, 2024 •

edited

Loading

Uh oh!

jimklimov commented Aug 12, 2024 via email •

edited

Loading

Uh oh!

invario commented May 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Uh oh!

Conversation

jimklimov commented Jul 29, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jimklimov commented Jul 30, 2024

Uh oh!

jimklimov commented Aug 5, 2024

Uh oh!

ivanjx commented Aug 5, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

desertwitch commented Aug 5, 2024

Uh oh!

jimklimov commented Aug 5, 2024

Uh oh!

desertwitch commented Aug 6, 2024

Uh oh!

jimklimov commented Aug 6, 2024

Uh oh!

jimklimov commented Aug 12, 2024

Uh oh!

jimklimov commented Aug 12, 2024

Uh oh!

desertwitch commented Aug 12, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jimklimov commented Aug 12, 2024 via email • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

invario commented May 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

jimklimov commented Jul 29, 2024 •

edited

Loading

ivanjx commented Aug 5, 2024 •

edited

Loading

desertwitch commented Aug 12, 2024 •

edited

Loading

jimklimov commented Aug 12, 2024 via email •

edited

Loading