otel: fixes and perf adjustments by alltilla · Pull Request #4827 · syslog-ng/syslog-ng

alltilla · 2024-02-12T09:37:23Z

No description provided.

Signed-off-by: Attila Szakacs <[email protected]>

to control how often we call batch cbs. Signed-off-by: Attila Szakacs <[email protected]>

Signed-off-by: Attila Szakacs <[email protected]>

MrAnno · 2024-02-12T09:41:09Z

modules/grpc/otel/otel-source-services.hpp

-                  response_status = ::grpc::Status(::grpc::StatusCode::UNAVAILABLE, "Server is unavailable");
-                  break;
-                }
+              worker.post(msg);


What exactly will happen to the clients if we block and don't respond within a reasonable time?
Previously, we answered UNAVAILABLE, which seemed more native to the gRPC/HTTP environment.

Also, what was the trigger of this change?

If a request timeout is not set on the client side, they will wait indefinitely.
If timeout is set, the client will get a DEADLINE_EXCEEDED result, which should be handled as a temporary error (we do so in the otel dest).

https://grpc.io/docs/guides/deadlines/#deadlines-on-the-client

We have seen that syslog-ng "gives up" too quickly if it cannot forward the message. If we have multiple otel source workers, and they receive requests at the same exact time, the destination, which grabs those messages might not be able to grab them quickly enough. So even if we used a devnull destination, which can process 2 million messages per sec, sometimes it can put backpressure on the otel source, if there is a high transient load coming from it. We can raise log-iw-size() to buffer more messages and flatten these transients, but it raises the memory footprint considerably.

We could have a timeout for this, like trying blocking post, but surrendering after x seconds and returning unavailable. But I believe the current implementation is completely functional, too, at least on the server side. It would be nice to have options for request timeout on the client side (otel dest).

Sounds reasonable. My only concern is that the original otel-source implementation was not necessarily a "thread-per-client" implementation. By blocking, I think we make it so.

True.

However, concurrent-requests() makes us able to scale to more clients than workers(), which should be sufficient. With it the workers() option detaches from limiting the number of clients (gRPC will handle them for us if there are enough ServiceCalls registered), and just configures the processing parallelization.

github-actions · 2024-02-12T09:46:30Z

This Pull Request introduces config grammar changes

syslog-ng/be526b484d8913b98edbe043e0d5a783d6d0e7d7 -> alltilla/otel-perf-adjustment

Details

--- a/destination
+++ b/destination

 bigquery(
+    channel-args(
+        <empty>
+        <string> => <number>
+        <string> => <string>
+    )
 )

 loki(
+    channel-args(
+        <empty>
+        <string> => <number>
+        <string> => <string>
+    )
 )

 opentelemetry(
+    channel-args(
+        <empty>
+        <string> => <number>
+        <string> => <string>
+    )
 )

 syslog-ng-otlp(
+    channel-args(
+        <empty>
+        <string> => <number>
+        <string> => <string>
+    )
 )

--- a/source
+++ b/source

 opentelemetry(
+    channel-args(
+        <empty>
+        <string> => <number>
+        <string> => <string>
+    )
+    concurrent-requests(<positive-integer>)
+    log-fetch-limit(<nonnegative-integer>)
 )

 syslog-ng-otlp(
+    channel-args(
+        <empty>
+        <string> => <number>
+        <string> => <string>
+    )
+    concurrent-requests(<positive-integer>)
+    log-fetch-limit(<nonnegative-integer>)
 )

Signed-off-by: Attila Szakacs <[email protected]>

alltilla · 2024-02-12T10:37:06Z

Pushed style fixes.

Signed-off-by: Attila Szakacs <[email protected]>

alltilla added 7 commits February 12, 2024 10:37

otel: fix ctor initializer list order

257fede

Signed-off-by: Attila Szakacs <[email protected]>

otel: use blocking post in source

c7aa02b

Signed-off-by: Attila Szakacs <[email protected]>

otel: allocate one completion queue for each worker

8a2f387

Signed-off-by: Attila Szakacs <[email protected]>

otel: check for under_termination in source

d8bd934

Signed-off-by: Attila Szakacs <[email protected]>

otel: split log-iw-size() between workers

8d07ed3

Signed-off-by: Attila Szakacs <[email protected]>

otel: add log-fetch-limit() option

6851cd1

to control how often we call batch cbs. Signed-off-by: Attila Szakacs <[email protected]>

otel: add concurrent-requests() to source

3b6742a

Signed-off-by: Attila Szakacs <[email protected]>

alltilla added a commit to alltilla/syslog-ng that referenced this pull request Feb 12, 2024

news: add entries for syslog-ng#4827

c56a0a2

Signed-off-by: Attila Szakacs <[email protected]>

alltilla force-pushed the otel-perf-adjustment branch from 3c45564 to c56a0a2 Compare February 12, 2024 09:38

MrAnno reviewed Feb 12, 2024

View reviewed changes

alltilla added 4 commits February 12, 2024 11:36

otel: add channel-args() option

95efc6a

Signed-off-by: Attila Szakacs <[email protected]>

bigquery: add channel-args()

5670120

Signed-off-by: Attila Szakacs <[email protected]>

loki: add channel-args()

1a15bd4

Signed-off-by: Attila Szakacs <[email protected]>

news: add entries for syslog-ng#4827

bdb6990

Signed-off-by: Attila Szakacs <[email protected]>

alltilla force-pushed the otel-perf-adjustment branch from c56a0a2 to bdb6990 Compare February 12, 2024 10:37

MrAnno approved these changes Feb 12, 2024

View reviewed changes

MrAnno merged commit 79ed5c9 into syslog-ng:master Feb 12, 2024

bshifter pushed a commit to bshifter/syslog-ng that referenced this pull request Feb 19, 2024

news: add entries for syslog-ng#4827

6622d9e

Signed-off-by: Attila Szakacs <[email protected]>

bshifter pushed a commit to bshifter/syslog-ng that referenced this pull request Feb 22, 2024

news: add entries for syslog-ng#4827

90d628d

Signed-off-by: Attila Szakacs <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

otel: fixes and perf adjustments#4827

otel: fixes and perf adjustments#4827
MrAnno merged 11 commits intosyslog-ng:masterfrom
alltilla:otel-perf-adjustment

alltilla commented Feb 12, 2024 •

edited

Loading

Uh oh!

MrAnno Feb 12, 2024

Uh oh!

alltilla Feb 12, 2024

Uh oh!

MrAnno Feb 12, 2024

Uh oh!

alltilla Feb 12, 2024

Uh oh!

github-actions bot commented Feb 12, 2024

Uh oh!

alltilla commented Feb 12, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

alltilla commented Feb 12, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MrAnno Feb 12, 2024

Choose a reason for hiding this comment

Uh oh!

alltilla Feb 12, 2024

Choose a reason for hiding this comment

Uh oh!

MrAnno Feb 12, 2024

Choose a reason for hiding this comment

Uh oh!

alltilla Feb 12, 2024

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Feb 12, 2024

This Pull Request introduces config grammar changes

Uh oh!

alltilla commented Feb 12, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

alltilla commented Feb 12, 2024 •

edited

Loading