Permanent errors during offset validation should be retried #4447

Merged
Emanuele Sabellico (emasab) merged 6 commits into master from dev_fix_permanent_error_during_validation
Sep 29, 2023

@emasab Emanuele Sabellico (emasab) commented Sep 26, 2023

During offset validation a permanent error like host resolution failure
would cause an offset reset.
This isn't what's expected or what the Java implementation does.
Solved by retrying even in case of permanent errors.
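
The change in behaviour described above can be sketched as a small decision function. This is a minimal illustration only, not librdkafka's actual code: the enum names, `action_before_fix`, `action_after_fix` and `position_survives` are all hypothetical, and the sketch assumes that transient errors were already retried before this change. It also leaves out the case where validation succeeds but detects log truncation, which is handled separately.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical error classes, not librdkafka's real error codes. */
typedef enum {
        ERR_NONE,      /* validation request succeeded */
        ERR_TRANSIENT, /* e.g. a request timeout */
        ERR_PERMANENT  /* e.g. host resolution failure */
} validation_err_t;

/* Hypothetical outcomes of handling one validation response. */
typedef enum {
        ACTION_CONTINUE,    /* offset validated, resume fetching */
        ACTION_RETRY,       /* keep the current position, validate again */
        ACTION_RESET_OFFSET /* fall back to the offset reset policy */
} validation_action_t;

/* Behaviour as described for the old code (assumed): a permanent
 * error falls through to an offset reset. */
static validation_action_t action_before_fix(validation_err_t err) {
        switch (err) {
        case ERR_NONE:
                return ACTION_CONTINUE;
        case ERR_TRANSIENT:
                return ACTION_RETRY;
        default:
                return ACTION_RESET_OFFSET;
        }
}

/* Behaviour after the change: every error is retried. */
static validation_action_t action_after_fix(validation_err_t err) {
        return err == ERR_NONE ? ACTION_CONTINUE : ACTION_RETRY;
}

typedef validation_action_t (*policy_fn)(validation_err_t);

/* Replay a sequence of validation outcomes under a policy.
 * Returns true if the consumer position is never reset. */
static bool position_survives(policy_fn policy,
                              const validation_err_t *outcomes,
                              size_t n) {
        assert(policy != NULL && (outcomes != NULL || n == 0));
        for (size_t i = 0; i < n; i++) {
                validation_action_t action = policy(outcomes[i]);
                if (action == ACTION_RESET_OFFSET)
                        return false;
                if (action == ACTION_CONTINUE)
                        return true;
                /* ACTION_RETRY: move on to the next attempt. */
        }
        return true; /* still retrying, position untouched */
}
```

Replaying a short outage (a transient error, then a permanent one, then success) through both policies shows the difference: under `action_before_fix` the position is lost at the permanent error, while under `action_after_fix` it is kept until validation finally succeeds.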

Closes #4427.

@milindl Milind L (milindl) left a comment

LGTM given the context modulo one question.
Not much we can do about it though, I guess?

Comment thread tests/0139-offset_validation_mock.c
@emasab Emanuele Sabellico (emasab) deleted the dev_fix_permanent_error_during_validation branch September 29, 2023 11:13
Axel Andersson (axelandersson) added a commit to axelandersson/librdkafka that referenced this pull request Oct 5, 2023
* upstream/master:
  librdkafka v2.3.0 (confluentinc#4455)
  Fix for idempotent producer fatal errors, triggered after a possibly persisted message state (confluentinc#4438)
  Move can_q_contain_fetched_msgs inside q_serve (confluentinc#4431)
  [KIP-580] Exponential Backoff with Mock Broker Changes to Automate Testing. (confluentinc#4422)
  Update only the mklove version of OpenSSL to 3.0.11 (confluentinc#4454)
  Permanent errors during offset validation should be retried (confluentinc#4447)
  Increased flexver request size for Metadata request to include topic_id size (confluentinc#4453)
  Fix loop of OffsetForLeaderEpoch requests on quick leader changes (confluentinc#4433)
  Fix for stored offsets not being committed if they lacked the leader epoch (confluentinc#4442)
  Add leader epoch to control messages (confluentinc#4434)
  Refactored tmpabuf and fixed an insufficient buffer allocation (confluentinc#4449)
  Work around KIP-700 restrictions for DescribeCluster [KIP-430]
  [admin] KIP-430: Add authorized operations to describe API
  Fix segfault if assignor state is NULL, (confluentinc#4381)

Development

Successfully merging this pull request may close these issues.

Fluctuations and resets in committed offsets due to temporal network outage

2 participants