Handle -READONLY as a redirection signal for Redis Cluster (AWS ElastiCache support) by NivekNK · Pull Request #1656 · predis/predis

NivekNK · 2026-03-18T15:59:40Z

This PR introduces explicit handling for the -READONLY error response within the RedisCluster connection class.

The Problem:
In AWS ElastiCache (Redis OSS mode), a client that connects through the Configuration Endpoint (DNS round-robin) may occasionally land on a replica. If a Lua script (with KEYS[]) is executed against this replica, the server returns the error "-READONLY You can't write against a read only replica" instead of a -MOVED redirection.

Currently, Predis treats this as a generic ServerException. This can lead to intermittent failures: the internal slot map is neither marked as stale nor updated when this specific protocol error arrives, so a retry can hit the same node.

The Solution:

- Modified onErrorResponse to intercept the READONLY prefix.
- Implemented onReadOnlyResponse, which:
  - Disconnects the current faulty connection.
  - Triggers askSlotMap() to refresh the cluster topology.
  - Re-executes the command, allowing the distributor to pick the correct Master node based on the updated map.
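
The three steps above can be illustrated with a small toy model. This is only a sketch in Python, not the PR's PHP code: `Node`, `ToyCluster`, and the dictionary-based slot map are hypothetical stand-ins, and only the method names loosely mirror onErrorResponse, onReadOnlyResponse, and askSlotMap() from the description.

```python
class ErrorResponse:
    """Toy stand-in for a server error reply (hypothetical, not Predis's class)."""
    def __init__(self, message):
        self.message = message


class Node:
    """Toy cluster node; a replica rejects writes with READONLY."""
    def __init__(self, name, role):
        self.name = name
        self.role = role
        self.connected = True

    def execute(self, command):
        if self.role == "replica" and command["write"]:
            return ErrorResponse("READONLY You can't write against a read only replica.")
        return f"OK from {self.name}"

    def disconnect(self):
        self.connected = False


class ToyCluster:
    """Assumed model: a client-side slot map that may be stale."""
    def __init__(self, nodes, true_owner, stale_owner):
        self.nodes = nodes                  # node name -> Node
        self.true_owner = true_owner        # what the servers would report
        self.slot_map = dict(stale_owner)   # what the client currently believes
        self.refreshes = 0

    def ask_slot_map(self):
        # Refresh the client's view of the topology.
        self.slot_map = dict(self.true_owner)
        self.refreshes += 1

    def execute(self, command):
        node = self.nodes[self.slot_map[command["slot"]]]
        response = node.execute(command)
        if isinstance(response, ErrorResponse):
            return self.on_error_response(command, node, response)
        return response

    def on_error_response(self, command, node, error):
        # Intercept the READONLY prefix; anything else stays a generic error.
        prefix = error.message.split(" ", 1)[0]
        if prefix == "READONLY":
            return self.on_read_only_response(command, node)
        raise RuntimeError(error.message)

    def on_read_only_response(self, command, node):
        node.disconnect()        # 1. drop the faulty connection
        self.ask_slot_map()      # 2. refresh the topology
        target = self.nodes[self.slot_map[command["slot"]]]
        response = target.execute(command)   # 3. re-execute on the mapped node
        if isinstance(response, ErrorResponse):
            raise RuntimeError(response.message)
        return response


nodes = {
    "replica-1": Node("replica-1", "replica"),
    "master-1": Node("master-1", "master"),
}
cluster = ToyCluster(nodes, true_owner={42: "master-1"}, stale_owner={42: "replica-1"})
result = cluster.execute({"slot": 42, "write": True})
```

In this toy run the first attempt lands on the replica, the handler refreshes the map once, and the re-executed command reaches the master. The real implementation may differ in how it selects the connection and how it treats a second failure.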

This approach follows the existing reactive discovery pattern in Predis and ensures high availability in AWS ElastiCache environments without requiring a new configuration option.

vladvildanov · 2026-03-19T08:31:37Z

@NivekNK I still don't understand why we need this specific post-retry exception handling, since the READONLY error is wrapped in a ServerException and handling that exception already enables a slot map update.

Please provide a unit test case that shows the behaviour when a READONLY error wrapped in a ServerException is thrown. I want to understand whether we internally retry and update the topology and, if we do, why we need another topology update after all retries have happened.

NivekNK · 2026-03-19T13:58:06Z

@vladvildanov Hi! I've added the unit test you requested to demonstrate the exact behavior when a READONLY error is wrapped inside a ServerException.

As the new test shows, when ServerException is thrown, Predis does catch it and triggers the retry mechanism. However, it does not trigger a topology update (askSlotMap() and disconnect() are never called) because onFailCallback() only evaluates and handles connection-level errors (ConnectionException). As a result, the automated retry keeps hitting the very same broken node until it reaches the retry limit, and the exception then bubbles up to the end user.
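
The behaviour described above can be modelled roughly as follows. This is a toy sketch in Python under stated assumptions, not the unit test added in the PR: the `retry` helper, the `Replica` class, and the retry count of 3 are hypothetical, and the exception classes only borrow the names ServerException and ConnectionException from the discussion.

```python
class ServerException(Exception):
    """Toy stand-in for a protocol-level error such as READONLY."""


class ConnectionException(Exception):
    """Toy stand-in for a connection-level failure."""


class Replica:
    """Node that always answers writes with READONLY and counts the attempts."""
    def __init__(self):
        self.calls = 0

    def execute(self, command):
        self.calls += 1
        raise ServerException("READONLY You can't write against a read only replica.")


topology_refreshes = []


def on_fail_callback(exc):
    # Assumed behaviour: only connection-level errors trigger a topology refresh.
    if isinstance(exc, ConnectionException):
        topology_refreshes.append(exc)


def retry(node, command, max_retries, on_fail):
    # Hypothetical retry loop: same node on every attempt, no re-routing.
    attempt = 0
    while True:
        try:
            return node.execute(command)
        except (ServerException, ConnectionException) as exc:
            on_fail(exc)
            if attempt >= max_retries:
                raise
            attempt += 1


replica = Replica()
bubbled_up = False
try:
    retry(replica, "EVAL ...", max_retries=3, on_fail=on_fail_callback)
except ServerException:
    bubbled_up = True
```

With these assumptions the replica is hit once plus three retries, no topology refresh is recorded, and the ServerException reaches the caller, which is the failure mode the explicit READONLY handling is meant to avoid.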

This is exactly why my explicit handling in onErrorResponse and onReadOnlyResponse is strictly necessary to cleanly survive AWS ElastiCache OSS failover events. By manually catching the -READONLY keyword inside ErrorResponse and explicitly triggering askSlotMap() + disconnect(), we actively force the topology to refresh and successfully re-route the failed command to the newly promoted primary node.

Let me know what you think and if you need any further adjustments.

I see the CI failed on testClusterExecutePipeline throwing a MOVED ServerException. Since my code exclusively touches the READONLY case in onErrorResponse(), it is unrelated to this flaky pipeline test failure. Could you please re-run the CI jobs?

vladvildanov · 2026-03-23T06:59:15Z

@NivekNK Thanks for the detailed explanation, no more objections from my side!

feature: Added support for READONLY responses from AWS Elastic Cache

97bda2f

NivekNK requested a review from a team as a code owner March 18, 2026 15:59

NivekNK requested review from tillkruss and vladvildanov March 18, 2026 15:59

NivekNK mentioned this pull request Mar 18, 2026

Proactive Slot Map Discovery via connection option (Addressing AWS ElastiCache READONLY vs MOVED inconsistency in LUA scripts) #1655

Closed

chore: improve -READONLY handler comment and add changelog entry (predis#1656)

43bf33a

tests: add unit test demonstrating ServerException behavior with READONLY error

ebe61c8

vladvildanov approved these changes Mar 23, 2026

vladvildanov merged commit bb37322 into predis:main Mar 23, 2026
65 of 66 checks passed

vladvildanov merged 3 commits into predis:main from NivekNK:feature/readonly-handler