rgw: fetch_remote_obj() need to re-encrypt sse-s3 encrypted objects#62272

Open
clwluvw wants to merge 11 commits intoceph:mainfrom
clwluvw:sse-s3-fetch

Conversation

@clwluvw
Member

@clwluvw clwluvw commented Mar 13, 2025

As SSE-S3 encrypted objects are tied to the source bucket's encryption key, they must be re-encrypted using the destination bucket's key to eliminate any dependencies on the source bucket's key.
The same principle applies to replication.

Fixes: https://tracker.ceph.com/issues/70446

@clwluvw clwluvw marked this pull request as ready for review March 19, 2025 16:59
@clwluvw clwluvw requested a review from a team as a code owner March 19, 2025 16:59
Comment on lines 703 to 706
auto crypt_mode = attrs.find(RGW_ATTR_CRYPT_MODE);
if (crypt_mode != attrs.end()) {
if (crypt_mode->second.to_str() != "AES256") {
// AES256 needs to be decrypted so that the target zone can re-encrypt with destination bucket's key
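Read on its own, the quoted condition amounts to the following minimal sketch (the attribute name, types, and helper function are simplified stand-ins for illustration, not the actual RGW code, where attrs maps attribute names to bufferlists):

```cpp
#include <cassert>
#include <map>
#include <string>

// Illustrative stand-in for the real RGW attribute name.
static const char* RGW_ATTR_CRYPT_MODE = "user.rgw.crypt-mode";

// True when the object should be decrypted for replication: only
// SSE-S3 ("AES256") objects are tied to the bucket's key and need
// re-encryption with the destination bucket's key on the other side.
bool should_decrypt_for_replication(
    const std::map<std::string, std::string>& attrs) {
  auto crypt_mode = attrs.find(RGW_ATTR_CRYPT_MODE);
  if (crypt_mode == attrs.end())
    return false;  // unencrypted object: nothing to do
  return crypt_mode->second == "AES256";  // SSE-S3 only
}
```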
Contributor

with rgw's default replication for DR, the source bucket and destination bucket are always the same, where i assume we'd never need to re-encrypt the objects. can we find a way to make this specific to bucket replication policy?

Member Author

Agree, the simplest approach would be to consider the rgwx-zonegroup argument if we had a clear distinction between zone replication and zonegroup replication.
However, for now, we could take the skip-encryption argument more seriously and always skip encryption when this argument is present. This way, the destination zone can decide whether encryption needs to be skipped by comparing the source and destination buckets.
If the argument is not present, the source zone would make the decision based on the source object’s encryption mode. In this case, we might need to check whether the request is for replication, which may require introducing another argument (the existing ones are implicit for that i think).

Alternatively, we could introduce a new argument, skip-enc-sse-s3, which the destination zone would send when the source and destination buckets are different. This would allow the source zone to decide based on either skip-encryption or skip-enc-sse-s3. What do you think?
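The two-flag scheme proposed here could be sketched roughly like this (flag and function names are illustrative, and skip-enc-sse-s3 is only a proposal, not an existing argument):

```cpp
#include <cassert>
#include <string>

// skip_encryption is the existing argument; skip_enc_sse_s3 is the
// proposed one, sent by the destination when src and dest buckets
// differ.  crypt_mode is the object's stored crypt mode ("" when
// unencrypted).  The source serves plaintext only when the destination
// asked for SSE-S3 re-encryption and the object is actually SSE-S3.
bool source_should_decrypt(bool skip_encryption,
                           bool skip_enc_sse_s3,
                           const std::string& crypt_mode) {
  if (skip_encryption)
    return false;  // destination wants the ciphertext as-is
  if (skip_enc_sse_s3)
    return crypt_mode == "AES256";  // decrypt SSE-S3 objects only
  return false;  // neither flag: keep the current behavior
}
```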

Contributor

it's hard to make changes here that preserve compatibility between ceph versions

maybe we should leave the server side unchanged (always skip decrypt), and rely on fetch_remote_obj() to fetch both source and destination bucket keys so it can decrypt/reencrypt the body itself when necessary

multisite really shouldn't transfer objects in plaintext when they're supposed to be encrypted on both sites
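As a rough illustration of this suggestion (a toy XOR transform stands in for the real SSE-S3 cipher, and the function names are invented): the fetching side decrypts with the source bucket's key and re-encrypts with the destination bucket's key, so the wire and at-rest copies stay encrypted.

```cpp
#include <cassert>
#include <string>

// Toy cipher for illustration only: XOR with a repeating key is its
// own inverse, which lets the round trip below be checked directly.
std::string xor_transform(std::string data, const std::string& key) {
  for (size_t i = 0; i < data.size(); ++i)
    data[i] ^= key[i % key.size()];
  return data;
}

// fetch_remote_obj()-style client-side re-encryption: decrypt the
// fetched body with the source bucket's key, then re-encrypt it with
// the destination bucket's key, without the server side changing.
std::string reencrypt(const std::string& body,
                      const std::string& src_key,
                      const std::string& dest_key) {
  return xor_transform(xor_transform(body, src_key), dest_key);
}
```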

Member Author

@clwluvw clwluvw Mar 19, 2025

it's hard to make changes here that preserve compatibility between ceph versions

With the introduction of skip-enc-sse-s3: if the destination requests it from a source that hasn't been upgraded yet, the object will only be retrieved in plain text if it's actually encrypted with SSE-S3 (that's the case where we don't send skip-encryption and only send skip-enc-sse-s3, which the source doesn't understand yet), and that is fine because the destination will create the keys and encrypt it again. Otherwise, the destination will send skip-encryption anyway, and the object will remain encrypted. The only scenario where this might not work is when both the source and destination are the same and we don't want to decrypt. However, this would only impact performance until the upgrade is complete, which I believe is negligible.

Vice-versa, the old version always requests skip-encryption, and the upgraded version will always respect that.

multisite really shouldn't transfer objects in plaintext when they're supposed to be encrypted on both sites

Is this due to any security constraints? if so isn't (enforcing) https enough for that?

Contributor

multisite really shouldn't transfer objects in plaintext when they're supposed to be encrypted on both sites

Is this due to any security constraints? if so isn't (enforcing) https enough for that?

tls helps, but what does enforcement mean? failing bucket replication for sse-s3-encrypted objects, i guess?

Member Author

client traffic isn't affected

i wasn't referring only to client traffic, but to the replication time and eventually the replication performance at scale.

i'm not sure it makes much difference?

But I'm not sure either. if we don't have a good way to do it the other way, i guess the best is to start with the approach you suggested, measure the performance, and see if this is really a problem, so we can improve it later (even in a less clean way) with a justification.

Member Author

but to decrypt and reencrypt perhaps we would need some implementations from #54543

Contributor

hmm, i find it a bit inefficient to decrypt and encrypt on the destination zone at the same time; i thought, if we could leave the decryption to the source zone, why not?

sorry, i thought we were talking about the cpu overhead of decryption. we either need to pay that cost on the server side or the client side, and either will count towards replication latency. am i missing something?

Member Author

@clwluvw clwluvw Mar 20, 2025

no, that is true. i just thought that if we distribute the load (decryption to the source and encryption to the destination), it could yield better resource utilisation, in the big picture across all clusters, than concentrating it all in one place.

Member Author

it's hard to make changes here that preserve compatibility between ceph versions

Another approach i could think of:
What if we base the decision to apply the encryption filter on a response header, such as x-rgw-decrypted?
This would allow us to determine whether the source zone we're interacting with has been upgraded and whether the content has actually been decrypted. If it has, we could then apply the encryption filter accordingly. If not, the sync would maintain its current behavior, ensuring backward compatibility.
We would need to introduce skip-decrypt-sse-s3 anyway, to hint the source not to decrypt sse-s3-encrypted objects when the replication is between the same buckets.
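A minimal sketch of this header-based negotiation, assuming the hypothetical x-rgw-decrypted response header and skip-decrypt-sse-s3 argument (all names here are illustrative):

```cpp
#include <cassert>
#include <string>

// Destination side: when src and dest buckets are the same, the keys
// match, so hint the source not to decrypt SSE-S3 objects.
bool send_skip_decrypt_sse_s3(const std::string& src_bucket,
                              const std::string& dest_bucket) {
  return src_bucket == dest_bucket;
}

// Destination side: apply the encryption filter only when the source
// confirmed via the x-rgw-decrypted response header that it actually
// served plaintext.  An old (non-upgraded) source never sets the
// header, so the sync keeps its current behavior.
bool apply_encryption_filter(bool x_rgw_decrypted_present) {
  return x_rgw_decrypted_present;
}
```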

@github-actions

This pull request can no longer be automatically merged: a rebase is needed and changes have to be manually resolved

@github-actions

This pull request has been automatically marked as stale because it has not had any activity for 60 days. It will be closed if no further activity occurs for another 30 days.
If you are a maintainer or core committer, please follow-up on this pull request to identify what steps should be taken by the author to move this proposed change forward.
If you are the author of this pull request, thank you for your proposed contribution. If you believe this change is still appropriate, please ensure that any feedback has been addressed and ask for a code review.

@github-actions

github-actions bot commented Jul 7, 2025

This pull request can no longer be automatically merged: a rebase is needed and changes have to be manually resolved


@github-actions github-actions bot added the stale label Sep 12, 2025
@clwluvw clwluvw removed the stale label Sep 12, 2025

@github-actions github-actions bot added the stale label Nov 11, 2025
@clwluvw clwluvw removed the stale label Nov 12, 2025
The requesting user can be a non-bucket-owner, so picking the tenant can
result in different contexts. Picking the bucket owner ensures a
consistent context.

Signed-off-by: Seena Fallah <[email protected]>
As sse-s3 encryption is bound to the bucket's key, we should always
serve the objects decrypted so they can be re-encrypted with the
destination bucket's key id.

Signed-off-by: Seena Fallah <[email protected]>
When the source object is encrypted via sse-s3, fetch_remote_obj()
needs to pause the receive, generate a new sse-s3 key for the
destination bucket and object, and then continue receiving and
processing based on the new encryption filter.

Signed-off-by: Seena Fallah <[email protected]>
Passing decrypt-mode rather than a boolean skip-decrypt allows the
dest zone to pass "skip-except-sse-s3" when the src and dest
buckets are not the same, and "skip" when they are.
With that, the source zone can make the right call based on the string
provided by the decrypt-mode param.

Signed-off-by: Seena Fallah <[email protected]>
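The decrypt-mode dispatch described in this commit message could be sketched as follows (the function name and the handling of absent or unknown modes are assumptions, not the actual implementation):

```cpp
#include <cassert>
#include <string>

// decrypt_mode is the string sent by the destination zone; crypt_mode
// is the source object's stored crypt mode ("" when unencrypted).
bool source_decrypts(const std::string& decrypt_mode,
                     const std::string& crypt_mode) {
  if (decrypt_mode == "skip")
    return false;  // same src/dest bucket: keep the ciphertext
  if (decrypt_mode == "skip-except-sse-s3")
    return crypt_mode == "AES256";  // different buckets: decrypt SSE-S3
  return false;  // absent/unknown mode: keep the current behavior
}
```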
@github-actions

github-actions bot commented Feb 1, 2026

This pull request has been automatically marked as stale because it has not had any activity for 60 days. It will be closed if no further activity occurs for another 30 days.
If you are a maintainer or core committer, please follow-up on this pull request to identify what steps should be taken by the author to move this proposed change forward.
If you are the author of this pull request, thank you for your proposed contribution. If you believe this change is still appropriate, please ensure that any feedback has been addressed and ask for a code review.

@github-actions

github-actions bot commented Feb 3, 2026

This pull request can no longer be automatically merged: a rebase is needed and changes have to be manually resolved

@github-actions github-actions bot removed the stale label Feb 3, 2026