Audit storage: validate consistency of replica and shard location metadata #9628

kakaiu · 2023-03-08T22:35:39Z

Introduction
AuditStorage is a functionality that serves to audit the system's data storage by checking for data consistency. It is triggered when a client requests an audit or when a consistency check is required at the end of simulation.
When a client issues an auditStorage request, the request is first processed by CC. CC forwards the request to DD, which is responsible for processing audit requests.
DD checks for ongoing audits before processing a new audit request. If there is an ongoing audit with the same audit type and range as the new request, DD obtains the audit ID for that audit. If there is an ongoing but irrelevant audit, DD returns an error message indicating that the system is busy, as currently only one ongoing audit is allowed at a time. If there is no ongoing audit, DD creates a new audit and persists its state.
AuditStorage requests are asynchronous. DD immediately replies with the existing audit ID to CC. If DD is unable to get and persist the result, it captures the failure and automatically retries the audit until one of three outcomes is achieved: (1) the maximum number of retry attempts is exceeded, resulting in a "Failed" result; (2) the audit is completed without any error, resulting in a "Complete" result; or (3) the audit is completed with errors detected, resulting in an "Error" result.
In some cases, CC might not know whether the request has been delivered to DD, for example, when DD restarts after CC sends an audit request. In such cases, CC replies with "request_maybe_delivered" to the client. The client can then issue a new audit request if necessary.

Following designs are obeyed when developing the audit storage:
(1) A audit request generates an AuditStorage;
(2) An AuditStorage can automatically retry for failures;
(3) Any component of AuditStorage must not block or kill SS and DD and CC;
(4) Audit storage must be retriable --- being able to make progress by retrying. A large audit is partitioned into tasks and assigned to SSes. Each SS runs assigned tasks until completing all assigned tasks or failed. Upon completing each task, SS persists the progress. If a task is failed, SS notifies DD, and DD loads the progress made by the SS and resend the remaining tasks to the SS.

Current limitations:
(1) TSS servers are not covered;
(2) If a bad assignment consistently updated to metadata, this bad assignment is not detected. For example, DD assigns a removed SSID (or invalid SSID, like 0) to KeyServer and ServerKey. This bad assignment cannot be detected by current implementation of AuditStorage.

AduitStorageTest 100k:
20230429-063732-zhewang-b06297784516243e compressed=True data_size=32945656 duration=3028909 ended=100000 fail_fast=10 max_runs=100000 pass=100000 priority=100 remaining=0 runtime=0:45:18 sanity=False started=100000 stopped=20230429-072250 submitted=20230429-063732 timeout=5400 username=zhewang

100k correctness with two irrelevant failures:
20230429-063413-zhewang-422a958e38f9fa0b compressed=True data_size=32915465 duration=5142264 ended=100000 fail=2 fail_fast=10 max_runs=100000 pass=99998 priority=100 remaining=0 runtime=1:19:41 sanity=False started=100000 stopped=20230429-075354 submitted=20230429-063413 timeout=5400 username=zhewang

Code-Reviewer Section

The general pull request guidelines can be found here.

Please check each of the following things and check all boxes before accepting a PR.

The PR has a description, explaining both the problem and the solution.
The description mentions which forms of testing were done and the testing seems reasonable.
Every function/class/actor that was touched is reasonably well documented.

For Release-Branches

If this PR is made against a release-branch, please also check the following:

This change/bugfix is a cherry-pick from the next younger branch (younger release-branch or main if this is the youngest branch)
There is a good reason why this PR needs to go into a release branch and this reason is documented (either in the description above or in a linked GitHub issue)

…idate-data-consistency

…-helium/foundationdb into persisted-validate-data-consistency

…sisted-validate-data-consistency

Moved AuditUtils to fdbserver/

Throw/Send audit_storage_error when there is a data corruption. Added doAuditStorage() for resuming Audit.

…sisted-validate-data-consistency

fdbcli.

…sisted-validate-data-consistency

foundationdb-ci · 2023-04-28T23:54:23Z

Result of foundationdb-pr-clang-ide on Linux CentOS 7

Commit ID: b60603b
Duration 0:15:22
Result: ✅ SUCCEEDED
Error: N/A
Build Log terminal output (available for 30 days)
Build Workspace zip file of the working directory (available for 30 days)

fdb-windows-ci · 2023-04-29T00:02:15Z

Doxense CI Report for Windows 10

Commit ID: b60603b
Result: ❌ FAILED
Build Logs (available for 30 days)

foundationdb-ci · 2023-04-29T00:40:51Z

Result of foundationdb-pr on Linux CentOS 7

Commit ID: b60603b
Duration 1:01:51
Result: ✅ SUCCEEDED
Error: N/A
Build Log terminal output (available for 30 days)
Build Workspace zip file of the working directory (available for 30 days)

foundationdb-ci · 2023-04-29T00:46:17Z

Result of foundationdb-pr-clang on Linux CentOS 7

Commit ID: b60603b
Duration 1:07:16
Result: ✅ SUCCEEDED
Error: N/A
Build Log terminal output (available for 30 days)
Build Workspace zip file of the working directory (available for 30 days)

foundationdb-ci · 2023-04-29T00:59:05Z

Result of foundationdb-pr-cluster-tests on Linux CentOS 7

Commit ID: b60603b
Duration 1:20:03
Result: ❌ FAILED
Error: Error while executing command: if $fail_test; then exit 1; fi. Reason: exit status 1
Build Log terminal output (available for 30 days)
Build Workspace zip file of the working directory (available for 30 days)
Cluster Test Logs zip file of the test logs (available for 30 days)

foundationdb-ci · 2023-04-29T04:23:20Z

Result of foundationdb-pr-clang-ide on Linux CentOS 7

Commit ID: 7974d00
Duration 0:15:56
Result: ✅ SUCCEEDED
Error: N/A
Build Log terminal output (available for 30 days)
Build Workspace zip file of the working directory (available for 30 days)

fdb-windows-ci · 2023-04-29T04:26:54Z

Doxense CI Report for Windows 10

Commit ID: 7974d00
Result: ❌ FAILED
Build Logs (available for 30 days)

foundationdb-ci · 2023-04-29T04:35:36Z

Result of foundationdb-pr-macos-m1 on macOS Ventura 13.x

Commit ID: 7974d00
Duration 0:28:12
Result: ✅ SUCCEEDED
Error: N/A
Build Log terminal output (available for 30 days)
Build Workspace zip file of the working directory (available for 30 days)

foundationdb-ci · 2023-04-29T04:45:47Z

Result of foundationdb-pr-macos on macOS Ventura 13.x

Commit ID: 7974d00
Duration 0:38:24
Result: ✅ SUCCEEDED
Error: N/A
Build Log terminal output (available for 30 days)
Build Workspace zip file of the working directory (available for 30 days)

foundationdb-ci · 2023-04-29T04:56:19Z

Result of foundationdb-pr-clang on Linux CentOS 7

Commit ID: 7974d00
Duration 0:48:57
Result: ❌ FAILED
Error: Error while executing command: if python3 -m joshua.joshua list --stopped | grep ${ENSEMBLE_ID} | grep -q 'pass=10[0-9][0-9][0-9]'; then echo PASS; else echo FAIL && exit 1; fi. Reason: exit status 1
Build Log terminal output (available for 30 days)
Build Workspace zip file of the working directory (available for 30 days)

foundationdb-ci · 2023-04-29T05:02:51Z

Result of foundationdb-pr on Linux CentOS 7

Commit ID: 7974d00
Duration 0:55:27
Result: ✅ SUCCEEDED
Error: N/A
Build Log terminal output (available for 30 days)
Build Workspace zip file of the working directory (available for 30 days)

foundationdb-ci · 2023-04-29T05:28:10Z

Result of foundationdb-pr-cluster-tests on Linux CentOS 7

Commit ID: 7974d00
Duration 1:20:47
Result: ✅ SUCCEEDED
Error: N/A
Build Log terminal output (available for 30 days)
Build Workspace zip file of the working directory (available for 30 days)
Cluster Test Logs zip file of the test logs (available for 30 days)

fdb-windows-ci · 2023-04-29T06:46:12Z

Doxense CI Report for Windows 10

Commit ID: fe7e95e
Result: ❌ FAILED
Build Logs (available for 30 days)

foundationdb-ci · 2023-04-29T06:53:27Z

Result of foundationdb-pr-clang-ide on Linux CentOS 7

Commit ID: fe7e95e
Duration 0:17:23
Result: ✅ SUCCEEDED
Error: N/A
Build Log terminal output (available for 30 days)
Build Workspace zip file of the working directory (available for 30 days)

foundationdb-ci · 2023-04-29T07:04:39Z

Result of foundationdb-pr-macos-m1 on macOS Ventura 13.x

Commit ID: fe7e95e
Duration 0:28:36
Result: ✅ SUCCEEDED
Error: N/A
Build Log terminal output (available for 30 days)
Build Workspace zip file of the working directory (available for 30 days)

foundationdb-ci · 2023-04-29T07:16:28Z

Result of foundationdb-pr-macos on macOS Ventura 13.x

Commit ID: fe7e95e
Duration 0:40:23
Result: ✅ SUCCEEDED
Error: N/A
Build Log terminal output (available for 30 days)
Build Workspace zip file of the working directory (available for 30 days)

foundationdb-ci · 2023-04-29T07:31:34Z

Result of foundationdb-pr on Linux CentOS 7

Commit ID: fe7e95e
Duration 0:55:32
Result: ✅ SUCCEEDED
Error: N/A
Build Log terminal output (available for 30 days)
Build Workspace zip file of the working directory (available for 30 days)

foundationdb-ci · 2023-04-29T07:32:09Z

Result of foundationdb-pr-clang on Linux CentOS 7

Commit ID: fe7e95e
Duration 0:56:09
Result: ✅ SUCCEEDED
Error: N/A
Build Log terminal output (available for 30 days)
Build Workspace zip file of the working directory (available for 30 days)

foundationdb-ci · 2023-04-29T07:56:13Z

Result of foundationdb-pr-cluster-tests on Linux CentOS 7

Commit ID: fe7e95e
Duration 1:20:10
Result: ✅ SUCCEEDED
Error: N/A
Build Log terminal output (available for 30 days)
Build Workspace zip file of the working directory (available for 30 days)
Cluster Test Logs zip file of the test logs (available for 30 days)

…adata (apple#9628) * Implemented AuditUtils.actor.cpp Moved AuditUtils to fdbserver/ * Persist AuditStorageState. * Passed persisted AuditStorageState test. * Added audit_storage_error to indicate a corruption is caught. Throw/Send audit_storage_error when there is a data corruption. Added doAuditStorage() for resuming Audit. * Load and resume AuditStorage when DD restarts. * Generate audit id monotonically. * Fixed minor issue AuditId/Type was not set. * Adding getLatestAuditStates. * Improved persisted errors and added AuditStorageCommand.actor.cpp for fdbcli. * Added `audit_storage` fdbcli command. * fmt. * Fixed null shared_ptr issue. * Improve audit data. * Change DDAuditFailed to SevWarn. * Sev. * set SERVE_AUDIT_STORAGE_PARALLELISM to 1. * Moved AuditUtils* to fdbclient/. * Added getAuditStatus fdbcli command. * Refactor audit storage fdb cli commands. * Added auditStorage in sim. * Cleanup. * Resolved comments. * Resolved comments. * Added SystemData for metadata audit. Refactored audit workflow to make sure all sub-tasks are executed w/o early exit. * Improvements. * Persisted Failed state after too many retries. * Added retryCount for resumeAuditStorage(). * resolving conflict. * Resolved conflicts. * allow-merged-to-run * add timeout to audit client * fmt * validate replica * add audit serverKey * address comments and fmt * fix audit_storage_exceeded_request_limit * fix segfault in getLatestAuditStatesImpl * fix bugs * remove timeout from workload * fix bugs * audit local view of shard assignment * fmt * fix-stuck-issue-and-make-dd-audit-storage-self-retry * fix timeout * fix timeout * fix bugs and cleanup * fix nit * change name state to coreState for audit metadata * address comments * code clean * fmt * setup debug * cleanup * clean up * code cleanup * code clean * remove tmp file * fmt * trace portion of shards that of anonymous physical shard * remove unnecessary actor cleanup * do not give up when tr is too old * address commits * refactor * clean * fmt * fix-command-help-text * fix-auditstate-restore-and-enable-restore-to-metadata-audit * address comments * fmrt * debug and improve efficient of resume audit * small change * fix audit cli * bypass completed audit when dd restart * fix auditStorageCommandActor * make mismatch key range more visable * address comments * make local shard metadata check can make progress by retries * address comments * address comments * partition location metadata validation by range and server * unset MIN_TRACE_SEVERITY * address comments and SS auto proceed until failed then notify dd * persistNewAuditState should checkMoveKeysLock * audit storage location metadata partitioned by range and move shard assignment history def to the end of SS structure * code cleanup * fix error message in metadata validation * fix registerAuditsForShardAssignmentHistoryCollection input for local shard validation * add comments to code and add guard to make sure the SS audit does not proceeds automatically for many times without being notified by DD --- to support audit cancellation later * fix coalesceRangeList * replace rangeOverlapping func with operator and use struct instead of complicated type for return value of getKeyServer/serverKey/shardInfo * simplify shard assignment history * shardAssignmentRecordRequests should be unorder_map * address comments, make trackShardAssignment simple, make anyChildAuditFailed cover all audit children, keep only one audit actor run at a time on each SS * only run validate shard info once at a time, other audit type does not have this limitation --------- Co-authored-by: He Liu <[email protected]> Co-authored-by: He Liu <[email protected]> Co-authored-by: Zhe Wang <[email protected]>

liquid-helium and others added 30 commits October 10, 2022 10:30

Merge branch 'main' of https://github.com/apple/foundationdb into val…

1b91e34

…idate-data-consistency

Merge branch 'validate-data-consistency' of https://github.com/liquid…

d4c0485

…-helium/foundationdb into persisted-validate-data-consistency

Merge branch 'main' of https://github.com/apple/foundationdb into per…

c3555a7

…sisted-validate-data-consistency

Implemented AuditUtils.actor.cpp

2d25e32

Moved AuditUtils to fdbserver/

Persist AuditStorageState.

a3e6d6f

Passed persisted AuditStorageState test.

0ff6d14

Added audit_storage_error to indicate a corruption is caught.

f8312e0

Throw/Send audit_storage_error when there is a data corruption. Added doAuditStorage() for resuming Audit.

Load and resume AuditStorage when DD restarts.

8e948e6

Merge branch 'main' of https://github.com/apple/foundationdb into per…

667b328

…sisted-validate-data-consistency

Generate audit id monotonically.

6c593d2

Fixed minor issue AuditId/Type was not set.

8fcffe6

Adding getLatestAuditStates.

6ffacbf

Improved persisted errors and added AuditStorageCommand.actor.cpp for

6b1a71a

fdbcli.

Added audit_storage fdbcli command.

d49264c

Merge branch 'main' of https://github.com/apple/foundationdb into per…

6ea4815

…sisted-validate-data-consistency

fmt.

8f054f4

Fixed null shared_ptr issue.

ab95f65

Merge branch 'main' of https://github.com/apple/foundationdb into per…

4682ae1

…sisted-validate-data-consistency

Merge branch 'main' of https://github.com/apple/foundationdb into per…

3847962

…sisted-validate-data-consistency

Merge branch 'main' of https://github.com/apple/foundationdb into per…

6a0e939

…sisted-validate-data-consistency

Improve audit data.

6c2eea6

Merge branch 'main' of https://github.com/apple/foundationdb into per…

8efa403

…sisted-validate-data-consistency

Change DDAuditFailed to SevWarn.

61be9b7

Sev.

57604fb

Merge branch 'main' of https://github.com/apple/foundationdb into per…

17597b2

…sisted-validate-data-consistency

set SERVE_AUDIT_STORAGE_PARALLELISM to 1.

0309d1c

Merge branch 'main' of https://github.com/apple/foundationdb into per…

9495cec

…sisted-validate-data-consistency

Moved AuditUtils* to fdbclient/.

ce8de22

Added getAuditStatus fdbcli command.

f5ee8c9

Refactor audit storage fdb cli commands.

56e1a24

kakaiu dismissed liquid-helium’s stale review via b60603b April 28, 2023 23:38

kakaiu requested a review from liquid-helium April 28, 2023 23:39

Merge remote-tracking branch 'upstream/main' into audit-metadata

fe7e95e

kakaiu force-pushed the audit-metadata branch from 7974d00 to fe7e95e Compare April 29, 2023 06:35

liquid-helium approved these changes May 1, 2023

View reviewed changes

kakaiu merged commit d6e7b5f into apple:main May 1, 2023

This was referenced May 4, 2023

Adding cleanup of old audit metadata #10137

Merged

Adding cleanup of audit progress metadata when audit complete #10118

Merged

Audit storage statistic #10155

Merged

kakaiu mentioned this pull request May 11, 2023

[release-7.3] Cherrypick Audit Storage #10214

Merged

5 tasks

Audit storage: validate consistency of replica and shard location metadata #9628

Audit storage: validate consistency of replica and shard location metadata #9628

Uh oh!

Conversation

kakaiu commented Mar 8, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code-Reviewer Section

For Release-Branches

Uh oh!

foundationdb-ci commented Apr 28, 2023

Result of foundationdb-pr-clang-ide on Linux CentOS 7

Uh oh!

fdb-windows-ci commented Apr 29, 2023

Doxense CI Report for Windows 10

Uh oh!

foundationdb-ci commented Apr 29, 2023

Result of foundationdb-pr on Linux CentOS 7

Uh oh!

foundationdb-ci commented Apr 29, 2023

Result of foundationdb-pr-clang on Linux CentOS 7

Uh oh!

foundationdb-ci commented Apr 29, 2023

Result of foundationdb-pr-cluster-tests on Linux CentOS 7

Uh oh!

foundationdb-ci commented Apr 29, 2023

Result of foundationdb-pr-clang-ide on Linux CentOS 7

Uh oh!

fdb-windows-ci commented Apr 29, 2023

Doxense CI Report for Windows 10

Uh oh!

foundationdb-ci commented Apr 29, 2023

Result of foundationdb-pr-macos-m1 on macOS Ventura 13.x

Uh oh!

foundationdb-ci commented Apr 29, 2023

Result of foundationdb-pr-macos on macOS Ventura 13.x

Uh oh!

foundationdb-ci commented Apr 29, 2023

Result of foundationdb-pr-clang on Linux CentOS 7

Uh oh!

foundationdb-ci commented Apr 29, 2023

Result of foundationdb-pr on Linux CentOS 7

Uh oh!

foundationdb-ci commented Apr 29, 2023

Result of foundationdb-pr-cluster-tests on Linux CentOS 7

Uh oh!

fdb-windows-ci commented Apr 29, 2023

Doxense CI Report for Windows 10

Uh oh!

foundationdb-ci commented Apr 29, 2023

Result of foundationdb-pr-clang-ide on Linux CentOS 7

Uh oh!

foundationdb-ci commented Apr 29, 2023

Result of foundationdb-pr-macos-m1 on macOS Ventura 13.x

Uh oh!

foundationdb-ci commented Apr 29, 2023

Result of foundationdb-pr-macos on macOS Ventura 13.x

Uh oh!

foundationdb-ci commented Apr 29, 2023

Result of foundationdb-pr on Linux CentOS 7

Uh oh!

foundationdb-ci commented Apr 29, 2023

Result of foundationdb-pr-clang on Linux CentOS 7

Uh oh!

foundationdb-ci commented Apr 29, 2023

Result of foundationdb-pr-cluster-tests on Linux CentOS 7

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

kakaiu commented Mar 8, 2023 •

edited

Loading