Merged upstream 7.1.29 #60

oleg68 · 2023-04-03T08:19:38Z

No description provided.

Cherry pick PR 8630 to release 7.1

This is a patch to release-7.1, made after resolving conflicts with the commit in the main branch, to enable byteLimit in release-7.1. A fraction of byteLimit is used as the limit for fetching indexes. Records for the fetched indexes are then fetched in a batch. byteLimit always counts the index size and also counts the record size if the record exists. At least one index-record entry is returned, and the last entry is always included even though adding it might exceed the limit. There is a knob, STRICTLY_ENFORCE_BYTE_LIMIT: when it is set, records are discarded once byteLimit is hit, even though they were fetched. Otherwise, the whole batch is returned.
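The truncation rule described above can be sketched roughly as follows. This is an illustrative model only, not the code from PR 8630: the function name `apply_byte_limit`, the `(index_bytes, record_bytes)` tuple layout, and the boolean `strictly_enforce` parameter (standing in for the STRICTLY_ENFORCE_BYTE_LIMIT knob) are assumptions made for the example.

```python
from typing import List, Optional, Tuple

# Hypothetical entry shape: (index size in bytes, record size in bytes or None
# when no record exists for that index entry).
Entry = Tuple[int, Optional[int]]


def apply_byte_limit(entries: List[Entry], byte_limit: int,
                     strictly_enforce: bool) -> List[Entry]:
    """Return the entries that would be sent back for one fetched batch."""
    if not strictly_enforce:
        # Knob off: the whole fetched batch is returned.
        return list(entries)

    kept: List[Entry] = []
    used = 0
    for index_bytes, record_bytes in entries:
        # The entry is added before the limit check, so at least one entry is
        # returned and the entry that crosses the limit is still included.
        kept.append((index_bytes, record_bytes))
        used += index_bytes + (record_bytes or 0)
        if used >= byte_limit:
            break  # remaining fetched entries are discarded
    return kept
```

For example, with three entries of 100 bytes each and a limit of 150, strict enforcement keeps the first two entries (the second one crosses the limit), while the non-strict path returns all three.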

…elease-7.1] (apple#9640)

* Add DcLag tests and workload
* Add disableSimSpeedup to clog network longer
* Ignore the DcLag test
* Refactor LogRouter's pullAsyncData
* Switch DC if log router peek becomes stuck

  Try a different DC if this happens.
* Enable DcLag test
* Require at least 2 regions and having satellites
* Simplify DcLag code
* Limit connection failures to be within tests

  In particular, disable connection failures when initializing the database during the startup phase, i.e., before running with test specs.
* Revert disableSimSpeedup
* Fix conflicts after cherrypick
* More fixes after cherrypick
* Refactor to address comments
* Use a constant for connectionFailuresDisableDuration
* Fix ClogTlog workload valgrind error
* Address comments
* Reduce running time for DcLag

  The switch can happen more quickly than the workload detection time, so the detection time needs to be adjusted to be lower than LOG_ROUTER_PEEK_SWITCH_DC_TIME.

Seed storage servers are recruited as the initial set of storage servers when a database is first created. They function a little differently than normal and do not set an initial version like storages normally do when they get recruited (typically equal to the recovery version).

Version correction is a feature where versions advance in sync with the clock and are equal across FDB clusters. To allow different FDB clusters to have matching versions, they must share the same base version. This defaults to the Unix epoch, and clusters with the version epoch enabled will have a current version equal to the number of microseconds since the Unix epoch.

When the version epoch is enabled on a cluster, it causes a one-time jump from the cluster's current version to the version based on the epoch. After a recovery, the recovery version sent to storages should have advanced by a significant amount.

The recovery path contained a `BUGGIFY` to randomly advance the recovery version in simulation, testing the version epoch being enabled. However, it was also advancing the version during an initial recovery, when the seed storage servers are recruited. If a set of storage servers were recruited as seed servers, but another recovery occurred before the bootstrap process was complete, the randomly selected version increase could be smaller during the second recovery than during the first. This could cause the initial set of seed servers to think they should be at a version larger than what the cluster was actually at.

The fix contained in this commit is to only cause a random version jump when the recovery is occurring on an existing database, and not when it is recruiting seed storages.

This commit fixes an issue found in simulation, reproducible with:

Commit: 93dc4bf
Test: fast/DataLossRecovery.toml
Seed: 3101495991
Buggify: on
Compiler: clang
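A minimal sketch of the two ideas in this commit message, under stated assumptions: the function names, the `MAX_BUGGIFY_JUMP` constant, and the `initial_recovery` and `buggify` flags are hypothetical and do not come from the FoundationDB source. The first function shows a version expressed as microseconds since the Unix epoch. The second shows the fix, where the random simulation-only jump is skipped while seed storage servers are being recruited.

```python
import random

# Hypothetical upper bound for the random jump injected in simulation.
MAX_BUGGIFY_JUMP = 10_000_000


def epoch_version(unix_seconds: float) -> int:
    """Version equal to the number of microseconds since the Unix epoch."""
    return int(unix_seconds * 1_000_000)


def next_recovery_version(last_version: int, base_advance: int, *,
                          initial_recovery: bool, buggify: bool,
                          rng: random.Random) -> int:
    """Pick the recovery version handed to storage servers."""
    version = last_version + base_advance
    # The fix: only inject a random jump when recovering an existing database,
    # never during the initial recovery that recruits seed storage servers.
    if buggify and not initial_recovery:
        version += rng.randrange(MAX_BUGGIFY_JUMP)
    return version
```

With the guard in place, two back-to-back initial recoveries starting from the same version produce the same recovery version, so a second recovery cannot end up below what the seed servers were told after the first.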

Fix issue where the versions on seed storage servers decreased [release-7.1]

jzhou77 and others added 30 commits November 5, 2022 11:36

Fix transaction_too_old error when version vector is enabled

7121350

When VV is enabled, the comparison of the storage server version and the read version should use the original read version; otherwise, the client may wrongly get a transaction_too_old error.

Fix assertions w.r.t. VV

3371992

Avoid using oldest version as read version for VV

c3f25a2

Disable a debugging trace event

18add2a

Cherry pick 8630

339543a

Address review comments

4173acf

enable AVX and update version for 7.1.25 release

b4bd84b

skip proxy when fetching kubectl

6cb7ac4

Merge pull request apple#8710 from jzhou77/release-7.1

3a4641a

Fix transaction_too_old error when version vector is enabled

add generated.go

1439943

update version after 7.1.25 release

cf0aed4

Add changes to generated.go from PR8761, and remove change to Configu…

e58d388

…reCompiler.cmake

Merge remote-tracking branch 'origin/release-7.1' into r71

cf8b9fa

Update generated.go

91f452e

Update generated.go with 8761

3b52fc4

Merge pull request apple#8740 from apple/r71

288e922

Cherry pick PR 8630 to release 7.1

Rocksdb stats level knob. (apple#8713)

347bd92

Adding counters for singlekey clear requests (apple#8792)

c2321bf

debug seg fault

fadcb08

Revert "debug seg fault"

f6dc3ba

This reverts commit fadcb08.

[release-7.1] Add SS read range bytes metrics. (apple#8697) (apple#8724)

f9338d6

* Add SS read range bytes metrics. (apple#8697)
* Fix build failure
* clang-fmt
* fmt

Merge pull request apple#8802 from hfu94/byte71

b0f84b8

add bytelimit for prefetch (release-7.1)

Rocksdb suggest compact range checks

074e6f7

Merge pull request apple#8862 from neethuhaneesha/suggest-compacts-7.1

5839b6f

Rocksdb suggest compact range checks

RocksDB 7.7.3 version upgrade

a716127

Merge pull request apple#8880 from neethuhaneesha/rocksdb_version-7.1

76df95d

RocksDB 7.7.3 version upgrade

Fix backup worker assertion failure

5a12979

The number of released bytes exceeds the number of acquired bytes in locks. This is because the bytes counted towards release are calculated after a "wait", when more bytes could have been allocated.
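The accounting problem can be illustrated with a toy model. This is not the backup worker code: the `ToyWorker` class, its fields, and the `count_before_wait` switch are invented for the example, and `asyncio.sleep(0)` stands in for the "wait". The point it shows is that bytes counted after the wait include bytes that arrived during it, and those bytes are then released a second time by the next flush.

```python
import asyncio
from typing import Iterable, List, Tuple


class ToyWorker:
    def __init__(self) -> None:
        self.pending: List[int] = []  # sizes of buffered messages
        self.acquired = 0             # bytes acquired from the lock
        self.released = 0             # bytes released back to the lock

    def add(self, size: int) -> None:
        self.pending.append(size)
        self.acquired += size

    async def flush(self, count_before_wait: bool,
                    arrivals: Iterable[int] = ()) -> None:
        n = len(self.pending)                 # messages this flush persists
        to_release = sum(self.pending[:n])    # counted before the wait
        await asyncio.sleep(0)                # the "wait"
        for size in arrivals:                 # more bytes allocated meanwhile
            self.add(size)
        if not count_before_wait:
            # Buggy variant: counting after the wait picks up the new bytes.
            to_release = sum(self.pending)
        del self.pending[:n]
        self.released += to_release


def run_scenario(count_before_wait: bool) -> Tuple[int, int]:
    async def scenario() -> Tuple[int, int]:
        worker = ToyWorker()
        worker.add(10)
        await worker.flush(count_before_wait, arrivals=[5])
        await worker.flush(count_before_wait)
        return worker.acquired, worker.released

    return asyncio.run(scenario())
```

In this model, counting before the wait leaves acquired and released bytes equal, while counting after the wait releases more bytes than were ever acquired.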

Merge pull request apple#8887 from jzhou77/release-7.1

ae6ae7c

Fix backup worker assertion failure [release-7.1]

Increase buggified lock bytes for backup workers

5dbbbe9

To fix simulation failures where the knob value is too small.

sfc-gh-anoyes and others added 26 commits February 27, 2023 14:40

Avoid repeated search in VersionedMap::erase(iterator) (apple#9143)

1ae5b1f

Merge branch 'release-7.1' into zhewu/backup-size-estimate-7.1

038ee58

Use KeyspaceSnapshotFile to filter range files

5737512

Merge pull request apple#9508 from jzhou77/release-7.1

88d0f12

PTree improvements [release-7.1]

Merge pull request apple#9506 from halfprice/zhewu/backup-size-estima…

e3397c2

…te-7.1 [Release 7.1] Enhance fdbbackup query command to estimate data processing from a specific snapshot to a target version

Change mutation and KV logging to SevInfo

c12e0f1

Set max length as well to avoid TraceEventOverflow.

Output in HEX format for easy regex matching

23702f1

Merge branch 'release-7.1' into restore

f89610e

Merge pull request apple#9511 from jzhou77/restore

bf9c642

Use KeyspaceSnapshotFile to filter range files

Refactor decoder to read file as a whole once

8559e5c

To reduce the number of network requests.

Add more trace events

b59b4ab

Merge pull request apple#9560 from jzhou77/restore

ac51a07

Refactor decoder to read file as a whole once

Added 7.1.28 and 7.1.29 release notes

dd379cc

Reduce running time for ClogTlog

c7b524c

When the ClogTlog workload is running, we may have already passed 450s, i.e., SIM_SPEEDUP_AFTER_SECONDS, at which point clogging is no longer effective. If that's the case, we want to finish the test quickly.
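The check described here might look roughly like the sketch below. The 450-second value is taken from the message above. The function names and the choice to return a zero duration once the threshold has passed are assumptions for illustration, not the workload's actual behavior.

```python
# Value quoted in the commit message above.
SIM_SPEEDUP_AFTER_SECONDS = 450.0


def clogging_still_effective(now: float) -> bool:
    """Clogging only has an effect before the simulation speedup kicks in."""
    return now < SIM_SPEEDUP_AFTER_SECONDS


def remaining_test_duration(now: float, planned_duration: float) -> float:
    """Hypothetical helper: finish immediately once clogging is ineffective."""
    if not clogging_still_effective(now):
        return 0.0
    return planned_duration
```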

Merge pull request apple#9668 from jzhou77/release-7.1

4688bb2

Fix issue where the versions on seed storage servers decreased [release-7.1]

disable AVX for 7.1.28 release

a60d512

enable AVX and update version for 7.1.29 release

1b2517a

Merge commit '1b2517abce552441e3d0ed8836d1cc3f40e61a2a' into ow-fork-…

0373b4c

…7.1-29

Removed old stuff for clang compilation

ec5e948

Fixed compilation error with clang 15

2732d33

Fixed deploy on debian

a1750a3

Moved the test deploy logic from the git workflow file to a bash script.

080e298

Fixed workflow syntax

07a1398

Added libatomic to the dockerfile

68d44f4

oleg68 requested a review from foxyholic April 3, 2023 08:19

foxyholic approved these changes Apr 4, 2023

foxyholic merged commit 9779c5f into owtech:ow-fork-7.1 Apr 4, 2023

oleg68 deleted the ow-fork-7.1-29 branch April 5, 2023 11:02

20 participants
