feat: Force embedded ridbags when distributed #10549

ikysil · 2025-09-11T07:06:55Z

What does this PR do?
Force embedded RID bags to avoid replication errors in distributed mode.
See https://orientdb.dev/docs/3.2.x/general/Concurrency.html#concurrency-when-adding-edges

Motivation
Minimize surprises when updating large graphs in distributed mode.

Related issues
N/A

Additional Notes
N/A

Checklist

I have run the build using mvn clean package command
My unit tests cover both failure and success scenarios

tglman · 2025-09-11T13:49:30Z

@ikysil

What issue do you have with the tree ridbag in distributed ? do you use lightweight edges ?

Force embedded RID bags to avoid replication errors in distributed mode. See https://orientdb.dev/docs/3.2.x/general/Concurrency.html#concurrency-when-adding-edges

ikysil · 2025-12-08T20:48:49Z

Hi @tglman

The number of full syncs dropped to almost zero in a distributed cluster of 3 MASTER and 2 REPLICA nodes after applying this configuration.

We don't use lightweight edges (AFAIR).

Scenario before applying these properties:

We use round-robin connection strategy on a client, so that write load is distributed between masters.
We observed that vertex version is not modified when only edges were added and/or removed.
One master commits a change with edge(s) modifications only - vertex version is not modified.
Another master commits a change with property modification, modifying vertex version.
If edge modification is applied after property modification, the cluster can not agree during the second phase of the transaction, complaining about stale version.
A node retries a couple of times, then asks for a delta sync, retries that a few times, fails, then asks for a full sync.
Occasionally, we observe behavior described in 【BUG】Three masters in cluster mode, any master node re-pulled up leads to cluster synchronization data jamming #10427. We have to pull affected node from a cluster and add it back later - during quiet time.

The full sync is very expensive for us and takes more than 30 minutes.

tglman · 2025-12-09T14:05:32Z

Hi,

This sounds like a bug with the integration of ridbag trees in the distributed flow, I will try to write a test case for this scenario, this may be due to the fact that changes in the ridbag tree do not change the version of the document, allowing concurrent write with other write operations, we do have some distributed scheduling to avoid concurrent writing in the same document, but maybe this check is skipped for changes in ridbag trees.

Any support on reproducing the case is welcome!

…ags, as described by PR #10549

tglman · 2025-12-22T18:09:16Z

Hi,

I could write a somewhat minimal test case that reproduced the case and fixed it, so from the next hotfix the issue with inverted apply of distributed transaction due to ridbags tree is solved.

the commit that fix it is referring to this PR.

Bye

…ags, as described by PR #10549

tglman · 2025-12-24T10:57:38Z

Hi,

the 3.2.48 is released that should fix this case.

ikysil · 2025-12-31T13:56:08Z

Hi @tglman

TY for the fix and release.
It takes some time to verify as we don't want to risk production data integrity.

Let's close this PR as it is not needed by itself.
I will add a comment after the verification.

feat: Force embedded RID bags in distributed mode

caab146

Force embedded RID bags to avoid replication errors in distributed mode. See https://orientdb.dev/docs/3.2.x/general/Concurrency.html#concurrency-when-adding-edges

ikysil force-pushed the feat-force-embedded-ridbags-when-distributed branch from 182b64f to caab146 Compare December 8, 2025 20:28

tglman added a commit that referenced this pull request Dec 22, 2025

fix: corrected inverted order of apply of transactions with tree ridb…

7f440e2

…ags, as described by PR #10549

tglman added a commit that referenced this pull request Dec 22, 2025

fix: corrected inverted order of apply of transactions with tree ridb…

d0d58dd

…ags, as described by PR #10549

ikysil closed this Dec 31, 2025

ikysil deleted the feat-force-embedded-ridbags-when-distributed branch December 31, 2025 13:56

Ak1yama-mio mentioned this pull request Jan 12, 2026

【BUG】Three masters in cluster mode, any master node re-pulled up leads to cluster synchronization data jamming #10427

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

feat: Force embedded ridbags when distributed #10549

feat: Force embedded ridbags when distributed #10549

ikysil commented Sep 11, 2025 •

edited

Loading

Uh oh!

tglman commented Sep 11, 2025

Uh oh!

ikysil commented Dec 8, 2025 •

edited

Loading

Uh oh!

tglman commented Dec 9, 2025

Uh oh!

tglman commented Dec 22, 2025

Uh oh!

tglman commented Dec 24, 2025

Uh oh!

ikysil commented Dec 31, 2025

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

2 participants

Uh oh!

feat: Force embedded ridbags when distributed #10549

feat: Force embedded ridbags when distributed #10549

Conversation

ikysil commented Sep 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tglman commented Sep 11, 2025

Uh oh!

ikysil commented Dec 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tglman commented Dec 9, 2025

Uh oh!

tglman commented Dec 22, 2025

Uh oh!

tglman commented Dec 24, 2025

Uh oh!

ikysil commented Dec 31, 2025

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

2 participants

ikysil commented Sep 11, 2025 •

edited

Loading

ikysil commented Dec 8, 2025 •

edited

Loading