It is better to have seamless switching between sync and async inserts in terms of deduplication.
The issues right now:
- different directories
- different hashes: sync insert uses hash from part writes, async uses hash from the data
That leads to the situation that insert which is made as sync would not be deduplicated with the same insert but async. And vice versa.
The goal is to make switching between sync and async inserts transparent.