Skip to content

[Feature] limit parallelly read file memory usage #1061

@leaves12138

Description

@leaves12138

Search before asking

  • I searched in the issues and found nothing similar.

Motivation

Parallelly read file in AbstractFileStoreScan class plan method will run a lot of memory while files number is large.

Solution

Make memory usage limit to
'''
readManifests
.parallelStream()
.filter(this::filterManifestFileMeta)
.flatMap(m -> readManifestFileMeta(m).stream())
.filter(this::filterByStats)
.collect(Collectors.toList()))
'''

Anything else?

No response

Are you willing to submit a PR?

  • I'm willing to submit a PR!

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions