Change Snapshot option from file cache to TTree auto-flush parameter by amadio · Pull Request #595 · root-project/root

amadio · 2017-05-30T13:28:54Z

Please take a look. Although the tests pass, this is not supposed to be merged yet, as I have yet to understand why it's so slow when we pass a positive auto-flush value to snapshot the tree.

phsft-bot · 2017-05-30T13:29:06Z

Starting build on gcc49/centos7, native/mac1012, gcc49/slc6, gcc62/slc6, native/ubuntu14 with CMake flags -Dvc=OFF -Dimt=ON -Dccache=ON

amadio · 2017-05-30T13:57:12Z

tree/treeplayer/inc/ROOT/TDFInterface.hxx

            }
            trees[slot]->Fill();
-            if (files[slot]->GetBytesWritten() >= cachesize) files[slot]->Write();
+            if (autoflush > 0 && trees[slot]->GetEntries() % autoflush == 0)


I wanted to use the condition below, but somehow it blocks progress and the snapshot gets stuck when autoflush < 0.

if ((autoflush > 0 && trees[slot]->GetEntries() % autoflush == 0) || (autoflush < 0 && trees[slot]->GetZipBytes() >= -autoflush)) files[slot]->Write();

From where that code stands, it should 'never' act when autoflush is negative anyway (because it needs to act only after at least one cluster has been closed).

That is true. However, never acting when autoflush is negative means that buffers are never sent to be merged until the end, in which case the user may blow up the memory on his system and lose his whole analysis...

> in which case the user may blow up the memory

The memory is still limited by the TTree itself (i.e. the size of the cluster * the compression), and adding the test where you have it would not, by definition, change that limit.

The memory of the TTree is limited. However, the memory of the TMemFile where things end up is not (i.e. TBufferMergerFile).

For the record, the disconnect was me misreading the current code, which used the parameter passed by the user instead of tree->GetAutoFlush(). Using tree->GetAutoFlush() is essential because the value changes as the TTree moves from the first events (where GetAutoFlush() is negative and expresses the desired cluster size in compressed bytes, and during which the real cluster size is being determined) to the normal regime (where GetAutoFlush() returns the cluster size in number of events).
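The two regimes described above can be sketched as a small decision function. This is only an illustration under stated assumptions: `ShouldWriteAfterFill` is a hypothetical helper, not part of ROOT, and its `autoFlush` argument stands in for whatever `tree->GetAutoFlush()` currently returns; `entries` stands in for `tree->GetEntries()`.

```cpp
// Hypothetical helper (not a ROOT API): decide whether a slot should call
// files[slot]->Write() right after trees[slot]->Fill().
//
// autoFlush models the *current* value of tree->GetAutoFlush():
//   autoFlush < 0 : cluster size still being determined; the magnitude is a
//                   target in compressed bytes, so no cluster has closed yet
//   autoFlush > 0 : cluster size expressed in number of entries
//   autoFlush == 0: auto-flush disabled
bool ShouldWriteAfterFill(long long autoFlush, long long entries)
{
   // While the value is negative (or zero) there is no completed cluster
   // to hand over, so never trigger a write from here.
   if (autoFlush <= 0)
      return false;
   // In the normal regime, write exactly when a cluster boundary is reached.
   return entries > 0 && entries % autoFlush == 0;
}
```

Because the value is re-read on every call, the check starts firing on its own once the tree leaves the size-determination phase, which a cached user parameter cannot do.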

OK, after the discussion I understand the problem and the solution now. However, after I replaced the variable with a call to tree->GetAutoFlush(), buffers are written every 4 KB, which is very slow.

If by 'buffers are written' you mean that Write is triggered very often, then I am very confused, because it should only be triggered once the cluster has filled to 30 MB of compressed data. Something must still be wrong with the test itself.

What are the values of GetAutoFlush and GetEntries when the if-statement triggers?

My bad. I used the wrong branch of the tests by mistake. Those still use megabytes as the unit, which made the write happen way too often.
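A rough way to see why a unit mix-up makes writes so frequent: with the `entries % autoflush == 0` condition from the diff, a small positive number that was meant as megabytes gets read as a cluster size in entries. The sketch below only counts how often that condition fires; `CountWrites` is a hypothetical helper, and the values actually used by the tests are not given in this thread.

```cpp
// Hypothetical illustration (not ROOT code): count how many times the
// write condition from the diff fires while filling nEntries entries
// with a fixed, positive autoFlush value.
long long CountWrites(long long autoFlush, long long nEntries)
{
   long long writes = 0;
   for (long long entries = 1; entries <= nEntries; ++entries) {
      // Same test as in the diff: act only on cluster boundaries.
      if (autoFlush > 0 && entries % autoFlush == 0)
         ++writes;
   }
   return writes;
}
```

For example, a value of 1 (intended as "1 MB") triggers a write after every single entry, while a cluster size larger than the number of entries never triggers one before the end.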

phsft-bot · 2017-05-30T14:02:31Z

Build failed on mac1012/native.
See console output.

Failing tests:

projectroot.math.mathcore.test.mathcore_testLogLExecPolicy

phsft-bot · 2017-05-30T17:28:50Z

Starting build on gcc49/centos7, native/mac1012, gcc49/slc6, gcc62/slc6, native/ubuntu14 with CMake flags -Dvc=OFF -Dimt=ON -Dccache=ON

phsft-bot · 2017-05-30T18:05:04Z

Build failed on ubuntu14/native.
See console output.

Failing tests:

projectroot.math.mathcore.test.mathcore_testLogLExecPolicy

phsft-bot · 2017-05-30T18:19:48Z

Build failed on slc6/gcc62.
See console output.

Failing tests:

projectroot.math.mathcore.test.mathcore_testLogLExecPolicy

phsft-bot · 2017-05-31T06:34:07Z

Starting build on gcc49/centos7, native/mac1012, gcc49/slc6, gcc62/slc6, native/ubuntu14 with CMake flags -Dvc=OFF -Dimt=ON -Dccache=ON

phsft-bot · 2017-05-31T08:41:43Z

Starting build on gcc49/centos7, native/mac1012, gcc49/slc6, gcc62/slc6, native/ubuntu14 with CMake flags -Dvc=OFF -Dimt=ON -Dccache=ON

phsft-bot · 2017-05-31T10:19:34Z

Starting build on gcc49/centos7, native/mac1012, gcc49/slc6, gcc62/slc6, native/ubuntu14 with CMake flags -Dvc=OFF -Dimt=ON -Dccache=ON

phsft-bot · 2017-05-31T11:08:31Z

Build failed on mac1012/native.
See console output.

Failing tests:

projectroot.math.mathcore.test.mathcore_testLogLExecPolicy

amadio assigned dpiparo May 30, 2017

amadio changed the title ~~[WIP] Change Snapshot option from file cache to TTree auto-flush parameter~~ WIP:Change Snapshot option from file cache to TTree auto-flush parameter May 30, 2017

amadio changed the title ~~WIP:Change Snapshot option from file cache to TTree auto-flush parameter~~ WIP: Change Snapshot option from file cache to TTree auto-flush parameter May 30, 2017

amadio changed the title ~~WIP: Change Snapshot option from file cache to TTree auto-flush parameter~~ WIP: Change Snapshot option from file cache to TTree auto-flush parameter ⚠️ May 30, 2017

amadio changed the title ~~WIP: Change Snapshot option from file cache to TTree auto-flush parameter ⚠️~~ WIP: Change Snapshot option from file cache to TTree auto-flush parameter May 30, 2017

amadio force-pushed the tdf-snapshot-autoflush branch from 53b33e3 to aa40254 Compare May 30, 2017 17:28

amadio force-pushed the tdf-snapshot-autoflush branch from aa40254 to 8dce471 Compare May 31, 2017 06:33

amadio force-pushed the tdf-snapshot-autoflush branch from 8dce471 to fab2053 Compare May 31, 2017 08:41

amadio changed the title ~~WIP: Change Snapshot option from file cache to TTree auto-flush parameter~~ Change Snapshot option from file cache to TTree auto-flush parameter May 31, 2017

[TDF] Use TTree auto-flush parameter in snapshot action

2353b78

amadio force-pushed the tdf-snapshot-autoflush branch from fab2053 to 2353b78 Compare May 31, 2017 10:19

amadio merged commit 8461f69 into root-project:master May 31, 2017

amadio deleted the tdf-snapshot-autoflush branch May 31, 2017 10:19

phsft-bot mentioned this pull request Feb 23, 2018

[cxxmodules] Refactor LoadCoreModules #1665

Merged

phsft-bot mentioned this pull request Jan 16, 2022

[cxxmodules] Use the global module index only when no rootmap candidate is found #9592

Merged
