Optimized based on output from profiler on large sheets by torerefsnes · Pull Request #112 · monitorjbl/excel-streaming-reader

torerefsnes · 2017-09-22T07:49:46Z

I ran across a performance issue with large spreadsheets, and investigated this using the YourKit profiler.

Two bottlenecks that could be addressed were identified: one is heavy use of a regexp in a function used to split a cell reference string, and the other because of unnecessary formatting of a value, where the formatting could be deferred until if/when the value was actually used.

With my set of files, total processing time went from 944 to 525 seconds with these changes included.

Note to reviewer: the change from regexp to a simplified method when splitting the cell reference works for all tests in the project, and on my testset. If I missed a format that could occur here, please let me know and I will add tests and update the code.

monitorjbl · 2017-09-30T15:43:48Z

Interesting, will have to confirm after merging. Regardless, deferring the cost of number parsing until they're used seems like a good idea.

monitorjbl · 2017-09-30T16:23:50Z

I'm not seeing the massive improvement you did with my 250k row test sheet. It is roughly 10% faster, so there's definitely an improvement but it's not nearly as marked as what you saw.

torerefsnes · 2017-09-30T18:47:24Z

If you’re interested, I’ll make my test set available.

monitorjbl · 2017-09-30T19:03:39Z

If it's not private data, that would be very helpful

Optimized based on output from profiler on large sheets

192d5e2

monitorjbl merged commit 3e0ff13 into monitorjbl:master Sep 30, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Optimized based on output from profiler on large sheets#112

Optimized based on output from profiler on large sheets#112
monitorjbl merged 1 commit intomonitorjbl:masterfrom
torerefsnes:performance-enhancements-with-large-files

torerefsnes commented Sep 22, 2017

Uh oh!

monitorjbl commented Sep 30, 2017

Uh oh!

monitorjbl commented Sep 30, 2017

Uh oh!

torerefsnes commented Sep 30, 2017

Uh oh!

monitorjbl commented Sep 30, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Conversation

torerefsnes commented Sep 22, 2017

Uh oh!

monitorjbl commented Sep 30, 2017

Uh oh!

monitorjbl commented Sep 30, 2017

Uh oh!

torerefsnes commented Sep 30, 2017

Uh oh!

monitorjbl commented Sep 30, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants