
ForwardMsgCache expiration #82

Merged
tconkling merged 20 commits into streamlit:feature/hashing from tconkling:tim/MessageCacheEviction
Sep 17, 2019

Conversation

@tconkling
Contributor

This implements ForwardMsgCache expiration on both the client and the server.

  • The global.maxCachedMessageAge config value specifies the maximum age of a message in the cache. The age of a message is defined by the number of times a referencing report has finished running since the message was accessed by that report. So when a report finishes running for the first time, the age of all its cached messages is 1.
  • Our default maxCachedMessageAge is 2, which means that messages will remain cached even after not having been referenced for a report run.
  • To determine the age of messages in its cache, the server tracks a report_run_count alongside each ReportSession in its websocket->reportsession dictionary. The server increments this value each time it processes a report_finished ForwardMsg message for a report.
  • When the server increments report_run_count, it also asks the cache to remove any expired messages. I haven't done anything fancy here - the cache just iterates through all its entries, does an age check, and deletes any that are expired. @tvst, I know you were potentially concerned about performance issues with the "iterate the whole cache" strategy. We could add a simple timer to this function to collect local metrics to see if it's actually an issue?
  • Similarly, when the client receives a report_finished ForwardMsg, it performs the same cache expiration step. This means that the server and client's caches should stay in sync. (I haven't thought hard enough about whether it's possible for them to get briefly out of sync, but if they do, we have the new /message endpoint as a fallback.)
  • If a report fails to run due to a compilation error, neither the server nor the client will increment the report_run_count for that session. (The run-count incrementing is conditional on the status value of the report_finished message, which is set to FINISHED_WITH_COMPILE_ERROR when that happens.)
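The mechanics described above can be sketched as follows. This is a simplified illustration, not the actual Streamlit implementation: the class and method names, the `on_report_finished` hook, and the `"FINISHED_SUCCESSFULLY"` status string are all hypothetical (only `FINISHED_WITH_COMPILE_ERROR` is named in this PR).

```python
class CacheEntry:
    """One cached ForwardMsg plus per-session access bookkeeping (sketch)."""

    def __init__(self, msg):
        self.msg = msg
        # session_id -> report_run_count at the session's last access.
        self._access_counts = {}

    def touch(self, session_id, run_count):
        self._access_counts[session_id] = run_count

    def get_age(self, session_id, run_count):
        # Runs completed since the session last accessed this entry.
        # Entries never touched by the session are treated as fresh.
        last = self._access_counts.get(session_id, run_count)
        return run_count - last


class ForwardMsgCache:
    def __init__(self, max_age=2):  # corresponds to global.maxCachedMessageAge
        self.max_age = max_age
        self._entries = {}  # msg_hash -> CacheEntry

    def add(self, msg_hash, msg, session_id, run_count):
        entry = self._entries.setdefault(msg_hash, CacheEntry(msg))
        entry.touch(session_id, run_count)

    def remove_expired(self, session_id, run_count):
        # Simple full scan: check every entry's age, drop the expired ones.
        expired = [
            h for h, e in self._entries.items()
            if e.get_age(session_id, run_count) > self.max_age
        ]
        for h in expired:
            del self._entries[h]

    def __contains__(self, msg_hash):
        return msg_hash in self._entries


def on_report_finished(session, cache, status):
    """Hypothetical handler for a report_finished ForwardMsg."""
    # A compile error doesn't count as a completed run, so the run count
    # (and therefore every cached message's age) stays put.
    if status == "FINISHED_WITH_COMPILE_ERROR":
        return
    session.report_run_count += 1
    cache.remove_expired(session.session_id, session.report_run_count)
```

With `max_age=2`, a message accessed in run N survives runs N+1 and N+2 without being referenced, and is evicted when run N+3 finishes.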

There are client and server tests for all this new logic. But the easiest way to see the expiration in action is to run something like examples/core/message_deduping.py and watch the debug output on the server and the client. (You'll see logs for cache hits and misses - you should never see a miss! - on the client; and logs for sending cached message refs on the server.)

  • This example just creates a big dataframe and sends it twice
  • The first time the report is run, the message will only be delivered to the client once.
  • If you rerun the report, the message won't be delivered at all because it'll be cached.
  • If you remove the st.dataframe() calls and rerun the report once, the dataframe message will not be removed from the cache, because it won't be old enough yet. Re-adding those calls and rerunning the report should confirm this (cache hit, no resend).
  • If you remove the dataframe calls and re-run the report twice with them missing, they will then be expired from the cache. Re-adding them should result in the message being resent to the client.
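The walkthrough above can be reproduced with a toy single-session simulation of the cache, assuming the age rules described earlier (the `run_report` helper and `MAX_AGE` constant are illustrative, not Streamlit code):

```python
# Toy simulation: a message is cached, goes unreferenced for two
# report runs, and is then evicted (default maxCachedMessageAge = 2).
MAX_AGE = 2
cache = {}      # msg_hash -> run_count at last access
run_count = 0   # completed report runs


def run_report(referenced_hashes):
    """Simulate one report run: touch referenced messages, then expire."""
    global run_count
    for h in referenced_hashes:
        cache[h] = run_count          # accessed during this run
    run_count += 1                    # the run has finished
    for h in [h for h, last in cache.items() if run_count - last > MAX_AGE]:
        del cache[h]


run_report(["df"])   # first run: the dataframe message is cached
run_report(["df"])   # rerun: still referenced, stays cached
run_report([])       # st.dataframe() removed: age 2, still cached
run_report([])       # second run without it: age 3 > MAX_AGE, evicted
print("df" in cache)
```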

@tconkling tconkling requested review from monchier and tvst September 10, 2019 18:53
Contributor

@tvst left a comment


Still reviewing this, but wanted to send out some comments.

Contributor

@tvst left a comment


LGTM after fixes

@tconkling tconkling merged commit bbbb286 into streamlit:feature/hashing Sep 17, 2019
@tconkling tconkling deleted the tim/MessageCacheEviction branch September 17, 2019 18:10