add out_of_band hook #1648

wjordan · 2018-09-13T19:49:34Z

This PR adds an out_of_band hook that is invoked when the worker is idle.

The hook runs immediately after a request has finished processing and there are no busy threads on the worker. The worker doesn't accept new requests until this code finishes.

This hook can be used for running out-of-band garbage collection as a solution to #450, e.g.:

# config/puma.rb
require 'gctools/oobgc'
out_of_band {GC::OOB.run}

It could also be useful for scheduling/deferring other asynchronous tasks to be run during idle periods on the server.

evanphx

Given the impact of this feature, namely that it enables folks to do work out of band from requests but doesn't explicitly do any of that work, I'm inclined to accept this.

I'd like some thoughts from @schneems and @nateberkopec too.

schneems

Overall seems like an good experiment. Have you tried running with this patch in production? Any numbers to share?

schneems · 2018-09-13T20:35:07Z

lib/puma/thread_pool.rb

          # spin up the max number of threads.
-          return if @todo.size - @waiting < @max - @spawned
+          busy_threads = @spawned - @waiting + @todo.size
+          return busy_threads if @max > busy_threads


I'm really jumpy around this line of code. This controls a disproportionate amount of what puma does compared to it's size. While the method is only a few lines of code there's 30 lines of docs (most of which I wrote).

Original code

@todo.size - @waiting < @max - @spawned

Can be re-written as

@todo.size - @waiting + @spawned < @max

or flipping it

@max > @todo.size - @waiting + @spawned

re-aranging

@max > @spawned - @waiting + @todo.size

And finally

busy_threads = @spawned - @waiting + @todo.size @max > busy_threads

So that looks fine. Just makes me nervous.

The other question to ask: is this the correct logic.

@waiting is the count of threads that have been spawned but are idle. So busy count would be the spawned count minus the idle count.

We then need to handle for the case where a request has been processed but not picked up yet so we subtract the backlog.

So that makes sense.

Edit 7 years later: "so we subtract the backlog." is incorrect, we're adding the backlog (@todo.size). Which: if there's one idle thread and there's one request in the queue then we count the number of "busy" threads as one because that thread is about to be busy (even if it isn't right this second). A bug here is that we're not accounting for the case where @todo.size is > @waiting. When this code was written, it was possible, but usually only by a small count. This code has persisted and lives on in Puma 7 in the form ofbusy_threads methods and stats and now the todo queue is no longer as constrained. So I that bug has a larger impact.

I'm less confident the logic is correct to begin with, only that it's unchanged by the algebraic refactoring.

In fact, during some local benchmarking I noticed some weird anomalies where busy_threads was logged as 2 after only a single request was added to the worker-process, so I do suspect there might be some concurrency edge-case not handled perfectly by the current logic.

Anyway, it's certainly possible to leave the equation untouched by this PR if that makes it any less nerve-wracking. I figured making the 'busy threads' count more explicit in this method makes the logic easier to follow, but the change isn't strictly necessary.

after only a single request was added to the worker-process

Was this a web page load or did you make the request via curl? if it's a webpage don't forget there are assets that each require a different request.

there might be some concurrency edge-case not handled perfectly by the current logic.

Yes there's a known race condition between the time where a request is added to the @todo array and when a new thread picks it up and starts working on it. So it's possible that @todo.size might be greater than 0 but all threads be idle.

Anyway, it's certainly possible to leave the equation untouched by this PR if that makes it any less nerve-wracking. I figured making the 'busy threads' count more explicit in this method makes the logic easier to follow, but the change isn't strictly necessary.

I'm fine with the change just wanted to be sure I talked through it out loud and it preserved behavior.

wjordan · 2018-09-14T18:45:26Z

Overall seems like an good experiment. Have you tried running with this patch in production? Any numbers to share?

I just ran a ~12-hour side-by-side comparison of two Puma servers running Ruby 2.5, with/without the OOBGC from tmm1/gctools#17 along with this PR. The difference on my current production workload is slight but still noticeable.

Average response duration over a 3-hour interval of moderate traffic was 53.31ms without oobgc, and 49.34 with.
Average total memory utilization after 12-hours of uptime was at 53.2% without oobgc, and 49.0% with.

So both response duration and memory utilization are slightly improved by about ~7-8% running out-of-band GC on my workload.

wjordan mentioned this pull request Sep 13, 2018

Ruby 2.1: Out-of-Band GC #450

Closed

add out_of_band hook

53d0234

wjordan force-pushed the oob branch from f87deb2 to 53d0234 Compare September 13, 2018 20:01

evanphx self-requested a review September 13, 2018 20:03

evanphx approved these changes Sep 13, 2018

View reviewed changes

schneems approved these changes Sep 13, 2018

View reviewed changes

evanphx merged commit 011be8e into puma:master Feb 20, 2019

wjordan mentioned this pull request Dec 11, 2019

Even accept #1920

Closed

GuiTeK mentioned this pull request Mar 12, 2020

out_of_band hook not working when using multiple threads #2177

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add out_of_band hook #1648

add out_of_band hook #1648

Uh oh!

wjordan commented Sep 13, 2018

Uh oh!

evanphx left a comment

Uh oh!

schneems left a comment

Uh oh!

schneems Sep 13, 2018

Uh oh!

schneems Sep 13, 2018 •

edited

Loading

Uh oh!

wjordan Sep 14, 2018

Uh oh!

schneems Sep 14, 2018

Uh oh!

wjordan commented Sep 14, 2018 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

add out_of_band hook #1648

add out_of_band hook #1648

Uh oh!

Conversation

wjordan commented Sep 13, 2018

Uh oh!

evanphx left a comment

Choose a reason for hiding this comment

Uh oh!

schneems left a comment

Choose a reason for hiding this comment

Uh oh!

schneems Sep 13, 2018

Choose a reason for hiding this comment

Uh oh!

schneems Sep 13, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

wjordan Sep 14, 2018

Choose a reason for hiding this comment

Uh oh!

schneems Sep 14, 2018

Choose a reason for hiding this comment

Uh oh!

wjordan commented Sep 14, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

schneems Sep 13, 2018 •

edited

Loading

wjordan commented Sep 14, 2018 •

edited

Loading