
Conversation

@dzhulgakov (Collaborator) commented May 20, 2019:

The idea is that when PyTorch is used in a custom build environment (e.g. at Facebook), it's useful to track usage of various APIs centrally. This PR introduces a simple, very lightweight mechanism to do so: only the first invocation of a trigger point is logged. This is significantly more lightweight than #18235, so we can afford to put logging in places like TensorImpl.

It also adds an initial list of trigger points. Trigger points are added in such a way that no static initialization triggers them, i.e. just linking against libtorch.so will not cause any logging. Further suggestions of what to log are welcome.

Test plan:
Used the PYTORCH_API_USAGE_STDERR=1 env var with various scenarios and verified that logging is indeed triggered.
Given the only-once nature of the logging, I'm not sure a unit test would be that beneficial, as it might be impacted by how multiple unit tests are linked together in one binary.

@pytorchbot added labels on May 20, 2019: caffe2, oncall: jit, module: cuda, module: dataloader, oncall: distributed, module: internals, module: nn, module: optimizer, module: pybind
@facebook-github-bot (Contributor) commented:
@dzhulgakov has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@dzhulgakov dzhulgakov merged commit d3059b9 into pytorch:master May 20, 2019
*
* In order to ensure light-weightness of logging, we utilize static variable
* trick - LogAPIUsage will be invoked only once and further invocations will
* just do an atomic check.
Contributor:
An atomic check in, for example, a hot loop: wouldn't that add a few hundred nanoseconds per TensorImpl usage? Is the logging only enabled if the env variable is set, and the atomic check skipped if it's not?

@dzhulgakov (Collaborator, author) replied:

The logging will be enabled all the time in e.g. the FB environment. The env var is just for testing the default implementation.

I've tried to measure the overhead, and if I don't wipe the cache (it's hard to measure with a cold cache, as the overhead is so tiny), it comes down to something like 0.3 ns (nanoseconds), so it's definitely fine. The static-variable initialization trick is used widely (including for singletons), so it should be cheap to sprinkle around: https://stackoverflow.com/questions/23829389/does-a-function-local-static-variable-automatically-incur-a-branch

@dzhulgakov dzhulgakov deleted the api-metrics branch May 21, 2019 05:01
facebook-github-bot pushed a commit that referenced this pull request May 24, 2019
Summary:
Resubmit #20698 which got messed up.

The idea is that when PyTorch is used in a custom build environment (e.g. at Facebook), it's useful to track usage of various APIs centrally. This PR introduces a simple, very lightweight mechanism to do so: only the first invocation of a trigger point is logged. This is significantly more lightweight than #18235, so we can afford to put logging in places like TensorImpl.

It also adds an initial list of trigger points. Trigger points are added in such a way that no static initialization triggers them, i.e. just linking against libtorch.so will not cause any logging. Further suggestions of what to log are welcome.
Pull Request resolved: #20745

Differential Revision: D15429196

Pulled By: dzhulgakov

fbshipit-source-id: a5e41a709a65b7ebccc6b95f93854e583cf20aca
@apaszke (Contributor) left a comment:

Why is this log-once thing of any use? How were the places where we add the log selected? It all seems very arbitrary to me and only contributes to the messiness of our codebase.

batch_sampler=None, num_workers=0, collate_fn=default_collate,
pin_memory=False, drop_last=False, timeout=0,
worker_init_fn=None):
torch._C._log_api_usage_once("python.data_loader")
Contributor:

Well, those are not free, so why are we hardcoding them in the core when literally no user other than FB cares about this feature?

Contributor:

I think it's time to actually flesh out a design and expose logging properly. I think this feature would be pretty useful to a wider set of folks: anyone doing thousands of experiments fleet-wide.

Contributor:

I'm OK with adding an API to do this, but some of the decisions taken in this one seem fairly arbitrary to me.
