
Better performance with tf.function
In TensorFlow 2, eager execution is turned on by default. The user interface is intuitive and
flexible (running one-off operations is much easier and faster), but this can come at the
expense of performance and deployability.

You can use tf.function to make graphs out of your programs. It is a transformation tool that
creates Python-independent dataflow graphs out of your Python code. This will help you create
performant and portable models, and it is required to use SavedModel.

This guide will help you conceptualize how tf.function works under the hood, so you can
use it effectively.

The main takeaways and recommendations are:

Debug in eager mode, then decorate with @tf.function .


Don't rely on Python side effects like object mutation or list appends (see the sketch after this list).
tf.function works best with TensorFlow ops; NumPy and Python calls are converted to
constants.
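
For example, a Python side effect such as appending to a list happens only while the function is being traced, not on every call. A minimal sketch (the names here are illustrative):

import tensorflow as tf

trace_log = []  # a plain Python list, mutated as a side effect

@tf.function
def append_and_add(x):
  trace_log.append(1)  # runs only during tracing, not on every call
  return x + 1

append_and_add(tf.constant(1))
append_and_add(tf.constant(2))  # same input type: the cached graph is reused
print(len(trace_log))  # 1, not 2 -- the append was not re-executed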

Basics
Usage
A tf.function that you define (for example by applying the @tf.function decorator) is just
like a core TensorFlow operation: You can execute it eagerly; you can compute gradients; and
so on.
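
As a minimal sketch (the function name is illustrative), you can define a tf.function, call it like any other op, and differentiate through it:

import tensorflow as tf

@tf.function
def square(x):
  return x * x

x = tf.constant(3.0)
print(square(x))  # runs eagerly from the caller's point of view

with tf.GradientTape() as tape:
  tape.watch(x)
  y = square(x)
print(tape.gradient(y, x))  # 6.0 -- gradients flow through the tf.function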

tf.functions can be faster than eager code, especially for graphs with many small ops. But
for graphs with a few expensive ops (like convolutions), you may not see much speedup.
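
A rough way to check this for your own code is to time the eager and traced versions, for example with Python's timeit (a sketch, not a rigorous benchmark; the function here is made up):

import timeit
import tensorflow as tf

x = tf.random.uniform([10, 10])

def many_small_ops(x):
  for _ in range(100):
    x = x + 1.0  # many cheap ops: graph execution tends to help here
  return x

graph_version = tf.function(many_small_ops)
graph_version(x)  # trigger tracing once, so tracing time is excluded below

print("eager:", timeit.timeit(lambda: many_small_ops(x), number=100))
print("graph:", timeit.timeit(lambda: graph_version(x), number=100))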

Tracing
This section exposes how tf.function works under the hood, including implementation
details which may change in the future. However, once you understand why and when tracing
happens, it's much easier to use tf.function effectively!

What is "tracing"?
A tf.function runs your program in a TensorFlow Graph. However, a tf.Graph cannot
represent all the things that you'd write in an eager TensorFlow program. For instance, Python
supports polymorphism, but tf.Graph requires its inputs to have a specified data type and
dimension. Or you may perform side tasks like reading command-line arguments, raising an
error, or working with a more complex Python object; none of these things can run in
a tf.Graph .

tf.function bridges this gap by separating your code into two stages:

1. In the first stage, referred to as "tracing", tf.function creates a new tf.Graph . Python
code runs normally, but all TensorFlow operations (like adding two Tensors) are deferred:
they are captured by the tf.Graph and not run.
2. In the second stage, a tf.Graph which contains everything that was deferred in the first
stage is run. This stage is much faster than the tracing stage.

Depending on its inputs, tf.function will not always run the first stage when it is called.
See "Rules of tracing" below to get a better sense of how it makes that determination. Skipping
the first stage and only executing the second stage is what gives you TensorFlow's high
performance.

When tf.function does decide to trace, the tracing stage is immediately followed by the
second stage, so calling the tf.function both creates and runs the tf.Graph . Later you will
see how you can run only the tracing stage with get_concrete_function .

When you pass arguments of different types into a tf.function , both stages are run:

@tf.function
def double(a):
  print("Tracing with", a)
  return a + a

print(double(tf.constant(1)))
print()
print(double(tf.constant(1.1)))
print()
print(double(tf.constant("a")))
print()

Note that if you repeatedly call a tf.function with the same argument type, TensorFlow will
skip the tracing stage and reuse a previously traced graph, as the generated graph would be
identical. You can use pretty_printed_concrete_signatures() to see all of the available traces.
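
For the double function above, each argument type produced its own trace, and you can list them:

print(double.pretty_printed_concrete_signatures())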

So far, you've seen that tf.function creates a cached, dynamic dispatch layer over
TensorFlow's graph tracing logic. To be more specific about the terminology:
A tf.Graph is the raw, language-agnostic, portable representation of a TensorFlow computation.
Tracing is the process through which new tf.Graphs are generated from Python code.
An instance of tf.Graph is specialized to the specific input types it was traced with. Differing types require retracing.
Each traced tf.Graph has a corresponding ConcreteFunction.
A tf.function manages a cache of ConcreteFunctions and picks the right one for your inputs.
tf.function wraps the Python function that will be traced, returning a tf.types.experimental.PolymorphicFunction object.
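
For example, you can ask a tf.function for the ConcreteFunction matching a particular input type with get_concrete_function; this runs only the tracing stage. A short sketch, reusing the double function from above:

double_strings = double.get_concrete_function(tf.constant("a"))
print(double_strings(tf.constant("b")))  # dispatches directly to that trace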

Rules of tracing

When called, a tf.function first evaluates the type of each input argument using
its tf.types.experimental.TraceType. This is used to construct
a tf.types.experimental.FunctionType describing the signature of the
desired ConcreteFunction. This FunctionType is compared to the FunctionTypes of
existing ConcreteFunctions. If a matching ConcreteFunction is found, the call is dispatched
to it. If no match is found, a new ConcreteFunction is traced for the desired FunctionType.

If multiple matches are found, the most specific signature is chosen. Matching is done
by subtyping, much like normal function calls in C++ or Java. For
example, TensorShape([1, 2]) is a subtype of TensorShape([None, None]), so a call to
the tf.function with TensorShape([1, 2]) can be dispatched to
the ConcreteFunction produced with TensorShape([None, None]); but if
a ConcreteFunction with TensorShape([1, None]) also exists, it will be prioritized, since
it is more specific.
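
A sketch of that dispatch behavior (the function name is illustrative; the print only fires when a new graph is actually traced):

import tensorflow as tf

@tf.function
def flatten(x):
  print("Tracing with shape", x.shape)
  return tf.reshape(x, [-1])

# Trace a fully general graph first.
flatten.get_concrete_function(tf.TensorSpec([None, None], tf.float32))

# A [1, 2] input is a subtype of [None, None], so this call can be
# dispatched to the existing trace instead of creating a new one.
flatten(tf.zeros([1, 2]))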

The TraceType is determined from input arguments as follows:

For Tensor, the type is parameterized by the Tensor's dtype and shape; ranked
shapes are a subtype of unranked shapes; fixed dimensions are a subtype of unknown
dimensions.
For Variable, the type is similar to Tensor, but also includes a unique resource ID of the
variable, necessary to correctly wire control dependencies.
For Python primitive values, the type corresponds to the value itself. For example,
the TraceType of the value 3 is LiteralTraceType<3>, not int.
For Python ordered containers such as list and tuple, etc., the type is parameterized
by the types of their elements; for example, the type of [1, 2]
is ListTraceType<LiteralTraceType<1>, LiteralTraceType<2>> and the type
for [2, 1] is ListTraceType<LiteralTraceType<2>, LiteralTraceType<1>>, which is
different.
For Python mappings such as dict, the type is also a mapping from the same keys but to
the types of values instead of the actual values. For example, the type of {1: 2, 3: 4}
is MappingTraceType<<KeyValue<1, LiteralTraceType<2>>>, <KeyValue<3,
LiteralTraceType<4>>>>. However, unlike ordered containers, {1: 2, 3: 4} and {3:
4, 1: 2} have equivalent types.
For Python objects which implement the __tf_tracing_type__ method, the type is
whatever that method returns.
For any other Python objects, the type is a generic TraceType, and the matching
procedure is:
First it checks if the object is the same object used in the previous trace (using
Python id() or is). Note that this will still match if the object has changed, so if you
use Python objects as tf.function arguments it's best to use immutable ones.
Next it checks if the object is equal to the object used in the previous trace (using
Python ==).
Note that this procedure only keeps a weakref to the object and hence only works as long
as the object is in scope/not deleted.
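
For instance, because the TraceType of a Python primitive is the literal value itself, passing Python numbers instead of tensors triggers a new trace for every distinct value. A small sketch (the function name is illustrative):

import tensorflow as tf

@tf.function
def add_one(x):
  print("Tracing with", x)
  return x + 1

add_one(tf.constant(1))  # traces once for scalar int32 tensors
add_one(tf.constant(2))  # reuses that trace
add_one(1)               # Python int 1: a new trace
add_one(2)               # Python int 2: yet another trace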

Controlling retracing
Retracing, which is when your tf.function creates more than one trace, helps ensure that
TensorFlow generates correct graphs for each set of inputs. However, tracing is an expensive
operation! If your tf.function retraces a new graph for every call, you'll find that your code
executes more slowly than if you didn't use tf.function .

To control the tracing behavior, you can use the following techniques:

Pass a fixed input_signature to tf.function


This forces tf.function to constrain itself to only
one tf.types.experimental.FunctionType composed of the types enumerated by
the input_signature. Calls that cannot be dispatched to this FunctionType will throw an
error.
Use unknown dimensions for flexibility

Since TensorFlow matches tensors based on their shape, using a None dimension as a
wildcard will allow tf.functions to reuse traces for variably-sized input. Variably-sized input
can occur if you have sequences of different length, or images of different sizes for each batch.
You can check out the Transformer and Deep Dream tutorials for examples.

@tf.function(input_signature=(tf.TensorSpec(shape=[None], dtype=tf.int32),))
def g(x):
  print('Tracing with', x)
  return x
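
Calls that fit this signature reuse the single trace; calls that don't are rejected instead of triggering a retrace, for example:

print(g(tf.constant([1, 2, 3])))  # matches the 1-D int32 signature
# g(tf.constant([[1], [2]]))      # rank mismatch: raises an error rather than retracing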

Use reduce_retracing for automatic flexibility


When reduce_retracing is enabled, tf.function identifies supertypes of the input types it
observes and automatically traces more generalized graphs. This is less efficient than setting
the input_signature directly, but useful when many types need to be supported.

@tf.function(reduce_retracing=True)
def g(x):
  print('Tracing with', x)
  return x
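
In that mode, if the same function keeps seeing tensors of different shapes, tf.function can relax the traced shape to a more general one rather than retracing forever. A sketch (exactly when it generalizes is an implementation detail):

g(tf.constant([1, 2]))        # traces for shape [2]
g(tf.constant([1, 2, 3]))     # a different shape; subsequent traces may use a relaxed shape such as [None]
g(tf.constant([1, 2, 3, 4]))  # ideally reuses the relaxed trace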
