Intro To Deep Learning

This document provides an introduction to deep learning with GPUs. It discusses what deep learning is and how it has become a popular approach for developing artificial intelligence. It explains how deep learning uses neural networks with many layers to automatically learn representations of data for tasks like image classification. GPUs are well-suited for deep learning due to their parallel processing capabilities and are being widely used for deep learning research and applications. Popular deep learning frameworks that support GPU acceleration are also discussed.

Uploaded by

CNueman


INTRODUCTION TO DEEP LEARNING WITH GPUS

July 2015

AGENDA

1 What is Deep Learning?
2 Deep Learning software
3 Deep Learning deployment

What is Deep Learning?

DEEP LEARNING & AI

Deep Learning has become the most popular approach to developing Artificial Intelligence (AI): machines that perceive and understand the world.

The focus is currently on specific perceptual tasks, and there are many successes.

Today, some of the world's largest internet companies, as well as the foremost research institutions, are using GPUs for deep learning in research and production.

[Figure label: CUDA for Deep Learning]

PRACTICAL DEEP LEARNING EXAMPLES

Image Classification, Object Detection, Localization, Action Recognition, Scene Understanding

Speech Recognition, Speech Translation, Natural Language Processing

Pedestrian Detection, Traffic Sign Recognition

Breast Cancer Cell Mitosis Detection, Volumetric Brain Image Segmentation

TRADITIONAL MACHINE PERCEPTION

HAND TUNED FEATURES

Raw data -> Feature extraction -> Classifier/detector -> Result

Classifier/detector: SVM, shallow neural net
Classifier/detector: HMM, shallow neural net. Result: Speaker ID, speech transcription
Classifier/detector: Clustering, HMM, LDA, LSA. Result: Topic classification, machine translation, sentiment analysis

DEEP LEARNING APPROACH

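Train:
[Figure: labeled example images (Dog, Cat, Raccoon, Honey badger) pass through the network; errors between predicted and true labels are fed back.]

Deploy:
[Figure: the trained network labels a new image: Dog.]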

SOME DEEP LEARNING USE CASES

Jeff Dean, Google, GTC 2015

ARTIFICIAL NEURAL NETWORK (ANN)

A collection of simple, trainable mathematical units that
collectively learn complex functions
[Figure: a biological neuron beside an artificial neuron with inputs x1, x2, x3, weights w1, w2, w3 and output y. From Stanford cs231n lecture notes.]

y = F(w1·x1 + w2·x2 + w3·x3)
F(x) = max(0, x)
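
The artificial neuron formula above can be sketched in a few lines of Python. This is a minimal illustration rather than code from the slides: the function names `relu` and `neuron`, and the sample weights and inputs, are made up for the example.

```python
def relu(x):
    """F(x) = max(0, x), the activation function given on the slide."""
    return max(0.0, x)


def neuron(weights, inputs):
    """Compute y = F(w1*x1 + w2*x2 + w3*x3) for any number of inputs."""
    weighted_sum = sum(w * x for w, x in zip(weights, inputs))
    return relu(weighted_sum)


# Hypothetical weights and inputs, chosen only for illustration.
weights = [0.5, -1.0, 2.0]

y_neg = neuron(weights, [1.0, 2.0, 0.5])   # 0.5 - 2.0 + 1.0 = -0.5, clipped to 0.0
y_pos = neuron(weights, [1.0, 0.0, 0.5])   # 0.5 + 0.0 + 1.0 = 1.5, passed through

print(y_neg, y_pos)
```

The weights are the trainable part: training adjusts them so that the neuron's output moves closer to the desired result.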

ARTIFICIAL NEURAL NETWORK (ANN)

[Figure: network diagram with an input layer, hidden layers and an output layer.]

Given sufficient training data, an artificial neural network can approximate very complex functions mapping raw data to output decisions.
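
As a rough sketch of how an input layer, a hidden layer and an output layer fit together, the following pure-Python forward pass chains fully connected layers. All names (`dense_layer`, `forward`) and the hand-picked weights are assumptions for illustration; in a real network the weights would be learned from training data.

```python
def relu(x):
    return max(0.0, x)


def identity(x):
    return x


def dense_layer(inputs, weights, biases, activation):
    """One fully connected layer: each unit takes a weighted sum of all inputs."""
    return [
        activation(sum(w * x for w, x in zip(unit_weights, inputs)) + b)
        for unit_weights, b in zip(weights, biases)
    ]


def forward(inputs, layers):
    """Pass the input through each layer in turn: input -> hidden -> output."""
    activations = inputs
    for weights, biases, activation in layers:
        activations = dense_layer(activations, weights, biases, activation)
    return activations


# Hypothetical parameters: 2 inputs, 2 hidden units, 1 output.
layers = [
    ([[1.0, -1.0], [-1.0, 1.0]], [0.0, 0.0], relu),   # hidden layer
    ([[1.0, 1.0]], [0.0], identity),                  # output layer
]

# With these weights the network computes |x1 - x2|.
out_a = forward([3.0, 1.0], layers)   # [2.0]
out_b = forward([1.0, 4.0], layers)   # [3.0]
print(out_a, out_b)
```

Even this tiny network computes a function (the absolute difference) that no single linear unit can represent, which hints at why stacking layers helps.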

DEEP NEURAL NETWORK (DNN)

Input (raw data) -> Low-level features -> Mid-level features -> High-level features -> Result

Application components:

Task objective: e.g. identify face
Training data: 10-100M images
Network architecture: ~10 layers, 1B parameters
Learning algorithm: ~30 Exaflops, ~30 GPU days
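
To get a feel for the training budget quoted above, the arithmetic below converts it into a per-GPU rate. It assumes that "~30 Exaflops" means roughly 30 x 10^18 floating-point operations in total (the slide does not say so explicitly) and takes both figures at face value, so the result is an order-of-magnitude estimate only.

```python
total_operations = 30e18          # ~30 Exaflops, read as total operations (assumption)
gpu_days = 30                     # ~30 GPU days, from the slide
seconds_per_day = 24 * 60 * 60

gpu_seconds = gpu_days * seconds_per_day
sustained_rate = total_operations / gpu_seconds   # operations per second per GPU

print(f"GPU-seconds: {gpu_seconds:,}")
print(f"Implied sustained rate: {sustained_rate / 1e12:.1f} TFLOP/s per GPU")
```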

DEEP LEARNING ADVANTAGES

Robust
No need to design the features ahead of time; features are automatically learned to be optimal for the task at hand
Robustness to natural variations in the data is automatically learned

Generalizable
The same neural net approach can be used for many different applications and data types

Scalable
Performance improves with more data; the method is massively parallelizable

CONVOLUTIONAL NEURAL NETWORK (CNN)

Inspired by the human visual cortex
Learns a hierarchy of visual features
Local pixel level features are scale and translation invariant
Learns the essence of visual objects and generalizes well
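
The translation part of the invariance claim above can be illustrated with a small pure-Python sketch: the same filter slides over the whole image, so a pattern produces the same peak response wherever it appears. The helper names (`conv2d`, `global_max`, `place`) and the 2x2 pattern are hypothetical, and the example does not address scale invariance.

```python
def conv2d(image, kernel):
    """Valid 2D cross-correlation of a 2D list with a smaller 2D kernel."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(
                image[r + i][c + j] * kernel[i][j]
                for i in range(kh)
                for j in range(kw)
            )
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]


def global_max(feature_map):
    """Pool the whole feature map down to its strongest response."""
    return max(max(row) for row in feature_map)


def place(pattern, size, top, left):
    """Draw a small pattern onto a blank size x size image."""
    image = [[0] * size for _ in range(size)]
    for i, row in enumerate(pattern):
        for j, value in enumerate(row):
            image[top + i][left + j] = value
    return image


# Hypothetical 2x2 pattern, used both as the object and as the filter.
pattern = [[1, 1],
           [1, 0]]

image_a = place(pattern, 5, 0, 0)   # pattern in the top-left corner
image_b = place(pattern, 5, 2, 3)   # same pattern, shifted

map_a = conv2d(image_a, pattern)
map_b = conv2d(image_b, pattern)

peak_a = global_max(map_a)   # strongest response in image_a
peak_b = global_max(map_b)   # strongest response in image_b
print(peak_a, peak_b)
```

The location of the peak moves with the pattern (`map_a[0][0]` versus `map_b[2][3]`), while the pooled peak value stays the same. In a trained CNN the filter values are learned rather than chosen by hand.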

CONVOLUTIONAL NEURAL NETWORK (CNN)

RECURRENT NEURAL NETWORK (RNN)

DNNS DOMINATE IN PERCEPTUAL TASKS

Slide credit: Yann Lecun, Facebook & NYU

WHY IS DEEP LEARNING HOT NOW?

Three Driving Factors

Big Data Availability
New DL Techniques
GPU acceleration

Big data examples:
350 million images uploaded per day
2.5 Petabytes of customer data hourly
100 hours of video uploaded every minute

GPUs and Deep Learning

GPUs THE PLATFORM FOR DEEP LEARNING

Image Recognition Challenge
1.2M training images, 1000 object categories

[Chart: GPU Entries per year, 2010 to 2014; y-axis from 0 to 120; one bar is labeled 110.]

[Figure: sample images with object labels: person, car, bird, helmet, frog, motorcycle, dog, chair, hammer, flower pot, power drill.]

GPU-ACCELERATED DEEP LEARNING

GPUS MAKE DEEP LEARNING ACCESSIBLE

              GOOGLE DATACENTER     STANFORD AI LAB
Servers       1,000 CPU servers     3 GPU-accelerated servers
Processors    2,000 CPUs            12 GPUs
Cores         16,000                18,432
Power         600 kWatts            4 kWatts
Cost          $5,000,000            $33,000

"Deep learning with COTS HPC systems", A. Coates, B. Huval, T. Wang, D. Wu, A. Ng, B. Catanzaro, ICML 2013

Headline shown: "Now You Can Build Google's $1M Artificial Brain on the Cheap"
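
The gap between the two systems is easier to appreciate as ratios. The short calculation below uses only the numbers quoted above; the dictionary layout and variable names are made up for the example. Core counts are left out of the comparison because CPU cores and GPU cores are not directly comparable.

```python
# Figures as quoted on the slide.
google = {"servers": 1000, "power_kw": 600, "cost_usd": 5_000_000}
stanford = {"servers": 3, "power_kw": 4, "cost_usd": 33_000}

server_ratio = google["servers"] / stanford["servers"]
power_ratio = google["power_kw"] / stanford["power_kw"]
cost_ratio = google["cost_usd"] / stanford["cost_usd"]

print(f"Servers: {server_ratio:.0f}x fewer")
print(f"Power:   {power_ratio:.0f}x lower")
print(f"Cost:    {cost_ratio:.0f}x lower")
```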

WHY ARE GPUs GOOD FOR DEEP LEARNING?

Neural Networks and GPUs compared on:
Inherently Parallel
Matrix Operations
FLOPS
Bandwidth

GPUs deliver:
- same or better prediction accuracy
- faster results
- smaller footprint
- lower power
- lower cost

GPU ACCELERATION

Training A Deep, Convolutional Neural Network

Batch Size    Training Time CPU    Training Time GPU    GPU Speed Up
64 images     64 s                 7.5 s                8.5X
128 images    124 s                14.5 s               8.5X
256 images    257 s                28.5 s               9.0X

Model:
ILSVRC12 winning model: SuperVision
7 layers: 5 convolutional layers + 2 fully-connected
ReLU, pooling, drop-out, response normalization
Implemented with Caffe
Training time is for 20 iterations

Hardware and libraries:
Dual 10-core Ivy Bridge CPUs
1 Tesla K40 GPU
CPU times utilized Intel MKL BLAS library
GPU acceleration from CUDA matrix libraries (cuBLAS)
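
The speed-up column can be re-derived from the two timing columns. The sketch below does that and also estimates GPU throughput, assuming each of the 20 iterations processes exactly one batch (a common convention, but an assumption here). The tuple layout and variable names are made up for the example.

```python
# (batch size, CPU seconds, GPU seconds, speed-up as printed on the slide)
rows = [
    (64, 64.0, 7.5, 8.5),
    (128, 124.0, 14.5, 8.5),
    (256, 257.0, 28.5, 9.0),
]
iterations = 20  # the slide states that training time is for 20 iterations

results = []
for batch, cpu_s, gpu_s, printed in rows:
    speedup = cpu_s / gpu_s
    gpu_images_per_s = batch * iterations / gpu_s   # assumes one batch per iteration
    results.append((batch, speedup, gpu_images_per_s))
    print(f"batch {batch:>3}: {speedup:.2f}x (slide: {printed}X), "
          f"about {gpu_images_per_s:.0f} images/s on the GPU")
```

The recomputed ratios agree with the printed values to within rounding, and the GPU throughput rises only slightly with batch size across these three rows.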

DL software landscape

HOW TO WRITE APPLICATIONS USING DL

END USER APPLICATIONS: Speech Understanding, Image Analysis, Language Processing

Deep Learning Frameworks (industry standard or research frameworks)

Libraries (key compute-intensive, commonly used building blocks)

System Software (drivers)

Hardware (which can accelerate DL building blocks)

HOW NVIDIA IS HELPING DL STACK

END USER APPLICATIONS: Speech Understanding, Image Analysis, Language Processing

DIGITS

Deep Learning Frameworks: GPU-accelerated DL frameworks (Caffe, Torch, Theano)

Libraries: performance libraries (cuDNN, cuBLAS), highly optimized

System Software: CUDA, best parallel programming toolkit

Hardware: GPU, world's best DL hardware

GPU-ACCELERATED DEEP LEARNING FRAMEWORKS

Framework:      CAFFE | TORCH | THEANO | KALDI
Domain:         Deep Learning Framework | Scientific Computing Framework | Math Expression Compiler | Speech Recognition Toolkit
cuDNN:          2.0 (per-framework cells not recoverable)
Multi-GPU:      via DIGITS 2; In Progress; (nnet2) (column assignment not recoverable)
Multi-CPU:      (nnet2)
License:        BSD-2 | GPL | BSD | Apache 2.0
Interface(s):   Command line, Python, MATLAB | Lua, Python, MATLAB | Python | C++, Shell scripts
Embedded (TK1): (cells not recoverable)

http://developer.nvidia.com/deeplearning
All three frameworks covered in the associated Intro to DL hands-on lab

CUDNN V2 - PERFORMANCE
v3 coming soon

CPU is 16 core Haswell E5-2698 at 2.3 GHz, with 3.6 GHz Turbo
GPU is NVIDIA Titan X

HOW GPU ACCELERATION WORKS

Application Code

Compute-Intensive Functions (5% of code, ~80% of run-time) -> GPU
Rest of Sequential CPU Code -> CPU
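
Because only the compute-intensive functions move to the GPU, the overall gain is bounded by the part that stays on the CPU. The sketch below applies Amdahl's law to the ~80% run-time figure above; Amdahl's law is not mentioned on the slide and is brought in here as a standard way to reason about this split. The function name and the example speed-ups are hypothetical.

```python
def overall_speedup(accelerated_fraction, kernel_speedup):
    """Amdahl's law: only the accelerated fraction of run-time gets faster."""
    remaining = 1.0 - accelerated_fraction
    return 1.0 / (remaining + accelerated_fraction / kernel_speedup)


fraction_on_gpu = 0.80   # ~80% of run-time, from the slide

for s in (2, 10, 100):   # hypothetical speed-ups of the offloaded functions
    total = overall_speedup(fraction_on_gpu, s)
    print(f"functions {s:>3}x faster -> application {total:.2f}x faster")

# Limit when the GPU part takes no time at all.
upper_bound = 1.0 / (1.0 - fraction_on_gpu)
print(f"upper bound: {upper_bound:.1f}x")
```

Under these assumptions the application cannot run more than about 5x faster unless more of its run-time is moved onto the GPU.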

CUDNN ROUTINES
Convolutions - 80-90% of the execution time
Pooling - Spatial smoothing
Activations - Pointwise non-linear function

https://developer.nvidia.com/cudnn
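
To make the pooling and activation routines concrete, here is a pure-Python sketch of what each one computes. It does not use the cuDNN API; the function names and the sample feature map are hypothetical, and max pooling over 2x2 windows is just one common pooling choice.

```python
def relu(feature_map):
    """Activation: apply max(0, x) to every element independently."""
    return [[max(0, v) for v in row] for row in feature_map]


def max_pool_2x2(feature_map):
    """Pooling: keep the largest value in each non-overlapping 2x2 window."""
    return [
        [
            max(
                feature_map[r][c], feature_map[r][c + 1],
                feature_map[r + 1][c], feature_map[r + 1][c + 1],
            )
            for c in range(0, len(feature_map[0]) - 1, 2)
        ]
        for r in range(0, len(feature_map) - 1, 2)
    ]


# Hypothetical 4x4 output of a convolution layer.
feature_map = [
    [1, -2, 0, 3],
    [-1, 4, -5, 2],
    [0, 0, 6, -1],
    [-3, 2, 1, 1],
]

activated = relu(feature_map)        # negatives become 0
pooled = max_pool_2x2(activated)     # 4x4 map reduced to 2x2
print(pooled)
```

Both operations touch each element or window independently, which is part of why they map well onto parallel hardware.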

DIGITS
Interactive Deep Learning GPU Training System

Data Scientists & Researchers:
Quickly design the best deep neural network (DNN) for your data
Visually monitor DNN training quality in real-time
Manage training of many DNNs in parallel on multi-GPU systems

DIGITS 2 - Accelerate training of a single DNN using multiple GPUs

https://developer.nvidia.com/digits

DL deployment

DEEP LEARNING DEPLOYMENT WORKFLOW

DEEP LEARNING LAB SERIES SCHEDULE

7/22 Class #1 - Introduction to Deep Learning
7/29 Office Hours for Class #1
8/5 Class #2 - Getting Started with DIGITS interactive training system for image classification
8/12 Office Hours for Class #2
8/19 Class #3 - Getting Started with the Caffe Framework
8/26 Office Hours for Class #3
9/2 Class #4 - Getting Started with the Theano Framework
9/9 Office Hours for Class #4
9/16 Class #5 - Getting Started with the Torch Framework
9/23 Office Hours for Class #5

More information available at developer.nvidia.com/deep-learning-courses

HANDS-ON LAB
1. Create an account at nvidia.qwiklab.com
2. Go to Introduction to Deep Learning lab at bit.ly/dlnvlab1
3. Start the lab and enjoy!

Only requires a supported browser, no NVIDIA GPU necessary!

Lab is free until end of Deep Learning Lab series
