The Deep Learning Revolution (2010s)
2012: Alex Krizhevsky, Ilya Sutskever, and Geoffrey Hinton achieve a significant breakthrough with the
AlexNet model, which wins the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) by a
large margin. AlexNet's success demonstrates the power of deep convolutional neural networks
(CNNs) and GPU acceleration for training deep models.
2013: Word2Vec, developed by Tomas Mikolov and colleagues at Google, introduces a new approach
to learning word embeddings, significantly advancing natural language processing (NLP) tasks.
2014: Ian Goodfellow and his colleagues introduce Generative Adversarial Networks (GANs), a
novel approach to generating realistic data through adversarial training.
2014: The Deep Q-Network (DQN), developed by DeepMind, achieves human-level performance in
playing Atari games, showcasing the potential of deep reinforcement learning.
2015: ResNet (Residual Networks), developed by Kaiming He and colleagues at Microsoft Research,
wins the ILSVRC by a significant margin. ResNet's introduction of skip connections helps in training
very deep networks.
Success in Handwriting Recognition
In 2009, Graves et al. outperformed all entries in an international Arabic handwriting recognition competition.
Success in Speech Recognition
Dahl et al. showed relative error reductions of 16.0% and 23.2% over the state-of-the-art system.
New Record on MNIST
Ciresan et al. set a new record on the MNIST dataset in 2010 using good old backpropagation on GPUs (GPUs enter the scene).
First Superhuman Visual Pattern Recognition
D. C. Ciresan et al. achieved a 0.56% error rate in the IJCNN Traffic Sign Recognition Competition.
Winning more Visual Recognition Challenges
From Cats to Convolutional Neural Networks
Hubel and Wiesel Experiment
Experimentally showed that each neuron has a fixed receptive field, i.e., a neuron fires only in response to a visual stimulus in a specific region of the visual field.
Neocognitron
Used for handwritten character recognition and pattern recognition (Fukushima et al.)
Convolutional Neural Network
Handwritten digit recognition using backpropagation over a convolutional neural network (LeCun et al.)
LeNet-5
Introduced the (now famous) MNIST dataset
(LeCun et al.)
An algorithm inspired by an experiment on
cats is today used to detect cats in videos :-)
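To make the local-receptive-field idea that runs from Hubel and Wiesel to CNNs concrete, here is a minimal sketch (plain NumPy, toy sizes and a made-up filter, not code from any of the cited papers): each output unit looks only at a small patch of the input, and the same weights are reused at every location.

import numpy as np

def conv2d_valid(image, kernel):
    # Slide `kernel` over `image` (no padding, stride 1).
    # Each output value depends only on a local patch of the input,
    # mirroring the "local receptive field" idea.
    H, W = image.shape
    kH, kW = kernel.shape
    out = np.zeros((H - kH + 1, W - kW + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            patch = image[i:i + kH, j:j + kW]   # local receptive field
            out[i, j] = np.sum(patch * kernel)  # same weights at every location
    return out

image = np.random.rand(8, 8)                    # toy 8x8 "image"
edge_filter = np.array([[1.0, -1.0],            # toy 2x2 filter
                        [1.0, -1.0]])
print(conv2d_valid(image, edge_filter).shape)   # (7, 7) feature map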
Better Optimization Methods
Faster convergence and better accuracies (e.g., momentum-based and adaptive-learning-rate methods).
The Curious Case of Sequences
Sequences:
● They are everywhere.
● Time series, speech, music, text, video
● Each unit in a sequence interacts with the other units
● We need models that capture these interactions
Hopfield Network
Content-addressable memory systems for
storing and retrieving patterns.
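A minimal sketch of the content-addressable-memory idea (plain NumPy with a toy pattern; it uses synchronous sign updates as a simplification of Hopfield's asynchronous rule and is not code from the original paper): patterns are stored in the weights via a Hebbian rule, and a corrupted pattern is retrieved by repeatedly updating the units.

import numpy as np

def train_hopfield(patterns):
    # Hebbian weight matrix for +/-1 patterns; no self-connections.
    n = patterns.shape[1]
    W = patterns.T @ patterns / n
    np.fill_diagonal(W, 0.0)
    return W

def recall(W, state, steps=10):
    # Synchronous sign updates until the state (hopefully) settles.
    for _ in range(steps):
        state = np.where(W @ state >= 0, 1, -1)
    return state

pattern = np.array([1, -1, 1, 1, -1, -1, 1, -1])   # one stored toy pattern
W = train_hopfield(pattern[None, :])
probe = pattern.copy()
probe[:2] *= -1                                    # corrupt two bits
print(recall(W, probe))                            # usually recovers `pattern`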
Jordan Network
The output of each time step is fed back to the next time step, thereby allowing interactions between time steps in the sequence.
Elman Network
The hidden state of each time step is fed to the next time step, thereby allowing interactions between time steps in the sequence.
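A minimal NumPy sketch of the two recurrence styles described above (toy dimensions and random weights, purely illustrative): an Elman step feeds the previous hidden state into the current step, while a Jordan step feeds back the previous output instead.

import numpy as np

rng = np.random.default_rng(0)
d_in, d_h, d_out = 3, 4, 2                      # toy sizes
W_xh = rng.normal(size=(d_h, d_in))             # input  -> hidden
W_hh = rng.normal(size=(d_h, d_h))              # hidden -> hidden (Elman recurrence)
W_yh = rng.normal(size=(d_h, d_out))            # output -> hidden (Jordan recurrence)
W_hy = rng.normal(size=(d_out, d_h))            # hidden -> output

def elman_step(x_t, h_prev):
    # Hidden state of the previous step feeds the current step.
    h_t = np.tanh(W_xh @ x_t + W_hh @ h_prev)
    return h_t, W_hy @ h_t

def jordan_step(x_t, y_prev):
    # Output of the previous step feeds the current step.
    h_t = np.tanh(W_xh @ x_t + W_yh @ y_prev)
    return h_t, W_hy @ h_t

xs = rng.normal(size=(5, d_in))                 # a length-5 toy sequence

h, y = np.zeros(d_h), np.zeros(d_out)
for x_t in xs:
    h, y = elman_step(x_t, h)                   # Elman: carry the hidden state forward
print("Elman final hidden state:", h)

h, y = np.zeros(d_h), np.zeros(d_out)
for x_t in xs:
    h, y = jordan_step(x_t, y)                  # Jordan: carry the output forward
print("Jordan final hidden state:", h)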
Drawbacks of RNNs
Hochreiter et al. and Bengio et al. showed the difficulty of training RNNs (the problem of exploding and vanishing gradients).
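A rough NumPy illustration of why this happens (toy numbers, not taken from either paper): backpropagation through time multiplies one recurrent Jacobian per time step, so gradient norms shrink or blow up roughly geometrically with sequence length, depending on the scale of the recurrent weights.

import numpy as np

rng = np.random.default_rng(0)

def grad_norm_through_time(scale, T=50, d=16):
    # Norm of a product of T recurrent Jacobians (linear RNN for simplicity).
    W = scale * rng.normal(size=(d, d)) / np.sqrt(d)   # recurrent weight matrix
    g = np.ones(d)                                     # gradient at the last time step
    for _ in range(T):
        g = W.T @ g                                    # one backprop-through-time step
    return np.linalg.norm(g)

print(grad_norm_through_time(scale=0.5))   # tiny  -> vanishing gradients
print(grad_norm_through_time(scale=2.0))   # huge  -> exploding gradients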
Long Short-Term Memory
Hochreiter and Schmidhuber showed that LSTMs can solve complex long-time-lag tasks that could never be solved before.
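To give a feel for why the architecture helps, here is a minimal LSTM step in NumPy (toy dimensions, random weights, biases omitted; a sketch of the now-standard gated formulation rather than the exact equations of the original paper): gates control what is written to and read from a cell state that is updated additively, giving gradients a more direct path across many time steps.

import numpy as np

rng = np.random.default_rng(0)
d_in, d_h = 3, 4                                  # toy sizes
Wf, Wi, Wo, Wc = (rng.normal(size=(d_h, d_in + d_h)) for _ in range(4))

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev):
    # One LSTM step (biases omitted for brevity).
    z = np.concatenate([x_t, h_prev])
    f = sigmoid(Wf @ z)                           # forget gate
    i = sigmoid(Wi @ z)                           # input gate
    o = sigmoid(Wo @ z)                           # output gate
    c_tilde = np.tanh(Wc @ z)                     # candidate cell content
    c_t = f * c_prev + i * c_tilde                # additive cell-state update
    h_t = o * np.tanh(c_t)
    return h_t, c_t

h, c = np.zeros(d_h), np.zeros(d_h)
for x_t in rng.normal(size=(5, d_in)):            # a length-5 toy sequence
    h, c = lstm_step(x_t, h, c)
print(h, c)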
Sequence to Sequence Learning
● Initial success in using RNNs/LSTMs for large-scale sequence-to-sequence learning problems
● Introduction of attention, which inspired a great deal of research over the following years
Attention Is All You Need: Transformers
● Introduced by Ashish Vaswani et
al. in 2017, transformers leverage
self-attention mechanisms to
process sequences more
effectively than traditional RNNs.
● A Breakthrough in Natural
Language Processing (NLP).
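A minimal NumPy sketch of the single-head scaled dot-product self-attention at the core of the transformer (random toy weights, no masking, no multiple heads): every position attends to every other position directly, rather than passing information step by step as in an RNN.

import numpy as np

rng = np.random.default_rng(0)
T, d_model, d_k = 5, 8, 4                          # toy sequence length and sizes
X = rng.normal(size=(T, d_model))                  # one toy input sequence
Wq, Wk, Wv = (rng.normal(size=(d_model, d_k)) for _ in range(3))

def self_attention(X):
    # Single-head scaled dot-product self-attention.
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    scores = Q @ K.T / np.sqrt(d_k)                # all pairs of positions interact
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True) # row-wise softmax
    return weights @ V                             # weighted mix of value vectors

print(self_attention(X).shape)                     # (5, 4): one output per position

The 1/sqrt(d_k) scaling keeps the dot products from growing with dimension, so the softmax does not saturate; the full transformer stacks several such heads with feed-forward layers and residual connections.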
The Deep Learning Revolution (2010s–2020s)
2018: OpenAI releases GPT (Generative Pre-trained Transformer), setting new benchmarks in NLP
tasks with its ability to generate coherent and contextually relevant text.
2020: OpenAI releases GPT-3, a language model with 175 billion parameters, pushing the boundaries
of what is possible with NLP and generating significant public interest and debate about the future of
AI.
2021: DeepMind's AlphaFold achieves a breakthrough in protein structure prediction, demonstrating
the impact of deep learning on scientific discovery.
2022: The DALL·E 2 and Stable Diffusion models showcase the ability of deep learning models to
generate high-quality images from textual descriptions, revolutionizing the field of generative art and
creative AI.
2023: Google Research introduces PaLM (Pathways Language Model), a large-scale language
model designed to improve understanding and reasoning across multiple languages and tasks.