0% found this document useful (0 votes)

60 views50 pages

Advanced Computer Vision Course

Lecture notes of CV801

Uploaded by

Abrham Gebreselasie

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

60 views50 pages

Advanced Computer Vision Course

Lecture notes of CV801

Uploaded by

Abrham Gebreselasie

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

CV801: Advanced Computer Vision

Week 1 Lecture 2
Class Participation and Peer-Review (10% Weightage)
Class-participation: 5%
• In-person Attendance: 3%.
• Full mark: In-person attendance in 18 out of 30 lectures AND 7 out of 15 labs

• Reading research papers in advance, and providing correct answers for the in-class room Quizzes-2%

Peer Review: 5%
• Participate in the discussions related to project presentations and paper presentations of other
students: 1%
• 1-page review report on Projects of other groups ( Each person write two peer-review report): 4%

2
Introduction and Overview of Computer Vision
What is Computer Vision?

• Ability of computers
• To understand visual data
• For example, images, videos…

• Automate tasks
• Which human visual system can perform
What is Computer Vision?
• To extract “meaning” from pixels. To bridge the gap between image pixels and
“meaning” (semantic)!

What we see!
What computer sees!
What do we have here?

Seems easy ……..

Wrong! Vision is Hard
• Vision is an amazing feature of natural intelligence
• Around 50% of neural tissues of human brain is directly or indirectly
related to vision, which assists in visual learning.

Hardware perspective:
Is that a Massive digital data collections
queen or a
bishop?
Why Study Computer Vision?
• Engineering point of view - Computer Vision helps to solve many
practical problems: business potential
• Scientific point of view - Human kind of visual system is one of
the grand challenges of Artificial Intelligence (AI)
AI itself is a grand challenge of computing
• Massive visual data on internet

More than 70 million photos are shared on Instagram every day (more than 50 billion photos in total)

300 million images a day (More than 350 billion photos in total)

Business potential Substantial Commercial Interest

• Google
• Meta AI/Facebook
• Apple

Autonomous Driving Security Computer vision

Health
technology can
improve our lives

Biometric Access Comfort: Robot Fun: Virtual Avatar

Why Study Computer Vision?

12
Why Study Computer Vision?

• CVPR conference ranking (Engineering) as of 2024

13
Why Study Computer Vision?
• CVPR papers
2023 2024
Why Study Computer Vision?
Substantial Commercial Interest

List of CVPR 2022 sponsors

CV801 Topics vs Major topics in CVPR 2023

• Covering 8 Out of 12 top CVPR 2023 topics

• Covering ~12 topics

16
Acceptance Rate for Each Topic: CVPR 2024

17
Common Computer Vision Tasks

18
Common Computer Vision Tasks
Image Categorization/Recognition:

CAT
Common Computer Vision Tasks

Scene Recognition:
Is this an outdoor image?
21
Activity Recognition

Activity:
What is this person doing in this image?
Common Computer Vision Tasks: Detection

Detection:
Where is a car in this image?
Common Computer Vision Tasks: Detection

24
Semantic Segmentation

GRASS, CAT, TREE, SKY

25
Instance Segmentation

DOG, DOG, CAT

26
Common Computer Vision Tasks: Segmentation

Semantic Object Instance

Classification
Segmentation Detection Segmentation

CAT GRASS, CAT, TREE, DOG, DOG, CAT DOG, DOG, CAT
SKY

No spatial extent No objects, just pixels Multiple Objects

Video Instance Segmentation

28
Research Paper Presentations (10% Weightage)
Objective
• Learn to systematically introduce a research topic
• Improve teaching and presentation skills
• Involve in critical discussions about research papers
How to Select a Topic?
• Suggested topics.
• Specialized Applications of Segmentation: Eg. medical image segmentation (~3 presentations)
• Vision Foundation Models: Segment Anything Model (SAM) (~2 presentations)
• Efficient Architectures for Computer Vision Applications: State-space Models and Mamba (~4 presentations)
• Conversational LLMs and Vision-Language Models (~2 presentations)
• Image Generation using Diffusion Models (~5 presentations)
• Remote sensing, change detection (~2 presentations)
• Human-centric Vision (~2 presentations)
• All presenters on the same topic should work together to systematically introduce the concepts.

29
Specialized Applications of Segmentation: 3D Medical Image segmentation

UNETR: Transformers for 3D Medical Image Segmentation, WACV 2022

30
Remote Sensing Change Detection

Change Detection Methods for Remote Sensing in the Last Decade: A Comprehensive Review.
[Link]

34
Foundation Models in Vision

Foundational Models Defining a New Era in Vision: A Survey and Outlook

38
[Link]
Generalizable Localization Models
Segment Anything Model (SAM- [Link]
SAM for Synthetic Embryo Detection, Counting and Segmentation
(without training the model on target dataset or target category)

Embryo detection & counting Segmentation

Input Count=307
39
Large Language Models

40
Multi-Model LLMs
[Link]
Multi-Model LLMs
Image Generation Using Diffusion Models
Diffusion Models in Vision: A Survey [Link]

“A diffusion model is a deep generative model that is based on two stages, a forward diffusion stage and
a reverse diffusion stage. In the forward diffusion stage, the input data is gradually perturbed over
several steps by adding Gaussian noise. In the reverse stage, a model is tasked at recovering the original
input data by learning to gradually reverse the diffusion process, step by step “

Forward

Reverse
Image Generation (i)

1. Diffusion Models 2. Multi Model LLM Meets Diffusion Models

Eg: For Person Image Synthesis, CVPR 2023

[Link]
Image Generation (ii)

3. 3D-aware Image Generation 4. Image Generation for Healthcare Applications

ICCV 2023 MICCAI 2023

[Link]
Human-centric Scene Understanding

Example: Pedestrian detection, Multi-camera person search, Crowd counting, Pose estimation, Activity
recognition

Pedestrian Detection Person Search Crowd Counting Human Pose Estimation

[Link]
ARCHITECTURE DESIGN CHOICES FOR
REAL-WORLD VISION APPLICATIONS
• Development of Efficient network architectures
For image classification, object detection, segmentation
and human pose estimation in images and videos.

Vision Mamba

• Mamba for Medical Image Segmentation

[Link]
Questions?
Survey Outcome
Expected Deep learning and CNN backgrounds

• Perceptron. • Regularization

• Multi-layer Perceptron • Dropout

• Backpropagation • Data Augmentation

• Stochastic gradient descent. • Batch normalization

• Cross entropy loss

• CNN layer
58
Summary
• Course Overview
• Introduction and Overview of Computer Vision
• Common Computer Vision tasks

[Link]

CV #1 Course Introduction-1
No ratings yet
CV #1 Course Introduction-1
61 pages
ComputerVision Intro
No ratings yet
ComputerVision Intro
50 pages
Making Machines See Class 12 Notes
No ratings yet
Making Machines See Class 12 Notes
6 pages
Computer Vision Research Document
No ratings yet
Computer Vision Research Document
3 pages
Computer Vision 2011
100% (1)
Computer Vision 2011
103 pages
CV SVD L01 P1 Intro
No ratings yet
CV SVD L01 P1 Intro
35 pages
CS7.505: Computer Vision: Spring 2022
No ratings yet
CS7.505: Computer Vision: Spring 2022
46 pages
LectureNotes PDF
No ratings yet
LectureNotes PDF
212 pages
Cv2021-Lec1-Introduction 1600 PDF - Gdrive.vip
No ratings yet
Cv2021-Lec1-Introduction 1600 PDF - Gdrive.vip
61 pages
Unit 1
No ratings yet
Unit 1
186 pages
Lec 1 - 2
No ratings yet
Lec 1 - 2
39 pages
Computer Vision: In-Depth Overview
No ratings yet
Computer Vision: In-Depth Overview
5 pages
Computer Vision Intro
No ratings yet
Computer Vision Intro
2 pages
Computer Vision ch1
No ratings yet
Computer Vision ch1
80 pages
Abhijith Vision
No ratings yet
Abhijith Vision
17 pages
Ilovepdf Merged Compressed
No ratings yet
Ilovepdf Merged Compressed
1,100 pages
01 - Introduction
No ratings yet
01 - Introduction
37 pages
Intro
No ratings yet
Intro
66 pages
8394 Making Machines See
No ratings yet
8394 Making Machines See
50 pages
CV Digital Notes
No ratings yet
CV Digital Notes
77 pages
CompVisNotes PDF
No ratings yet
CompVisNotes PDF
115 pages
Computer Vision
No ratings yet
Computer Vision
6 pages
Lecture 1
100% (1)
Lecture 1
21 pages
Intro to Computer Vision Course
No ratings yet
Intro to Computer Vision Course
76 pages
Computer Vision: Key Concepts & Tasks
No ratings yet
Computer Vision: Key Concepts & Tasks
4 pages
Computer Visiondk
No ratings yet
Computer Visiondk
12 pages
CXVXFV
No ratings yet
CXVXFV
12 pages
Computer Vision Image Processing Answers
No ratings yet
Computer Vision Image Processing Answers
3 pages
CV Notes
No ratings yet
CV Notes
75 pages
Computer Vision Revision Notes
No ratings yet
Computer Vision Revision Notes
2 pages
Lec01 Intro
No ratings yet
Lec01 Intro
55 pages
PDF Joiner
No ratings yet
PDF Joiner
38 pages
Computer Vision Presentation
No ratings yet
Computer Vision Presentation
10 pages
Chapter 1 - Introduction To CV
No ratings yet
Chapter 1 - Introduction To CV
49 pages
A Comprehensive Guide To Computer Vision
No ratings yet
A Comprehensive Guide To Computer Vision
6 pages
Computer Vision SM-1
No ratings yet
Computer Vision SM-1
26 pages
CV 01 Introduction
No ratings yet
CV 01 Introduction
14 pages
Lec00 Intro For Web
No ratings yet
Lec00 Intro For Web
81 pages
Computer Vision for Beginners
No ratings yet
Computer Vision for Beginners
26 pages
Format of 1st Page - Seminar
No ratings yet
Format of 1st Page - Seminar
3 pages
Group 17 Computer Vision @Lcd-1
No ratings yet
Group 17 Computer Vision @Lcd-1
25 pages
1a. Introduction
No ratings yet
1a. Introduction
32 pages
Unit - 2 Computer Vision
No ratings yet
Unit - 2 Computer Vision
27 pages
CV - Lecture 1 - Iintroduction
No ratings yet
CV - Lecture 1 - Iintroduction
24 pages
What Is Computer Vision in 2025? A Beginners Guide: Artificial Intelligence
No ratings yet
What Is Computer Vision in 2025? A Beginners Guide: Artificial Intelligence
48 pages
Lecture Notes
No ratings yet
Lecture Notes
144 pages
grp3 Computervision
No ratings yet
grp3 Computervision
28 pages
Applications of Computer Vision in AI
No ratings yet
Applications of Computer Vision in AI
30 pages
Computer Vision Seminar Santosh
No ratings yet
Computer Vision Seminar Santosh
10 pages
CV Unit 1 Overview of Computer Vison and Application
No ratings yet
CV Unit 1 Overview of Computer Vison and Application
51 pages
Technologies 12 00015
No ratings yet
Technologies 12 00015
40 pages
UNIT-I - Introduction To Computer Vision
No ratings yet
UNIT-I - Introduction To Computer Vision
45 pages
Computer Vision Seminar Report 2023
No ratings yet
Computer Vision Seminar Report 2023
37 pages
CV Unit 1
No ratings yet
CV Unit 1
17 pages
Lec 1
No ratings yet
Lec 1
51 pages
Consumer Behavior Study Guide
No ratings yet
Consumer Behavior Study Guide
2 pages
2024PGP186 R1 Task3
No ratings yet
2024PGP186 R1 Task3
3 pages
Grade 11 Earth Science Lesson Plan
No ratings yet
Grade 11 Earth Science Lesson Plan
4 pages
NTS Guidelines - PR 2018
No ratings yet
NTS Guidelines - PR 2018
36 pages
HassHeilWeir ThomasCalculusEarlyTranscendentals 15E
No ratings yet
HassHeilWeir ThomasCalculusEarlyTranscendentals 15E
1 page
Report Card-Class 1
No ratings yet
Report Card-Class 1
44 pages
T1.1 Branches of Biology
No ratings yet
T1.1 Branches of Biology
3 pages
Geography of Ethiopia and the Horn
No ratings yet
Geography of Ethiopia and the Horn
316 pages
Dri Resume 3
No ratings yet
Dri Resume 3
19 pages
Benazir Taleemi Wazaif Enrollment Guide
No ratings yet
Benazir Taleemi Wazaif Enrollment Guide
2 pages
Forms For Athletics
100% (1)
Forms For Athletics
12 pages
CSIR-UGC NET Information Bulletin 2012
No ratings yet
CSIR-UGC NET Information Bulletin 2012
46 pages
Immediate Download Proceedings of International Ethical Hacking Conference 2019 eHaCON 2019 Kolkata India Mohuya Chakraborty Ebooks 2024
100% (9)
Immediate Download Proceedings of International Ethical Hacking Conference 2019 eHaCON 2019 Kolkata India Mohuya Chakraborty Ebooks 2024
62 pages
Upper Secondary Academy V Map GSE 32 40 Places and Home
No ratings yet
Upper Secondary Academy V Map GSE 32 40 Places and Home
1 page
Co-Teaching Lesson Plan
No ratings yet
Co-Teaching Lesson Plan
5 pages
First-Conditional 17310
No ratings yet
First-Conditional 17310
3 pages
Vignan Univ-ECE R16 Syllabus
100% (1)
Vignan Univ-ECE R16 Syllabus
188 pages
Thermodynamics
No ratings yet
Thermodynamics
22 pages
Benefits of Voluntary Work for Teens
No ratings yet
Benefits of Voluntary Work for Teens
5 pages
K Quantum Computing
No ratings yet
K Quantum Computing
14 pages
Annexure J1 Pre-Moderation
No ratings yet
Annexure J1 Pre-Moderation
2 pages
Soal Ulangan Harian Bahasa Inggris XII
No ratings yet
Soal Ulangan Harian Bahasa Inggris XII
3 pages
Essential Visa Interview Questions for Students
100% (1)
Essential Visa Interview Questions for Students
14 pages
Understanding Data Analytics Types
No ratings yet
Understanding Data Analytics Types
6 pages
Karen Letter of Recommendation
No ratings yet
Karen Letter of Recommendation
1 page
LP 4 Sentence Outline
No ratings yet
LP 4 Sentence Outline
4 pages
Linguistics Is The Science of Language
No ratings yet
Linguistics Is The Science of Language
225 pages
Transport Nagar 1100
No ratings yet
Transport Nagar 1100
25 pages
Borang PLBS
No ratings yet
Borang PLBS
25 pages
Graduate Physical Therapy Transcript
No ratings yet
Graduate Physical Therapy Transcript
2 pages