G.D. GOENKA PUBLIC SCHOOL
Shivpuri Link Road, Gwalior
CLASS – XII Artificial Intelligence
UNIT – 3 MAKING MACHINES SEE
What is Computer Vision?
Computer Vision (CV) is a part of Artificial Intelligence (AI) that helps machines see, understand,
and analyze images and videos—just like humans do.
It allows computers to make decisions or give suggestions based on what they see.
How is it Similar to Human Vision?
Just as humans use their eyes and brain to see and understand, machines use:
o Cameras (like eyes)
o Algorithms & AI models (like the brain)
What Does CV Do?
CV helps machines to:
Detect objects (like cars, faces, or animals)
Classify images (e.g., cat or dog)
Recognize faces
Find defects in products in factories
Monitor roads, buildings, and machines in real time
Why is Computer Vision Useful?
Fast: Much quicker than humans
Accurate: Less chance of error
Works Non-stop: Can run 24/7
Objective: No personal bias
Scalable: Can handle huge amounts of data
Deep Learning in CV:
CV uses deep learning models to become smarter and more accurate.
These models are so advanced that in some tasks (like face recognition), they perform even better than
humans.
Computer Vision is sometimes also called Machine Vision.
WORKING OF COMPUTER VISION
What is Computer Vision?
Computer Vision is a branch of AI that focuses on helping computers understand images and videos.
It processes and analyzes digital images to recognize objects, patterns, or meaning, much as a human
would.
1. Basics of Digital Images
A digital image is a picture stored in a computer using numbers.
It can be created by:
o Drawing in software (like MS Paint or Photoshop)
o Clicking a photo using a digital camera
o Scanning a physical photo
2. Interpretation of Image in Digital Form
When a computer processes an image, it perceives it as a collection of tiny squares known as pixels. Each pixel, short
for "picture element," represents a specific color value. These pixels collectively form the digital image. During the
process of digitization, an image is converted into a grid of pixels. The resolution of the image is determined by the
number of pixels it contains; the higher the resolution, the more detailed the image appears and the more closely it
resembles the original scene.
What Are Pixels?
A pixel (short for "picture element") is the smallest square in a digital image.
Each pixel shows one color.
When combined, thousands or millions of pixels make up the whole image.
How Do Computers Read Images?
Computers don’t “see” images. They read numbers representing each pixel.
The process of turning an image into a grid of pixels is called digitization.
What is Resolution?
Resolution = Number of pixels in an image.
More pixels = clearer and more detailed image.
Example: a 1920 × 1080 image contains 1920 × 1080 = 2,073,600 pixels (about 2 megapixels).
Grayscale (Black & White) Images:
Each pixel has a value from 0 to 255 (see the small example below):
o 0 = Black
o 255 = White
o Numbers in between = Shades of grey
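A minimal sketch of this idea, assuming Python with the NumPy library (an assumption of this example, not part of the notes above), showing a tiny grayscale image stored as a grid of numbers from 0 to 255:

import numpy as np

# A tiny 3 x 3 grayscale "image": each number is one pixel (0 = black, 255 = white)
image = np.array([
    [0,   128, 255],
    [64,  128, 192],
    [255, 128, 0],
], dtype=np.uint8)

print(image.shape)   # (3, 3) -> 3 rows x 3 columns = 9 pixels
print(image[0, 2])   # 255 -> the top-right pixel is pure white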
3.3 COMPUTER VISION – PROCESS
Computer Vision typically follows 5 stages, each explained below:
3.3.1 Image Acquisition
Image acquisition is the first stage of the Computer Vision process, where digital images or videos are
captured.
Images can be taken from:
o Digital cameras
o Scanners
o Design software (e.g., Photoshop)
o Medical equipment like MRI or CT scans
Key Points:
High-resolution devices = Clearer and more detailed images
Lighting and camera angle affect image quality
This stage provides the raw data for the entire Computer Vision system
Examples:
A camera taking a picture of a classroom
An MRI scanner capturing a brain image
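A minimal sketch of image acquisition in code, assuming Python with the OpenCV library (cv2) installed, plus a hypothetical image file and webcam:

import cv2  # OpenCV, assumed to be installed separately

# Acquire an image from a file (hypothetical file name)
image = cv2.imread("classroom.jpg")      # returns a grid of pixel values, or None if the file is missing
if image is not None:
    print(image.shape)                   # (height, width, 3 colour channels)

# Acquire a single frame from the default camera (device 0)
camera = cv2.VideoCapture(0)
ok, frame = camera.read()                # ok is False if no camera is available
camera.release()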
3.3.2 Preprocessing
Preprocessing aims to enhance the quality of the acquired image before it is analyzed by an AI model
(a short code sketch of these steps follows this sub-section).
Common Preprocessing Techniques:
1. Noise Reduction
o Removes unwanted disturbances like blurriness or random spots.
o Example: Cleaning grainy photos taken in the dark
2. Image Normalization
o Standardizes pixel values across images for consistency (e.g., scales 0–255 to 0–1)
o Helps the AI model learn better
3. Resizing/Cropping
o Changes the size or aspect ratio of the image to make it uniform.
o Example: Resize all images to 224×224 pixels.
4. Histogram Equalization
o Adjusts the brightness and contrast of an image.
o Example: Enhances a dark image to show more details
Purpose of Preprocessing:
Clean up images (remove noise)
Highlight important features
Make all images consistent and uniform
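A minimal sketch of these preprocessing steps, assuming Python with OpenCV and NumPy and a hypothetical input file; the exact filter and image sizes are illustrative only:

import cv2
import numpy as np

image = cv2.imread("classroom.jpg")                      # hypothetical input image

# 1. Noise reduction: smooth out random specks with a Gaussian blur
denoised = cv2.GaussianBlur(image, (5, 5), 0)

# 2. Resizing: make every image the same size, e.g. 224 x 224 pixels
resized = cv2.resize(denoised, (224, 224))

# 3. Normalization: rescale pixel values from 0-255 to 0-1 for the AI model
normalized = resized.astype(np.float32) / 255.0

# 4. Histogram equalization: improve brightness and contrast (works on a grayscale copy)
gray = cv2.cvtColor(resized, cv2.COLOR_BGR2GRAY)
equalized = cv2.equalizeHist(gray)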
3.3.3 Feature Extraction
What is it?
Feature Extraction means finding important patterns in an image that help the computer recognize or
understand it.
These features help in identifying objects, textures, colors, etc.
Common Feature Extraction Methods:
o Edge detection identifies the boundaries between different regions in an image where there is a significant
change in intensity.
o Corner detection identifies points where two or more edges meet. These are areas of high curvature in an
image, where sharp changes in image gradients often correspond to corners or junctions of objects.
o Texture analysis extracts features like smoothness, roughness, or repetition in an image
o Colour-based feature extraction quantifies colour distributions within the image, enabling
discrimination between different objects or regions based on their colour characteristics.
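A minimal sketch of classical feature extraction, assuming Python with OpenCV and a hypothetical input file; the thresholds and parameters below are illustrative only:

import cv2
import numpy as np

gray = cv2.imread("classroom.jpg", cv2.IMREAD_GRAYSCALE)   # hypothetical input, read as grayscale

# Edge detection: find boundaries where pixel intensity changes sharply (Canny method)
edges = cv2.Canny(gray, threshold1=100, threshold2=200)

# Corner detection: find points of high curvature where edges meet (Harris method)
corners = cv2.cornerHarris(np.float32(gray), blockSize=2, ksize=3, k=0.04)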
In Deep Learning:
Convolutional Neural Networks (CNNs) automatically extract features during training—no need to
manually define them.
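A minimal sketch of such automatic feature extraction, assuming Python with the PyTorch library; the layer sizes are illustrative and the network is untrained:

import torch
import torch.nn as nn

# A tiny CNN feature extractor: the convolution filters are learned during training,
# so features do not have to be designed by hand
features = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3, padding=1),   # 16 learned filters over an RGB image
    nn.ReLU(),
    nn.MaxPool2d(2),                              # shrink the feature map
    nn.Conv2d(16, 32, kernel_size=3, padding=1),  # 32 deeper filters
    nn.ReLU(),
)

dummy_image = torch.randn(1, 3, 224, 224)         # one fake 224 x 224 RGB image
feature_maps = features(dummy_image)
print(feature_maps.shape)                         # torch.Size([1, 32, 112, 112])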
3.3.4 Detection and Segmentation
Detection and segmentation are fundamental tasks in computer vision, focusing on identifying objects or regions
of interest within an image.
Single Object Tasks:
1. Classification:
o Tells what type of object is in the image.
o Example: Recognizing if the image has a cat or a dog.
2. Classification + Localization:
o Tells the object’s class and its location (using bounding boxes).
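A minimal sketch of single-object classification, assuming Python with the torchvision library (version 0.13 or later) and a hypothetical image file; proper input normalization is skipped for brevity, so the prediction quality would be rough:

import torch
from torchvision.models import resnet18
from torchvision.io import read_image
from torchvision.transforms.functional import convert_image_dtype, resize

model = resnet18(weights="DEFAULT")          # a pre-trained image classifier
model.eval()

image = convert_image_dtype(read_image("pet.jpg"), torch.float)   # hypothetical image
image = resize(image, [224, 224]).unsqueeze(0)                    # a batch containing one image

with torch.no_grad():
    scores = model(image)                    # one score per known class
predicted_class = scores.argmax(dim=1).item()
print(predicted_class)                       # index of the most likely class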
Multiple Object Tasks:
1. Object Detection:
o Finds multiple objects in an image.
o Draws bounding boxes around each one and labels them.
o Popular Algorithms (a short detection sketch in code follows this section):
R-CNN
YOLO (You Only Look Once)
SSD (Single Shot Detector)
2. Image Segmentation:
o Divides an image into regions by classifying each pixel.
o More detailed than object detection.
Types of Segmentation:
Semantic Segmentation:
o Labels all objects of the same type together.
o Example: All dogs in an image are labelled together as one class "dog", without telling one dog from another.
Instance Segmentation:
o Labels each object separately, even if they are of the same type.
o Example: Two dogs in the same image will be identified as Dog 1 and Dog 2.
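A minimal sketch of object detection with a pre-trained detector from the R-CNN family mentioned above, assuming Python with torchvision (version 0.13 or later) and a hypothetical image file:

import torch
from torchvision.models.detection import fasterrcnn_resnet50_fpn
from torchvision.io import read_image
from torchvision.transforms.functional import convert_image_dtype

model = fasterrcnn_resnet50_fpn(weights="DEFAULT")   # pre-trained Faster R-CNN detector
model.eval()

image = convert_image_dtype(read_image("street.jpg"), torch.float)   # hypothetical image

with torch.no_grad():
    predictions = model([image])[0]          # the detector takes a list of images

# Each prediction has bounding boxes, class labels, and confidence scores
for box, label, score in zip(predictions["boxes"], predictions["labels"], predictions["scores"]):
    if score > 0.8:                          # keep only confident detections
        print(label.item(), box.tolist(), round(score.item(), 2))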
3.3.5 High-Level Processing
Purpose:
This is the final stage where the computer understands and makes decisions based on the objects it
detected.
What It Does:
Recognizes objects and scenes
Understands relationships between objects
Analyzes context (e.g., a doctor is in an operating room)
Helps in decision-making for real-life uses like:
o Autonomous vehicles
o Medical diagnostics
o Smart surveillance
Summary: 5 Stages of Computer Vision Process
1. Image Acquisition – Capturing the image
2. Preprocessing – Cleaning and preparing the image
3. Feature Extraction – Identifying patterns and features
4. Detection/Segmentation – Finding and separating objects
5. High-Level Processing – Understanding and decision-making
3.4 Applications of Computer Vision
Computer Vision is already part of many everyday tools. Some key applications include:
1. Facial Recognition
o Used by apps like Facebook to detect and tag faces in photos.
2. Healthcare
o Detects diseases, tumours, or irregularities in medical images (like MRI scans).
3. Self-Driving Cars
o Helps cars understand surroundings, detect traffic signs, people, and other vehicles.
4. OCR (Optical Character Recognition)
o Converts images of text (printed or handwritten) into editable digital text.
5. Machine Inspection
o Detects faults or defects in manufactured products during quality checks.
6. 3D Model Building
o Builds 3D models from real-world objects; used in robotics, gaming, AR/VR.
7. Surveillance
o CCTV cameras analyze videos to spot suspicious behavior and ensure safety.
8. Fingerprint & Biometric Recognition
o Verifies user identity using fingerprint scans or facial features.
3.5 Challenges of Computer Vision
Even though CV is powerful, it faces several difficulties:
1. Reasoning and Interpretation
o CV must not just see but understand images, which requires complex logic and reasoning.
2. Image Acquisition Issues
o Factors like poor lighting, different camera angles, and crowded scenes make image capture
difficult.
3. Privacy and Security Concerns
o CV systems (like face recognition) can raise privacy issues and are often debated.
4. False or Duplicate Content
o Fake images/videos or data breaches can fool CV systems, leading to misinformation or security
risks.
3.6 The Future of Computer Vision
CV has grown from simple tasks to advanced systems that mimic human-level understanding.
Deep learning and large datasets have made this possible.
In the future, we may see:
o Smart healthcare tools that detect diseases early
o Immersive AR/VR experiences
o More intelligent, safe, and helpful AI tools
Vision Ahead:
If used ethically and with innovation, Computer Vision will positively transform industries and lives worldwide.