0% found this document useful (0 votes)

29 views2 pages

Python Project

The document describes a Python project that uses OpenCV and Tesseract OCR to extract text from an image file. It imports required packages, reads an image, performs preprocessing like grayscale conversion and thresholding, finds contours, crops text blocks and applies OCR to recognize the text.

Uploaded by

study.aaaashishhh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views2 pages

Python Project

Uploaded by

study.aaaashishhh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

PYTHON PROJECT

CODE :

i# Import required packages

import cv2
import pytesseract

# Mention the installed location of Tesseract-OCR in your system

[Link].tesseract_cmd = '/opt/homebrew/bin/tesseract'

# Read image from which text needs to be extracted

img = [Link]("[Link]")

# Preprocessing the image starts

# Convert the image to gray scale

gray = [Link](img, cv2.COLOR_BGR2GRAY)

# Performing OTSU threshold

ret, thresh1 = [Link](gray, 0, 255, cv2.THRESH_OTSU | cv2.THRESH_BINARY_INV)

# Specify structure shape and kernel size.

# Kernel size increases or decreases the area
# of the rectangle to be detected.
# A smaller value like (10, 10) will detect
# each word instead of a sentence.
rect_kernel = [Link](cv2.MORPH_RECT, (18, 18))

# Applying dilation on the threshold image

dilation = [Link](thresh1, rect_kernel, iterations = 1)

# Finding contours
contours, hierarchy = cv2. ndContours(dilation, cv2.RETR_EXTERNAL,
cv2.CHAIN_APPROX_NONE)

# Creating a copy of image

im2 = [Link]()

# A text le is created and ushed

le = open("[Link]", "w+")
[Link]("")
[Link]()

# Looping through the identi ed contours

# Then rectangular part is cropped and passed on
# to pytesseract for extracting text from it
# Extracted text is then written into the text le
for cnt in contours:
x, y, w, h = [Link](cnt)

# Drawing a rectangle on copied image

rect = [Link](im2, (x, y), (x + w, y + h), (0, 255, 0), 2)

# Cropping the text block for giving input to OCR

cropped = im2[y:y + h, x:x + w]

# Open the le in append mode

fi
fi
fi
fi
fi
fi
fl
fi
fi
le = open("[Link]", "a")

# Apply OCR on the cropped image

text = pytesseract.image_to_string(cropped)

# Appending the text into le

[Link](text)
[Link]("\n")

# Close the le
[Link]

OUTPUT :

IMAGE FILE :

RECOGNIZED TEXT :
fi
fi
fi
fi
fi
fi

Word Extraction-1
No ratings yet
Word Extraction-1
2 pages
Module # 10C - Text Recognition With Tesseract OCR
No ratings yet
Module # 10C - Text Recognition With Tesseract OCR
8 pages
Written Notes
No ratings yet
Written Notes
5 pages
Preprocessing Task
No ratings yet
Preprocessing Task
7 pages
Python OCR Tool for Developers
No ratings yet
Python OCR Tool for Developers
5 pages
OCR Implementation Guide
No ratings yet
OCR Implementation Guide
2 pages
Ahsbsdns
No ratings yet
Ahsbsdns
1 page
We Used Tesseract OCR For Train The Data and Recognize The Character From Digital Image Under The Apache 2
No ratings yet
We Used Tesseract OCR For Train The Data and Recognize The Character From Digital Image Under The Apache 2
1 page
Python CAPTCHA Breaking with OCR
No ratings yet
Python CAPTCHA Breaking with OCR
4 pages
Tesseract OCR Setup and Usage
No ratings yet
Tesseract OCR Setup and Usage
15 pages
Extracting Text From Scanned PDF Using Pytesseract & Open CV
No ratings yet
Extracting Text From Scanned PDF Using Pytesseract & Open CV
9 pages
Ocr Nanonets Tesseract
No ratings yet
Ocr Nanonets Tesseract
39 pages
C) Le Script But Not Complet Partie 1
No ratings yet
C) Le Script But Not Complet Partie 1
13 pages
F) Maybe Is Full Script Complet
No ratings yet
F) Maybe Is Full Script Complet
35 pages
Simple Python OCR Server Setup
No ratings yet
Simple Python OCR Server Setup
8 pages
OpenCV OCR and Text Recognition With Tesseract - PyImageSearch
No ratings yet
OpenCV OCR and Text Recognition With Tesseract - PyImageSearch
65 pages
Python Tesseract
No ratings yet
Python Tesseract
2 pages
Code Snippets
No ratings yet
Code Snippets
2 pages
Optical Character Recognition by Open Source OCR Tool Tesseract A Case Study
No ratings yet
Optical Character Recognition by Open Source OCR Tool Tesseract A Case Study
7 pages
Remove Text from Images with CV2 & Keras
No ratings yet
Remove Text from Images with CV2 & Keras
18 pages
ML Report
No ratings yet
ML Report
5 pages
LẬP TRÌNH XỬ LÝ ẢNH
No ratings yet
LẬP TRÌNH XỬ LÝ ẢNH
8 pages
Build Your Own Optical Character Recognition (Ocr) System Using Google'S Tesseract and Opencv
No ratings yet
Build Your Own Optical Character Recognition (Ocr) System Using Google'S Tesseract and Opencv
10 pages
Ocr
No ratings yet
Ocr
4 pages
(2022-MM) SPTS Single-Point Text Spotting
No ratings yet
(2022-MM) SPTS Single-Point Text Spotting
12 pages
Handwritten Text Recognition Guide
No ratings yet
Handwritten Text Recognition Guide
5 pages
License Plate Detection with OpenCV
No ratings yet
License Plate Detection with OpenCV
2 pages
Text Detection in Road Signs Using OCR
No ratings yet
Text Detection in Road Signs Using OCR
3 pages
Tesseract OCR: A Comprehensive Study
No ratings yet
Tesseract OCR: A Comprehensive Study
12 pages
OCR Techniques and Python Implementation
No ratings yet
OCR Techniques and Python Implementation
110 pages
98DSP
No ratings yet
98DSP
8 pages
Approach 4
No ratings yet
Approach 4
3 pages
Raspberry Pi License Plate System
No ratings yet
Raspberry Pi License Plate System
21 pages
Image Text Extraction Guide
No ratings yet
Image Text Extraction Guide
20 pages
Text Extraction From Image: Team Members CH - Suneetha (19mcmb22) Mohit Sharma (19mcmb13)
No ratings yet
Text Extraction From Image: Team Members CH - Suneetha (19mcmb22) Mohit Sharma (19mcmb13)
20 pages
AI Advantage and Disadvantage 1
No ratings yet
AI Advantage and Disadvantage 1
14 pages
Step by Step Process
No ratings yet
Step by Step Process
8 pages
CV Lab Manual
No ratings yet
CV Lab Manual
45 pages
Exp 3
No ratings yet
Exp 3
21 pages
Optical Character Recognition Overview
No ratings yet
Optical Character Recognition Overview
6 pages
Tesseract Ocr
No ratings yet
Tesseract Ocr
3 pages
Numerical & Symbolic Computing Lab 03
No ratings yet
Numerical & Symbolic Computing Lab 03
9 pages
Refined Shape
No ratings yet
Refined Shape
2 pages
Tesseract OCR Engine Overview
No ratings yet
Tesseract OCR Engine Overview
15 pages
OCR App Development Guide
No ratings yet
OCR App Development Guide
12 pages
Ip Lab Programs
No ratings yet
Ip Lab Programs
34 pages
Programs 8,11,12
No ratings yet
Programs 8,11,12
5 pages
Tesseract I CD Ar 2007
No ratings yet
Tesseract I CD Ar 2007
5 pages
Handwritten Text Recognition with TensorFlow
No ratings yet
Handwritten Text Recognition with TensorFlow
37 pages
Remove Face Rectangle in OpenCV
0% (1)
Remove Face Rectangle in OpenCV
2 pages
Updated Code That Flags Faulty Jpgs
No ratings yet
Updated Code That Flags Faulty Jpgs
3 pages
OpenCV Drawing Functions Overview
No ratings yet
OpenCV Drawing Functions Overview
23 pages
Automatic Number Plate Recognition System Roadmap
No ratings yet
Automatic Number Plate Recognition System Roadmap
8 pages
Ut It Lites All 2 Continue
No ratings yet
Ut It Lites All 2 Continue
7 pages
Prac 2 ACV-merged
No ratings yet
Prac 2 ACV-merged
8 pages
Point Operations in Image Processing
No ratings yet
Point Operations in Image Processing
7 pages
CHANGELOG
No ratings yet
CHANGELOG
2 pages
Syllabus - Statistical Analysis With Software Application
100% (7)
Syllabus - Statistical Analysis With Software Application
4 pages
Q3 English ActivitySheets
No ratings yet
Q3 English ActivitySheets
4 pages
Addison Public Library Homework Help
100% (1)
Addison Public Library Homework Help
4 pages
Ramsey Growth Model
No ratings yet
Ramsey Growth Model
56 pages
Research Protocol Essentials Guide
No ratings yet
Research Protocol Essentials Guide
17 pages
Movie Review
No ratings yet
Movie Review
2 pages
Micro Teaching Is A Best Example For Simulation
No ratings yet
Micro Teaching Is A Best Example For Simulation
2 pages
HR Practices at NRDC Report
No ratings yet
HR Practices at NRDC Report
72 pages
LP Grade 2
No ratings yet
LP Grade 2
2 pages
SNW Job Description
No ratings yet
SNW Job Description
1 page
MTech Data Science Program FAQ
No ratings yet
MTech Data Science Program FAQ
13 pages
Lesson Plan For Demo English 9
No ratings yet
Lesson Plan For Demo English 9
7 pages
3 Perspectives On Safety - Jorunn Tharaldsen
No ratings yet
3 Perspectives On Safety - Jorunn Tharaldsen
21 pages
1
No ratings yet
1
6 pages
Should You Use Your Iphone For Work?
No ratings yet
Should You Use Your Iphone For Work?
2 pages
Pgmath 2025
No ratings yet
Pgmath 2025
4 pages
Determining Scour Depth Around Structures in Gravel-Bed Rivers
No ratings yet
Determining Scour Depth Around Structures in Gravel-Bed Rivers
119 pages
2013 Physical Education Examination Paper
No ratings yet
2013 Physical Education Examination Paper
31 pages
Research Chapter 1
No ratings yet
Research Chapter 1
20 pages
A World of Regions
100% (4)
A World of Regions
20 pages
Business Statistics-B.com I
0% (1)
Business Statistics-B.com I
3 pages
Language Registers - Description
No ratings yet
Language Registers - Description
7 pages
Kecil. Tugas Akhir Mahasiswa Program Studi Teknik Kimia, Universitas Padjadjaran
No ratings yet
Kecil. Tugas Akhir Mahasiswa Program Studi Teknik Kimia, Universitas Padjadjaran
4 pages
CodesPractice MPOB
No ratings yet
CodesPractice MPOB
22 pages
Non-Projected Av Aids
64% (11)
Non-Projected Av Aids
58 pages
Enclosure No. 7 Sample Workplace Application Plan (WAP) Template
No ratings yet
Enclosure No. 7 Sample Workplace Application Plan (WAP) Template
4 pages
The Witch
No ratings yet
The Witch
11 pages
Contemporary World Output
100% (1)
Contemporary World Output
2 pages
Analisis Kualitas Pelayanan Terhadap Kepuasan Pasien Berobat Di Puskesmas Pembantu Desa Pasir Utama
No ratings yet
Analisis Kualitas Pelayanan Terhadap Kepuasan Pasien Berobat Di Puskesmas Pembantu Desa Pasir Utama
11 pages
HR Planning for Business Leaders
No ratings yet
HR Planning for Business Leaders
13 pages

Python Project

Uploaded by

Python Project

Uploaded by

PYTHON PROJECT

i# Import required packages

# Mention the installed location of Tesseract-OCR in your system

# Read image from which text needs to be extracted

# Preprocessing the image starts

# Convert the image to gray scale

# Performing OTSU threshold

# Specify structure shape and kernel size.

# Applying dilation on the threshold image

# Creating a copy of image

# A text le is created and ushed

# Looping through the identi ed contours

# Drawing a rectangle on copied image

# Cropping the text block for giving input to OCR

# Open the le in append mode

# Apply OCR on the cropped image

# Appending the text into le

You might also like