Seed Germination Prediction Using Machine Learning
Seed Germination Prediction Using Machine Learning
Project Report On
OF
BACHELOR’S OF ENGINEERING IN
INSTRUMENTATION ENGINEERING
BY
Dattaram Gawade (B190254610)
(B19025610)
(B190254609)
Akhsat Fulfagar (B19025609)
Under the Guidance of
Dr.A.A.Shinde
Institute Vission
“To Uplift The Common Masses By Rendering Value Added Education”.
Institute Mission
“Empowering Society Through Dynamic Education”.
Vision Of Department
To Be Recognized As One Of The Best Instrumentation Engineering Programs
By Developing Globally Competent Engineers, Researchers And Entrepreneurs To
Solve Real Life Problems Through Skill-Based Education.
Mission Of Department
It is a bonafide work carried out by them under the supervision of Dr. A.A.
Shinde and it is approved for the partial fulfilment of the requirement of Sav-
itribai Phule Pune University for the award of the Degree of Bachelor of
Engineering(Instrumentation), in the Acadamic year 2023-24. This project
report has not been earlier submitted to any other or university for the award of
degree or diploma.
The SDG Goal Responsible Consumption and Production focus on ensuring
sustainable consumption and production patterns. Our Seed Germination Pre-
diction project using Machine Learning falls under this SDG because it helps to
reduce waste by predicting which seeds are most likely to germinate successfully.
This ensures that resources such as water, fertilizers, and land are used more effi-
ciently.
Dr. A.A.Shinde
Project guide and Head of Department
Instrumentation Engineering
It is our great pleasure in expressing sincere and deep gratitude towards our
guide Dr.A.A.Shinde,Associate Professor and Head of the Department of In-
strumentation Engineering Department for her stimulating guidance, continuous
encouragement and supervision throughout the course of present work. We take
the opportunity to thank, project coordinator Dr.B.N.Mohapatra and all fac-
ulty members and staff of Department of Instrumentation Engineering AISSMS’s
IOIT, Pune, for their cooperation provided to us in many ways. We would like
to thank Mr.R.S.Wadkar, Deputy Director of Agriculture Office, Pune for the
project idea, constant support, continuous guidance and interaction definitely
helped us in many ways. We are extremely thankful to our Honourable Princi-
pal, Dr.P.B.Mane, for providing us infrastructural facilities to work in, without
which this work would not have been possible. Lastly, we are thankful to all those
who have directly or indirectly supported our project work.
-Dattaram Gawade
-Akshat Fulfagar
i
Abstract
For seed researchers, evaluating seed germination is a crucial step in assessing the
effectiveness and quality of seeds. Seed evaluations are often completed by hand,
which is a laborious, time-consuming, and prone to mistake procedure. Because
of the tremendous advancements in photography, electronics, and computer sci-
ence over the last several decades, infrared thermal imaging technology has seen
several technical advancements, clever uses for quality control and non-destructive
testing of many agro-food products. Thermal imaging is a potential non-contact
imaging technique for evaluating different quality attributes because it is based
on the infrared radiation released from target foods. Since large-scale germination
investigations often rely on human color-based threshold adjustments, they are
not well suited for conventional picture analysis methods.
Here, we are providing a machine learning method for high-throughput seed
germination tests and precise seed germination detection.
ii
Contents
Acknowledgement i
Abstact ii
1 Introduction 1
1.1 Seed Germination . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1.1 Types of Seed Germination . . . . . . . . . . . . . . . . . . 2
1.2 Germination Process . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.3 Germination Rate . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.4 Background of Project . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.5 Objectives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
2 Literature Survey 5
3 Methodology 8
3.1 Hardware Requirements . . . . . . . . . . . . . . . . . . . . . . . . 8
3.1.1 Infrared Thermal Imager . . . . . . . . . . . . . . . . . . . . 8
3.1.2 Germination Paper . . . . . . . . . . . . . . . . . . . . . . . 9
3.1.3 Germination Chamber . . . . . . . . . . . . . . . . . . . . . 10
3.2 Software Requirements . . . . . . . . . . . . . . . . . . . . . . . . . 10
3.2.1 Jupyter Notebook . . . . . . . . . . . . . . . . . . . . . . . . 10
3.2.2 Streamlit . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
4 Block Diagram 12
iii
6.2.1 Experiment Analysis . . . . . . . . . . . . . . . . . . . . . . 24
6.2.2 Model Dashboard . . . . . . . . . . . . . . . . . . . . . . . . 26
6.2.3 Model Code . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
6.2.4 Code Explanation . . . . . . . . . . . . . . . . . . . . . . . . 93
7 Conclusion 110
References 110
Certification 113
PO Mapping 120
SOCME 136
CD 162
iv
List of Figures
1.1 Seed Germination . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
v
List of Tables
6.1 Seeds Count . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
vi
Chapter 1
Introduction
A shoot simultaneously thrusts forward, toward the sun. Eventually, this sprout
will begin to unfurl its leaves, propelled by the stored nutrients. A plant’s au-
tonomous existence begins with germination, when it stops depending on the seed
and starts producing its own energy through photosynthesis.It marks the beginning
1
of a new plant’s journey towards development and maturity, much like nature’s
alarm clock.
1.1.1 Types of Seed Germination
There are two primary processes by which seeds can germinate: epigeal and hy-
pogeal. Plants have evolved several kinds of germinations as adaptations to max-
imize their development in various conditions.
Epigeal
When a seed germinates epigeally, the growing stalk emerges above the soil’s sur-
face. The cotyledons, or seed leaves, are raised into the air during this phase and
develop into the young plant’s first true green leaves. This kind of germination is
typical of plants like sunflowers and beans. The cotyledons are exposed to sunlight
for photosynthesis as the shoot rises and carries them along with it. With this
technique, the plant may effectively use soil nutrients and sunshine for its early
growth and development.
Hypogeal
When a seed germinates hypogeally, the growing branch remains below the soil’s
surface. During this phase, the stem elongates to force the first genuine leaves
above earth, leaving the cotyledons (seed leaves) buried. Plants that frequently
display hypogeal germination are peas and peanuts. Using this strategy, the cotyle-
dons act as nutrient stores for the growing seedling. This tactic shields the tender
branch from damage during the early phases of growth while enabling the young
plant to obtain vital resources.
1. Imbibition: The first step in germination is imbibition, where the dry seed
absorbs water through its seed coat. This water uptake causes the seed to
swell and the seed coat to soften, making it permeable to gases and soluble
nutrients.
2
simpler molecules that provide energy and building materials for the growing
embryo.
3. Respiration: During germination, the seed’s respiration rate increases signif-
icantly. Oxygen is consumed, and carbon dioxide is produced as the seed
converts its stored food into energy. Adequate oxygen supply is crucial dur-
ing this stage, which is why well-aerated soil is important for successful
germination.
4. Growth of the Radicle: The radicle, or embryonic root, is the first part of the
seedling to emerge from the seed. It grows downward into the soil, anchoring
the plant and beginning the process of water and nutrient absorption. The
emergence of the radicle is often considered the definitive sign that germina-
tion has occurred.
5. Emergence of the Shoot: Following the radicle, the shoot (or plumule) begins
to grow upwards towards the soil surface. This shoot consists of the stem
and the first leaves (cotyledons). The growth of the shoot is facilitated by
the elongation of cells in the hypocotyl (the stem-like part of the seedling
below the cotyledons) and the epicotyl (the part above the cotyledons).
6. Development of True Leaves: Once the shoot reaches the surface and is
exposed to light, the cotyledons open up and photosynthesis begins. This
process provides the seedling with the energy required for further growth.
The true leaves, which will be the plant’s main photosynthetic organs, start
to develop after the cotyledons.
7. Establishment of the Seedling: As the seedling continues to grow, it becomes
less reliant on the seed’s food reserves and more dependent on photosynthesis
and nutrient uptake from the soil. The root system expands, the shoot
elongates, and the plant establishes itself as a young seedling capable of
independent growth.
Throughout these stages, various environmental factors such as temperature, light,
moisture, and soil conditions play crucial roles in influencing the success and speed
of germination. Optimal conditions vary among different plant species, making
understanding these requirements essential for effective cultivation and seedling
establishment.
3
of seed viability and quality, influencing agricultural productivity and efficiency.
The percentage of seeds that successfully germinate in a given sample, typically
expressed over a specific time frame.
Germination rate can calculate as suppose 100 seed is taken for germination
test from a batch of 1000 seeds and that 100 seeds are put to germination. After
germination process from that 100 seeds 70 seed are germinated and 30 are not
germination , so the germination rate of that particular batch will be 70%.
1.5 Objectives
1. To reduce the number of manual steps in seed testing, which is highly error-
prone.
2. To automate the process of seed germination testing.
4
Chapter 2
Literature Survey
1. Here Nikita Genze et al. [1] introduce a machine learning approach for
high-throughput seed germination studies and reliable detection using mod-
ern artificial neural networks with region proposals. They compiled tagged
image data on the germination stages of over 2,400 seeds from three dif-
ferent crops: Zea mays (maize), Secale cereale (rye), and Pennisetum glau-
cum (pearl millet), utilizing approximately 23,000 images in total. Transfer
learning was employed to train various state-of-the-art convolutional neural
network (CNN) architectures with region proposals to automatically iden-
tify seedlings in petri dishes and predict their germination potential. There
suggested machine learning-based approach can facilitate the evaluation of
seed germination. Experiments with several seed varieties. In comparison
to traditional and manual approaches, it has reduced error rates and higher
performance, resulting in more precise germination indices and seed quality
evaluations.
2. In this study Srinath Yasam , et al. [2] the creation of the Automated
Germination Classification utilising ORB Feature Extractor with Machine
Learning (AGCORBML) model is the main emphasis of this study, according
to Srinath Yasam, et al. [2]. For the purposes of determining seed quality
and detecting
germination, the proposed AGCORB-ML approach employs hand-crafted
features and ML classifiers. The recently developed AGCORB-ML approach
uses the ORB technique to extract a suitable set of feature vectors from
the germination images. To categorize germination, two machine learning
models—the support vector machine (SVM) and the extreme learning ma-
chine (ELM)—are used. An comprehensive range of trials were conducted
to show the AGCORB-ML technique’s effectiveness in germination catego-
rization. According on the simulation results, the AGCORB-ML approach
5
outperforms existing ML models in a promising manner.
6
acceptable, they exhibit their technological expertise. We also demonstrate
the ability to accurately categories lots using crop-level picture data rather
than lot-specific data. These successes mark a significant turning point in
the application of this technique for industrial seed sorting by germination
outcome for various crops. There suggested method builds on prior attempts
to use optics to forecast seed germination efficiently and without harm, but
unlike them, it makes use of colour imaging and neural network classifica-
tion. In a cohort of seven distinct crops, dispersed across a wide phylogeny
and represented by at least two lots, bred and produced by twelve distinct
enterprises, we used it to robustly enrich seed germinability and usability,
two distinct features of germination, to 90% precision. This new capability is
anticipated to allow seed companies to separate seeds of various crops based
on their germination potential for the first time in high-fidelity, in a single
separation process, and without obligatory seed priming. This will stream-
line the process of seed germination enrich- ment and make lots that, using
modern separation techniques, would have failed enrichment attempts, such
as the lots used in this study, industrially useful. Unfortunately, they only
know the cause of poor germination for a small number of the lots, which
was either genetics production, or prolonged storage. Those lots encompass
the potential causes of mal-germination, and their classification satisfied the
requirements of 90% precision and 80% recall, proving the method’s adapt-
ability to various causes of mal-germination.
7
Chapter 3
Methodology
To predict seed germination using machine learning, we integrate advanced hard-
ware and software tools to create a comprehensive and efficient methodology. The
hardware setup includes an infrared thermal camera to monitor temperature vari-
ations during germination, germination paper to provide a consistent medium, and
a controlled germination chamber to maintain optimal environmental conditions.
These tools enable the precise collection of thermal images and environmental data,
which are essential for accurate predictions. The software aspect involves using
Jupyter Notebook for data preprocessing, feature extraction, and model training,
leveraging Python libraries like OpenCV, pandas, and scikit-learn. Additionally,
Streamlit is employed to develop an interactive web application, allowing users to
input new data and visualize predictions in real-time.
3.1 Hardware Requirements
3.1.1 Infrared Thermal Imager
8
Features:
Users may navigate to the hot and cold spots in a thermal picture by using
the hot and cold point temperature indicators.
Selectable palette
Connecting PC with USB can read the image stored in instrument directly.
This specific kind of paper is used as a substratum in the germination test. This
paper is ideal for checking the ideal moisture content of the different types of seeds
and germination forms. Its unique structure and exceptional wet strength assist
to inhibit the growth of fine seed roots. The provided seed germination paper has
a high capacity for absorption and a pH range of 5-7 with strong bursting power.
The primary criterion used to assess a seed’s viability, vigour, and outward appear-
ance is its germination count. With seed germination paper, the optimal moisture
content is ensured for the greatest range of seeds and germination forms.
9
3.1.3 Germination Chamber
Interactive notebook papers with real-time code, equations, graphics, video, and
other computational outputs can be created with Jupyter Notebook. Program-
mers, data scientists, and students frequently use Jupyter Notebook to record and
illustrate coding processes or just to play around with code.
One kind of integrated development environment (IDE) is Jupyter Notebook.
IDEs are code writing environments with certain auxiliary capabilities. Debugging,
code completion, and syntax highlighting are features found in almost all IDEs.
3.2.2 Streamlit
With the help of the open-source Python module Streamlit, developers can quickly
create eye-catching user interfaces. The simplest method for integrating code into
a web site, particularly for those without any prior front-end experience, is to use
Streamlit: There is no need for prior front-end (html, js, css) expertise or experi-
ence. A free and open-source framework called Streamlit makes it easy to create
and distribute stunning online applications for data science and machine learning.
It is a Python-based library created with machine learning developers in mind. on-
line developers are neither data scientists or machine learning engineers, and they
have little interest in spending weeks learning how to create online applications
using these frameworks. As long as the tool can present data and gather the neces-
sary parameters for modelling, they would prefer something simpler to understand
and operate. Streamlit helps you to develop a stunning-looking application with
only a few lines of code.
11
Chapter 4
Block Diagram
The block diagram for seed germination prediction using machine learning outlines
a systematic workflow integrating both hardware and software components.
12
Figure 4.1: Block Diagram
Initially, the hardware setup includes an infrared thermal camera capturing
temperature profiles of seeds on germination paper within a controlled germina-
tion chamber, ensuring consistent environmental conditions. The captured thermal
images and environmental data are then fed into the preprocessing block, where
image processing techniques extract relevant features and synchronize them with
environmental parameters. These preprocessed features are input into the machine
learning model development block, where algorithms like decision trees, random
forests, and neural networks are trained and validated using labelled data. The
final block involves deploying the trained model via a Streamlit-based interac-
tive web application, allowing users to input new data, visualize predictions, and
make informed decisions about seed germination. This comprehensive diagram
illustrates the seamless integration of hardware data acquisition with advanced
machine learning techniques and user-friendly software interfaces.
To conduct the seed germination process, the methodology involves several
steps. Firstly, three germination papers are acquired and thoroughly wetted to
ensure adequate moisture for seed germination. Next, soybean seeds are care-
fully placed onto two of the wetted germination papers, with appropriate spacing
between each seed. These seeds are then covered with the third germination pa-
per, forming a sandwich-like structure to promote germination. Subsequently, the
prepared seeds are left undisturbed for a duration of 8 days to initiate the ger-
mination process under controlled environmental conditions. After the incubation
period, thermal images of the soybean seeds are captured using specialised thermal
imaging equipment. These thermal images serve as valuable data to discern the
germination status of the seeds.
The next step involves organising the thermal images into a comprehensive
dataset, meticulously labelling each image according to the germination status of
the corresponding seed—whether germinated or dead. This dataset compilation
is crucial for training and testing deep learning algorithms. Finally, deep learning
algorithms, such as convolutional neural networks (CNNs), are employed to classify
the seeds based on the thermal images. Through meticulous training, validation,
and testing processes, the deep learning model can effectively distinguish between
germinated and dead seeds with high accuracy. This comprehensive methodology
ensures a systematic approach to analyse and classify soybean seeds’ germination
status, offering valuable insights for agricultural research and practice.
13
Chapter 5
Design and Implementation
The design and implementation of a seed germination prediction system using
machine learning involves integrating thermal imaging systems, both active and
passive, to capture comprehensive temperature profiles of seeds during the germi-
nation process. In the design phase, we employ a thermal imaging system where
infrared cameras continuously monitor the seeds on germination paper within a
controlled environment. Both active thermal imaging, which involves external
heat sources, and passive thermal imaging, relying on natural heat emission from
the seeds, are utilized to ensure accurate data collection. The processing of ther-
mal images includes extracting relevant features that reflect germination activity,
such as temperature patterns and heat distribution. Factors affecting the quality of
thermal images, such as camera resolution, environmental stability, and seed place-
ment, are meticulously managed to maintain data integrity. In the implementa-
tion phase, these processed thermal images, along with environmental parameters,
are fed into machine learning models to predict germination outcomes. The sys-
tem leverages software tools like Jupyter Notebook for data processing and model
training, and Streamlit for developing an interactive application, thus providing a
robust and user-friendly platform for accurate seed germination prediction.
5.1 Design
5.1.1 Thermal Imaging System
15
thermal characteristics of the items under research, the passive-based approach is
typically utilized.
Measurement in the course of processing. Additionally, as insects in seed lots
produce a lot more heat through their respiration than do sound seeds, it might
be utilized to identify insect infestation in seeds.
In the second method of thermal imaging, emissivity often fails to produce
infrared images with sufficient contrast. Therefore, it is advisable to heat or cool
the samples before capturing thermal image frames. It is advised to use an in cases
when it is not feasible to collect enough thermal signals.
Outside energy source in order to provide a sufficient thermal contrast. In
these situations, a thermal imaging technique known as active thermal imaging
is employed, in which the target items are heated or rapidly cooled in order to
deposit external energy before taking an image.
16
to regulate every one of these elements in reality, understanding them is crucial
for many application schemes.
Since the main objective of thermal imaging is to record the surface tempera-
ture of every object in the camera’s field of view based on the thermal radiation
emitted from these objects, factors that attenuate or augment thermal radiation
before it reaches the detector should be carefully considered. For instance, one
way that specific air parameters, including temperature, relative humidity, and
wind speed, affect the transmission of infrared light from the target object to the
thermal imaging system is by attenuation through absorption or scattering.
17
5.2 Implementation
5.2.1 Germination Test
The warm germination test serves as the standard method for assessing the viabil-
ity of seed batches. In compliance with the Nebraska State Seed Law, commercially
marketed seed must display its germination percentage on the seed tag. This test
subjects the seed to optimal conditions, including ideal light, temperature, and
relative humidity, which vary depending on the plant species under examination.
For conducting the warm germination test on large-seeded crops like soybean and
corn, the necessary supplies include a seed counter tray, a seed-retaining container,
and two moistened paper towels.
The seed counter tray is filled with seeds, ensuring each hole is filled and excess
seed is drained. Upon opening a sliding drawer, fifty seeds are dispensed onto the
paper towel, ensuring uniform placement after removing the counter tray. The
seeds are then covered with a thick and a thin, moistened paper towel, with their
edges sealed together to prevent drying out. The bottom edges of the towels are
folded to prevent seeds from rolling out, and the towels are rolled up for easier
handling. The paper towel is marked with the seed lot number using pencils
designed for writing on damp paper.
18
The thick paper towel and the thin, moist paper towel are layered over the seeds,
and the edges of the paper towels are squeezed together to seal them.
To prevent the seed from rolling out, the towels’ bottoms are folded. I’ve rolled
up the towel. In addition to making handling easier, this prevents the towel from
drying out.
19
For identifying purposes, the towel has the seed lot number in writing. Pencils
designed specifically to write on damp paper are used.
20
The rolled paper towels are placed plastic sacks to prevent drying, placed in a
germination chamber with the proper temperature, humidity and light.
The towels are taken out of the chamber after the designated period of time, which
is usually one week.
The paper towels are unrolled, completely uncovering the germinating seedlings.
Ungerminated seeds and germinated seedlings are separated and counted, and the
germination percentage is calculated.
21
Chapter 6
Experimentation and Results
The Experimentation and Result phase of seed germination prediction using
machine learning involves evaluating the performance of the developed models
and interpreting their predictions. By analyzing the data collected from thermal
imaging systems and environmental sensors, we assess the model’s accuracy,
precision, and recall in predicting seed germination outcomes. The evaluation
metrics indicate how well the machine learning algorithms, such as decision
trees, random forests, and neural networks, have learned from the training data.
Detailed analysis of the results helps identify patterns and correlations between
thermal image features and germination success.
6.1 Experimentation
Soybean seeds were positioned on germination paper so that various germination
stages could be photographed with infrared thermal imaging. To provide a sharp
contrast between the developing radicle and the background, the seeds were put on
a black fabric during the thermal picture capture. To lessen the impact of water
evaporation and the consequent drying out of the seeds, the cloth was moistened
with tap water and the germination paper was covered with butter paper. Reflec-
tions from the camera setup were among the reflections that the lid occasionally
produced. Each picture was taken in an office at room temperature using the same
artificial light source. For this purpose, in a single batch of germination 10 seed
samples were put for germination. Seed Germination required total 8 days to get
fully germinated but on-wards 6th day we are able to see that seed is germinating.
Seed Germination was captured over a time span of 1 days with a resolution of
33x33 (1089 pixels). First we have captured initial images of seed on day 1 before
putting for germination than on-wards 6th day we have captured images everyday
till 9th day.
Day 1
22
Day 7
Day 8
23
Emissivity = 0.96
6.2 Results
6.2.1 Experiment Analysis
360 350 10
24
Preprocessing:
1. Data Collection: Gathered data on various parameters affecting seed germi-
nation such as moisture, temperature, light exposure, and seed type.
2. Data Cleaning: Removed duplicate records and handled missing values either
through imputation or deletion.
3. Data Transformation:
Normalization/Standardization: Features are scaled to maintain uni-
formity and prevent the dominance of particular features.
Encoding Categorical Variables: Categorical variables are transformed
into numerical representations using methods such as one-hot encoding.
4. Data Splitting: The dataset is divided into training and testing sets, ensuring
a balanced representation of both classes.
Model Training:
1. Support Vector Machine (SVM):
Initialized an SVM model with suitable hyperparameters (e.g., linear
or RBF kernel).
Trained the SVM model on the preprocessed training data.
2. Random Forest:
Created a Random Forest model, a robust ensemble learning method.
Trained the Random Forest model on the preprocessed training data.
Evaluation:
1. Accuracy: Achieved an accuracy of 97.14%, indicating the model’s effective-
ness in predicting seed germination outcomes.
2. Epoch Iterations: Iterated over multiple epochs to fine-tune the model pa-
rameters and improve performance.
3. Seed Distribution:
Germinated Seeds: 350
Dead Seeds: 10
The model’s ability to predict both outcomes demonstrates its robust-
ness and effectiveness.
25
6.2.2 Model Dashboard
1. Write code in notepad and save as app.py
2. Open Command Prompt
3. Type the following
26
6.2.3 Model Code
27
6.2.4 Code Explanation
93
The `!pip install opencv-python` command is used to install the OpenCV (Open Source Computer
Vision) library in a Python environment using the pip package manager. Let's break down the
command:
- `!`: In a Jupyter Notebook or similar interactive environment, the exclamation mark (`!`) is used to
run shell commands from within the notebook.
- `pip`: Pip is the package installer for Python, used to install and manage Python packages.
- `opencv-python`: This is the name of the package being installed. OpenCV is a popular computer
vision library, and `opencv-python` is a Python wrapper for the OpenCV library.
When you run `!pip install opencv-python`, you are instructing pip to download and install the
OpenCV package, making the library and its functionalities available for use in your Python
environment. Once the installation is complete, you can import and use OpenCV in your Python
scripts or notebooks for tasks related to computer vision, image processing, and more.
The `import cv2` statement is used to import the OpenCV library into a Python script or environment.
OpenCV (Open Source Computer Vision) is a popular computer vision library that provides a wide
range of tools and functionalities for image and video processing.
Once you have imported OpenCV using `import cv2`, you can use its functions and classes in your
code.
from PIL import Image
import os
- `from PIL import Image`: Importing the `Image` module from the Python Imaging Library (PIL),
which is commonly used for working with images.
- `import os`: Importing the `os` module for interacting with the operating system.
- `import matplotlib.pyplot as plt`: Importing the `pyplot` module from Matplotlib for creating plots
and visualizations.
folder_path = r"D:\Project\images1\after"
- `folder_path`: The variable is set to the path where the images are stored. Make sure to replace
this path with the actual path to your image folder.
files = os.listdir(folder_path)
- `os.listdir(folder_path)`: Using the `os` module to obtain a list of all files in the specified directory
(`folder_path`).
# Filter out only the image files (you can customize this based on your image file extensions)
- `[file for file in files if file.endswith(('.jpg', '.jpeg', '.png', '.bmp'))]`: Using a list comprehension to
filter out only the files with specified image file extensions (`.jpg`, `.jpeg`, `.png`, `.bmp`).
axs = axs.flatten()
for i in range(min(10, len(image_files))): # Ensure you don't go beyond the number of available
images
img = Image.open(filepath)
axs[i].imshow(img)
- `plt.subplots(nrows=2, ncols=5, figsize=(12, 6))`: Creating a 2x5 grid of subplots for displaying the
images.
- `axs = axs.flatten()`: Flattening the 2D array of axes into a 1D array for easier indexing.
- The subsequent loop iterates over the images, loads each one using `Image.open()`, displays it in
the corresponding subplot using `axs[i].imshow(img)`, and turns off axis labels for better visualization
with `axs[i].axis('off')`.
In summary, this code reads and filters images from a specified folder, and then displays the first 10
images (or fewer if there are fewer than 10 images) in a 2x5 grid using Matplotlib.
Perform the same code as ‘after’ images for ‘dead’ images as well
- `cv2.imread`: This is a function in the OpenCV library used to read an image from a file.
- `r'D:\Project\images1\after\1 (1).bmp'`: This is the file path to the image you want to read. The `r`
before the string indicates a raw string, which is used to treat backslashes (`\`) as literal characters.
This line uses the `cv2.imread` function to read an image from the specified file path. The file path
points to a BMP image file (`1 (1).bmp`) located in the `D:\Project\images1\after\` directory. The
function returns a NumPy array representing the image.
It's important to note that the resulting image, when read using `cv2.imread`, will be represented as
a NumPy array with pixel values. This array can be further processed or used for various computer
vision tasks using the OpenCV library. If you want to display the image, additional steps such as using
`cv2.imshow` may be required.
This code is setting up an image data generator using TensorFlow's Keras API for training a neural
network. The image data generator is configured to perform various data augmentation techniques.
Let's go through each part of the code:
import tensorflow as tf
img_height = 224
img_width = 224
batch_size = 32
2. Configuration Parameters:
- `img_height` and `img_width`: Define the height and width to which the images will be resized
during training.
- `batch_size`: Determines the number of images processed in each iteration (batch) during
training.
rescale=1./255,
shear_range=0.2,
zoom_range=0.2,
horizontal_flip=True
train_dataset = train_datagen.flow_from_directory(
r"D:\Project\train1",
target_size=(img_height, img_width),
batch_size=batch_size,
class_mode='binary'
- `class_mode='binary'`: Specifies the type of label array generated by the generator. In this case,
it's set to `'binary'`, indicating binary labels (e.g., for a binary classification task).
In summary, this code sets up an image data generator for training a neural network, applying
various data augmentation techniques to the images from the specified directory. The generated
batches are then typically used for training a model, such as a convolutional neural network (CNN).
This code is similar to the previous one, but it sets up a separate image data generator for validation
using a different directory. Let's break down the code:
validation_dataset = train_datagen.flow_from_directory(
r"D:\Project\validation1",
target_size=(img_height, img_width),
batch_size=batch_size,
class_mode='binary'
- `class_mode='binary'`: Specifies the type of label array generated by the generator. In this case,
it's set to `'binary'`, indicating binary labels (e.g., for a binary classification task).
This code essentially creates a new data generator (`validation_dataset`) that applies the same data
augmentation settings as the one used for training (`train_datagen`). The validation set, located in
the directory specified, will be processed similarly to the training set, allowing for consistency in
preprocessing steps.
Typically, having separate data generators for training and validation helps ensure that the model
generalizes well to new, unseen data. The validation dataset is used to monitor the model's
performance during training and assess its ability to generalize beyond the training set.
The `train_dataset.class_indices` code refers to accessing the class indices assigned by the
`ImageDataGenerator` during the setup of the training dataset. When you use `flow_from_directory`
to generate batches of images, the generator automatically assigns numerical indices to the different
classes based on the subdirectories in the specified training directory.
class_indices = train_dataset.class_indices
1. `train_dataset`: This is an instance of the `ImageDataGenerator` that was created for the training
dataset using `flow_from_directory`.
2. `.class_indices`: This attribute of the `ImageDataGenerator` object provides a dictionary where the
keys are the class names (or subdirectory names in the training directory), and the values are the
corresponding numerical indices assigned to each class.
The `train_dataset.classes` code refers to accessing the class labels assigned to the images in the
training dataset by the `ImageDataGenerator`. When you use `flow_from_directory` to generate
batches of images, the generator assigns numerical labels to the images based on the subdirectories
in the specified training directory.
class_labels = train_dataset.classes
1. `train_dataset`: This is an instance of the `ImageDataGenerator` that was created for the training
dataset using `flow_from_directory`.
2. `.classes`: This attribute of the `ImageDataGenerator` object provides an array of numerical labels
corresponding to the classes of the images in the training dataset. The order of the labels
corresponds to the order in which the images are loaded by the generator.
This code defines a Convolutional Neural Network (CNN) model for binary classification using the
TensorFlow and Keras library. Let's break down each part of the code:
import tensorflow as tf
model = tf.keras.models.Sequential([
tf.keras.layers.MaxPooling2D(2, 2),
tf.keras.layers.MaxPooling2D(2, 2),
tf.keras.layers.Flatten(),
tf.keras.layers.Dense(512, activation='relu'),
tf.keras.layers.Dense(1, activation='sigmoid')
])
- `tf.keras.models.Sequential`: This line creates a sequential model, which is a linear stack of layers.
2. Convolutional Layers:
- `tf.keras.layers.MaxPooling2D(2, 2)`: Max-pooling layer with a pool size of (2, 2) to reduce spatial
dimensions.
- Similar architecture with increasing filter count and decreasing spatial dimensions.
4. Flatten Layer:
- `tf.keras.layers.Flatten()`: Flatten layer to convert the 3D output to 1D before passing to the dense
layers.
5. Dense Layers:
- `tf.keras.layers.Dense(512, activation='relu')`: A dense layer with 512 units and Relu activation.
- `tf.keras.layers.Dense(1, activation='sigmoid')`: The output layer with a single unit and sigmoid
activation for binary classification.
model.summary()
7. Model Summary:
- `model.summary()`: Prints a summary of the model architecture, including layer names, output
shapes, and parameter counts.
In summary, this code defines a simple CNN model for binary classification, compiles it with specified
settings, displays the model summary, and includes an unnecessary import of the RMSprop
optimizer. The model consists of convolutional layers followed by max-pooling layers, a flatten layer,
and dense layers, culminating in an output layer with a sigmoid activation for binary classification.
model = tf.keras.models.Sequential([
tf.keras.layers.MaxPooling2D(2, 2),
])
In this snippet, a MaxPooling2D layer with a pool size of (2, 2) is added to the model. Max pooling
reduces the spatial dimensions of the input data by selecting the maximum value within a specified
window. Note that you should have other layers (like Conv2D) before and after this MaxPooling2D
layer for a complete model.
model = tf.keras.models.Sequential([
tf.keras.layers.Flatten(),
])
In this snippet, after the MaxPooling2D layer, a Flatten layer is added. The Flatten layer is used to
convert the 3D output of the previous layer to a 1D array before connecting it to dense layers. This is
a common practice when transitioning from convolutional layers to fully connected layers in a CNN.
model = tf.keras.models.Sequential([
tf.keras.layers.Flatten(),
tf.keras.layers.Dense(512, activation='relu'),
])
In this snippet, a Flatten layer is added, followed by a Dense layer with 512 units and ReLU
activation. The Flatten layer is used to convert the 3D output of the previous layer to a 1D array
before connecting it to the dense layer.
It's essential to have a well-defined sequence of layers in a `Sequential` model, starting with input
layers, followed by convolutional layers, pooling layers, flattening layers, and then dense layers for
the architecture to make sense. The exact structure depends on the specific requirements of your
task (e.g., image classification, object detection).
The code involves compiling a model and training it using TensorFlow and Keras. Let's break down
each part of the code:
model.compile(loss='binary_crossentropy',
optimizer=RMSprop(lr=0.001),
metrics=['accuracy'])
model_fit = model.fit(train_dataset,
steps_per_epoch=3,
epochs=10,
validation_data=validation_dataset)
2. Model Training:
- `steps_per_epoch=3`: Number of batches to be processed per epoch. This value is set to 3, but in
practice, it should be adjusted based on the size of your dataset.
- `epochs=10`: Number of times the entire training dataset is passed through the model. In this
case, the training will proceed for 10 epochs.
The `model_fit` variable will store information about the training process, including loss and accuracy
values at each epoch. After training completes, we can analyze this information to evaluate how well
our model has learned from the data.
This code demonstrates the process of defining, compiling, and training a neural network for binary
classification using TensorFlow and Keras. Adjustments can be made based on the specific
requirements of your task and dataset.
This code defines a simple Sequential model for binary classification using TensorFlow and Keras. The
model consists of two Dense layers. Let's break down each part of the code:
1. Importing Libraries:
- `Sequential`: A linear stack of layers for building a deep learning model layer by layer.
- `Dense`: A fully connected layer where each neuron is connected to every neuron in the previous
layer.
model = Sequential()
model.add(Dense(1, activation='sigmoid'))
- `model.add(Dense(1, activation='sigmoid'))`: Adds the output Dense layer with 1 unit and a
sigmoid activation function. For binary classification, a sigmoid activation is common in the output
layer.
model.summary()
- `model.summary()`: Prints a summary of the model architecture, including layer names, output
shapes, and parameter counts.
In summary, this code defines a basic feedforward neural network with one hidden layer for binary
classification. The architecture can be adjusted based on the specific requirements of your task.
Remember to customize the input shape, activation functions, and other parameters based on your
dataset and problem.
This code performs image classification using a pre-trained model to predict whether seeds in the
given images are germinated or not. Let's break down each part of the code:
Explanation:
1. File Iteration:
- The outer loop iterates through the files in the directory specified by `dir_path`, assuming it
contains subdirectories with seed images.
- The inner loop iterates through the files in the "after" subdirectory (presumably containing images
of germinated seeds).
6. Germination Check:
- Checks if the predicted probability is greater than 0.2. If yes, it's considered as germinated;
otherwise, it's considered not germinated or dead.
This code assumes that the model (`model`) has been previously defined and trained for seed
germination classification. The threshold of 0.2 for considering germination is arbitrary and can be
adjusted based on the model's performance and specific requirements.
This code demonstrates the use of a pre-trained VGG16 model as a feature extractor, followed by
training a Support Vector Machine (SVM) classifier on the extracted features. Let's break down each
part of the code:
1. Importing Libraries:
- `Model`: Importing the Model class to create a modified version of the VGG16 model.
train_features = base_model.predict(train_dataset)
3. Extracting Features:
5. Train-Test Split:
- Splitting the data into training and testing sets using scikit-learn's `train_test_split` function.
svm_classifier = SVC()
svm_classifier.fit(X_train, y_train)
svm_predictions = svm_classifier.predict(X_test)
# Evaluate accuracy
- Making predictions on the test set using the trained SVM classifier.
- Evaluating the accuracy of the SVM classifier using scikit-learn's `accuracy_score` function.
This code demonstrates a transfer learning approach where the pre-trained VGG16 model is used to
extract features from images, and an SVM classifier is trained on these features for a specific
classification task.
This code demonstrates the use of a Random Forest classifier from scikit-learn for a classification
task. Let's break down each part of the code:
rf_classifier.fit(X_train, y_train)
- `rf_classifier.fit(X_train, y_train)`: Training the Random Forest classifier using the training data
(`X_train` features and `y_train` labels).
rf_predictions = rf_classifier.predict(X_test)
4. Making Predictions on the Test Set:
- `rf_classifier.predict(X_test)`: Making predictions on the test set using the trained Random Forest
classifier.
# Evaluate accuracy
5.*Evaluating Accuracy:
In summary, this code creates a Random Forest classifier, trains it on a training dataset (`X_train` and
`y_train`), makes predictions on a test dataset (`X_test`), and evaluates the accuracy of the classifier.
Random Forests are an ensemble learning method that builds multiple decision trees during training
and outputs the mode of the classes as the prediction. The number of decision trees (`n_estimators`)
is a hyperparameter that can be adjusted based on the dataset and desired performance.
Chapter 7
Conclusion
110
References
111
10. Abram J, Bicksler P . Testing Seed Viability Using Simple Germination
Tests; A Regional Supplement to ECHO Development Notes. Unpublushed.(2011),
Pp 1-4.
12. Saffariha, M., Jahani, A. & Potter, D. ”Seed germination prediction of Salvia
limbata under ecological stresses in protected areas: an artificial intelligence
modeling approach”, BMC Ecol 20, 48 (2020). https://doi.org/10.1186/s12898-
020-00316-4
13. Chitra, H. S. H., Suguna, S., & Sujatha, S. N. A survey on image analysis
techniques in agricultural product. Indian Journal of Science and Technol-
ogy, 9(12), 1-13, (2016).
112
Certification
113
114
115
Log Book
Sr. Internal
Date Work Done
No Guide
Project discussion
1 14/07/2023
with Guide.
Project Discussion
with domain expert
2 21/07/2023
officer Mr. Rawindra
Wardkar sir
116
Created a second
6 25/08/2023 batch of thermal
images.
1. Meeting with
Guide
2. Searching and
7 15/09/2023 collection of
research papers
related to the
project.
Read 15 to 20 Re-
search papers and col-
9 29/09/2023 lected relevant infor-
mation for review pa-
pers.
1. Created a
dataset of
approx 500
thermal images.
12 03/11/2023
2. . Analysing
the dataset and
sorting of im-
ages.
117
1. . Importing
dataset
2. Checking accu-
racy of dataset
13 10/11/2023
using SVM and
Random forest
machine learn-
ing algorithm.
Project Stage I
14 23/11/2023
Demonstration.
Project Stage II
19 02/02/2024
progress presentation.
Pariyojna Pratiyogita
2024 project com-
22 08/04/2024 petition at Sinhgad
academy of engineer-
ing.
118
Project Competition
23 15/04/2024 and Exhibition in
AISSMS IOIT.
1. Project Stage II
Examiantion
24 25/05/2024
2. BlackBook Sub-
mission
119
PO Mapping
2
damentals, and an engineering specialization to the solution of
complex engineering problems. [Engineering Knowledge]
2
principles of mathematics, natural sciences, and engineering sci-
ences. [Problem analysis]
2
cultural, societal, and environmental considerations.[Design/ de-
velopment of solutions]
120
PO 4: Use research-based knowledge and research methods including
design of experiments, analysis and interpretation of data, and syn-
2
thesis of the information to provide valid conclusions. [Conduct in-
vestigations of complex problems].
2
elling to complex engineering activities with an understanding of
the limitations. [Modern tool usage]
2
responsibilities relevant to the professional engineering practice.
[The engineer and society]
2
responsibilities and norms of the engineering practice. [Ethics]
2
in diverse teams, and in multidisciplinary settings. [Individual and
team work]
2
mentation, make effective presentations, and give and receive clear
instructions. [Communication]
2
member and leader in a team, to manage projects and in multidisci-
plinary environments. [Project management and finance]
121
PO 12: Recognize the need for, and have the preparation and ability to
2
engage in independent and lifelong learning in the broadest context
of technological change. [Life-long learning]
2
ment and control to solve the problems related to the environment,
safety, health and agriculture sectors.
2
ming, Programmable logic controller, LabView, Distributed con-
trol system and Internet of thing.
Dr.A.A.Shinde
Project Guide
122
Self Evaluation
Self-Evaluation sheet
Name of Student:
123
Critical Thinking
1. Farmers and Agricultural Industry: Farmers can benefit greatly from accu-
rate seed germination predictions. By knowing in advance which seeds are
likely to germinate successfully, farmers can optimize planting schedules, re-
duce waste of seeds, and increase overall crop yields. This leads to improved
efficiency and profitability in the agricultural industry.
2. Seed Producers and Distributors: Seed producers and distributors can use
machine learning models to predict seed germination rates more accurately.
This allows them to better manage inventory, ensure timely deliveries, and
maintain customer satisfaction by providing high-quality seeds.
124
5. Education and Outreach: Projects like this can also be valuable for educa-
tional purposes. Students and enthusiasts interested in machine learning,
agriculture, or environmental science can learn about the practical applica-
tions of data science in addressing real-world challenges such as optimizing
agricultural practices and enhancing food security.
2. Seed Producers and Distributors: Seed producers and distributors are di-
rectly impacted by the project’s outcomes as they rely on accurate germi-
nation predictions to manage their inventories and meet customer demands
effectively. By incorporating machine learning-generated insights into their
operations, they can ensure the timely availability of high-quality seeds with
predictable germination rates. This enhances their competitiveness in the
market and strengthens customer satisfaction, leading to increased sales and
revenue.
125
comes. Accurate predictions of seed germination enable them to select ap-
propriate seed species for restoration projects, ensuring the successful estab-
lishment of native plant communities. By leveraging machine learning-based
insights, conservationists can enhance the effectiveness of their efforts to re-
habilitate degraded ecosystems, preserve endangered species habitats, and
promote landscape-scale conservation.
5. Educational Institutions: Educational institutions, including universities and
research centers, are directly impacted by the project as it contributes to the
advancement of knowledge in fields such as plant biology, ecology, and data
science. Students and researchers have access to the data generated by the
project, allowing them to study the underlying mechanisms of seed germina-
tion and explore innovative applications of machine learning in agriculture
and environmental science. This fosters interdisciplinary collaboration and
prepares future generations of scientists and professionals to address complex
challenges in food security and sustainability.
Q3: What are the strengths and weaknesses?
Ans:
Strengths:
1. Data-Driven Insights: One of the primary strengths of using machine learn-
ing for seed germination prediction is its ability to analyze vast amounts
of data. Machine learning algorithms can identify complex patterns and
relationships within datasets that may not be apparent through traditional
statistical methods. This enables researchers to extract valuable insights into
the factors influencing seed germination, such as environmental conditions,
seed characteristics, and genetic traits.
2. Improved Accuracy: Machine learning models have the potential to signifi-
cantly improve the accuracy of seed germination predictions. By training on
large and diverse datasets, these models can learn to recognize subtle cues
and patterns associated with successful germination outcomes. This leads
to more reliable predictions, allowing farmers, seed producers, and conserva-
tionists to make informed decisions with greater confidence.
3. Customization and Adaptability: Machine learning techniques offer flexibil-
ity in model development and customization. Researchers can tailor mod-
els to specific crop species, environmental conditions, or geographic regions,
ensuring relevance and applicability to diverse agricultural and ecological
contexts. Moreover, machine learning algorithms can adapt and refine their
predictions over time as new data becomes available, leading to continuous
improvement in accuracy and performance.
126
4. Resource Efficiency: By predicting seed germination outcomes in advance,
machine learning models help optimize resource allocation in agriculture and
conservation. Farmers can minimize the use of seeds, water, fertilizers, and
other inputs by strategically planning planting schedules based on predicted
germination probabilities. Similarly, conservationists can prioritize the se-
lection and deployment of seeds in habitat restoration projects, maximizing
the efficiency and effectiveness of restoration efforts.
Weakness:
127
Q4: What is the best and worst case scenario?
Ans: The best and worst-case scenarios for the project include:
Best-Case Scenario:
1. High Accuracy and Reliability: In the best-case scenario, the machine learn-
ing models exhibit exceptional accuracy and reliability in predicting seed
germination outcomes. These models successfully capture the complex rela-
tionships between environmental factors, seed characteristics, and germina-
tion rates, leading to highly accurate predictions across diverse conditions
and crop species.
2. Optimized Resource Allocation: Farmers, seed producers, and conservation-
ists benefit from optimized resource allocation based on the predictions gen-
erated by the machine learning models. Planting schedules are meticulously
planned to coincide with optimal germination conditions, minimizing seed
wastage and resource usage while maximizing crop yields and restoration
success rates.
3. Improved Agricultural Productivity: The project contributes to significant
improvements in agricultural productivity and sustainability. Farmers achieve
higher yields and profitability through enhanced germination rates and effi-
cient use of inputs, leading to increased food security and economic prosper-
ity in agricultural communities.
4. Enhanced Conservation Efforts: Conservationists leverage the predictive
power of machine learning to prioritize seed selection and habitat restora-
tion projects effectively. Native plant species are reintroduced into degraded
ecosystems with a high likelihood of successful establishment, resulting in
improved biodiversity, ecosystem resilience, and ecological restoration out-
comes.
5. Scientific Advancements: The project drives scientific advancements in the
fields of plant biology, ecology, and data science. Researchers gain valuable
insights into the mechanisms underlying seed germination and ecosystem
dynamics, fostering interdisciplinary collaboration and innovation in agri-
cultural and environmental research.
Worst-Case Scenario:
1. Inaccurate Predictions: In the worst-case scenario, the machine learning
models produce inaccurate predictions of seed germination outcomes. Fac-
tors such as limited or biased data, inadequate model training, or unforeseen
environmental variability contribute to unreliable predictions, leading to sub-
optimal decision-making and resource allocation by stakeholders.
128
2. Resource Misallocation: Farmers and conservationists may experience re-
source misallocation due to unreliable predictions generated by the machine
learning models. Planting schedules may be poorly timed, resulting in de-
creased crop yields, wasted seeds, and inefficient use of resources. Conser-
vation efforts may suffer setbacks as seeds fail to germinate in restoration
projects, undermining biodiversity conservation and ecosystem restoration
goals.
3. Loss of Trust and Adoption: Stakeholders may lose trust in the predictive ca-
pabilities of machine learning models if they consistently produce inaccurate
results. This loss of trust could lead to reluctance or resistance in adopting
machine learning technology for seed germination prediction, hindering its
potential to improve agricultural practices and conservation outcomes.
129
significantly. In these locales, where unpredictable weather patterns and water
scarcity pose constant threats to agriculture, accurate prediction of seed germina-
tion offers a lifeline. By pinpointing optimal planting times and identifying seeds
resilient to local conditions, farmers can mitigate risks and optimize resource usage,
ultimately bolstering agricultural productivity and resilience. Moreover, the tech-
nology holds immense promise in supporting marginalized or vulnerable commu-
nities, including rural farmers, smallholders, and indigenous groups. Often lacking
access to modern agricultural practices, these communities face heightened vul-
nerability to climate change impacts and resource constraints. Implementing seed
germination prediction models empowers them with valuable insights and tools
to navigate uncertain agricultural landscapes, enhancing their ability to sustain
livelihoods and ensure food security. Furthermore, regions characterized by bio-
diversity hotspots, degraded ecosystems, or undergoing habitat restoration efforts
also present significant opportunities for applying predictive modeling. By guid-
ing conservationists in selecting appropriate seeds and restoration strategies, these
models can accelerate ecosystem recovery, promote biodiversity conservation, and
strengthen ecosystem resilience in the face of ongoing environmental challenges.
130
models to guide habitat restoration efforts, selecting native plant species with
high germination probabilities to restore biodiversity, improve soil health,
and enhance ecosystem resilience.
131
with ecological systems makes it difficult to anticipate all potential factors
influencing seed germination accuratel
132
2. Environmental Complexity: Environmental conditions influencing seed ger-
mination are complex and dynamic, making it challenging to capture all rel-
evant factors accurately. Variables such as soil moisture, temperature, light
intensity, and nutrient availability interact in intricate ways, and predicting
their combined effects accurately can be difficult. Moreover, environmental
conditions may vary significantly across different geographical regions and
cropping systems, further complicating predictive modeling efforts.
133
seed germination rates, necessitating more accurate and adaptable predictive
models to optimize agricultural practices and mitigate risks.
3. Resource Scarcity: Scarce resources such as water, land, and seeds require
careful management to meet the demands of a growing population while
minimizing environmental impact. Predictive modeling enables farmers to
make informed decisions about resource allocation, optimizing seed usage,
irrigation practices, and land management to achieve sustainable agricultural
production.
134
not only enhances farm profitability but also promotes food security and economic
resilience in agricultural communities. Seed producers and distributors similarly
benefit from improved inventory management and customer satisfaction, leading
to enhanced market competitiveness and profitability. Conservationists leverage
predictive models to guide habitat restoration efforts, selecting native plant species
with high germination probabilities to restore biodiversity and ecosystem resilience.
Moreover, researchers gain valuable insights into seed biology and environmental
interactions, driving scientific advancements and interdisciplinary collaboration.
Ultimately, the widespread adoption of seed germination prediction using machine
learning fosters technological innovation, sustainable development, and positive so-
cietal impacts, thereby benefiting individuals, communities, and ecosystems alike.
135
SOCME
The SOCME (System Oriented Concept Map Extension) for seed germination
prediction using machine learning involves several key components:
Data Collection and Preprocessing:
Feature Engineering:
Split the dataset into training and testing sets for model evaluation.
136
Model Evaluation:
Assess the performance of each model using metrics like accuracy, precision,
recall, and F1-score.
Validate the models using cross-validation techniques to ensure robustness.
Integration and Deployment:
Integrate the trained model into a user-friendly interface or application.
Ensure scalability and efficiency for real-time or batch prediction of seed
germination.
Deploy the system in relevant environments such as agriculture labs or seed
production facilities.
Continuous Improvement:
Monitor the performance of the deployed system and collect feedback.
Fine-tune the model periodically using additional data or updated techniques
to improve accuracy and reliability.
Throughout the SOCME process, collaboration between domain experts, data
scientists, and stakeholders is essential to ensure the successful development and
deployment of the seed germination prediction system.
137
Paper Publication
138
Plagiarism Reports
139
Seed Germination Prediction using Machine Learning
Abstract : For seed researchers, evaluating seed germination is a crucial step in assessing the
effectiveness and quality of seeds. Seed evaluations are often completed by hand, which is a
laborious, time-consuming, and prone to mistake procedure. Because of the tremendous
advancements in photography, electronics, and computer science over the last several decades,
infrared thermal imaging technology has seen several1 technical advancements . clever uses for
quality control and non-destructive testing of many agro-food products. Thermal imaging is a
potential non-contact imaging technique for evaluating different quality attributes because it is
1
based on the infrared radiation released from target foods. Since large-scale germination
investigations often rely on human color-based threshold adjustments, they are not well suited
for conventional picture analysis methods.
Here, we provide a machine learning method for high-throughput seed germination tests and
precise seed germination detection.
1.1 Seed Germination : The biological process of a seed turning into a sprout or a
budding plant is called germination. Everything begins when a seed, buried in the earth,
comes into contact with the proper elements—moisture, warmth, and occasionally light.
Water is absorbed by the seed, which causes it to grow and release stored enzymes. These
enzymes set off metabolic processes that decompose the seed's stored nutrients. The seed
coat rips apart as a consequence, and a little root appears to look for nutrients in the soil.
A shoot simultaneously thrusts forward, toward the sun. Eventually, this sprout will begin
to unfurl its leaves, propelled by the stored nutrients. A plant's autonomous existence
begins with germination, when it stops depending on the seed and starts producing its own
energy through photosynthesis.It marks the beginning of a new plant's journey towards
development and maturity, much like nature's alarm clock.
1.1.1 Types of Seed Germination : There are two primary processes by which seeds can
germinate: epigeal and hypogeal. Plants have evolved several kinds of germinations as
adaptations to maximize their development in various conditions.
Epigeal
When a seed germinates epigeally, the growing stalk emerges above the soil's surface.
The cotyledons, or seed leaves, are raised into the air during this phase and develop into
the young plant's first true green leaves. This kind of germination is typical of plants like
sunflowers and beans. The cotyledons are exposed to sunlight for photosynthesis as the
shoot rises and carries them along with it. With this technique, the plant may effectively
use soil nutrients and sunshine for its early growth and development.
Hypogeal
When a seed germinates hypogeally, the growing branch remains below the soil's surface.
During this phase, the stem elongates to force the first genuine leaves above earth, leaving
the cotyledons (seed leaves) buried. Plants that frequently display hypogeal germination
are peas and peanuts. Using this strategy, the cotyledons act as nutrient stores for the
growing seedling. This tactic shields the tender branch from damage during the early
phases of growth while enabling the young plant to obtain vital resources.
1.1.2 Germination Process : The germination process is a critical phase in the life cycle
of a seed, marking its transition from dormancy to active growth. This process involves
several stages, each essential for the successful development of a new plant. Here is a
detailed look at the germination process:
Imbibition: The first step in germination is imbibition, where the dry seed absorbs
water through its seed coat. This water uptake causes the seed to swell and the seed
coat to soften, making it permeable to gases and soluble nutrients.
Activation of Metabolism: As the seed absorbs water, its metabolic processes are
reactivated. Enzymes that were previously inactive during dormancy become active,
leading to the breakdown of stored food reserves in the seed. These reserves,
typically starches, proteins, and lipids, are converted into simpler molecules that
provide energy and building materials for the growing embryo.
Respiration: During germination, the seed's respiration rate increases significantly.
Oxygen is consumed, and carbon dioxide is produced as the seed converts its stored
food into energy. Adequate oxygen supply is crucial during this stage, which is why
well-aerated soil is important for successful germination.
Growth of the Radicle: The radicle, or embryonic root, is the first part of the seedling
to emerge from the seed. It grows downward into the soil, anchoring the plant and
beginning the process of water and nutrient absorption. The emergence of the radicle
is often considered the definitive sign that germination has occurred.
Emergence of the Shoot: Following the radicle, the shoot (or plumule) begins to grow
upwards towards the soil surface. This shoot consists of the stem and the first leaves
(cotyledons). The growth of the shoot is facilitated by the elongation of cells in the
hypocotyl (the stem-like part of the seedling below the cotyledons) and the epicotyl
(the part above the cotyledons).
Development of True Leaves: Once the shoot reaches the surface and is exposed to
light, the cotyledons open up and photosynthesis begins. This process provides the
seedling with the energy required for further growth. The true leaves, which will be
the plant’s main photosynthetic organs, start to develop after the cotyledons.
Establishment of the Seedling: As the seedling continues to grow, it becomes less
reliant on the seed's food reserves and more dependent on photosynthesis and nutrient
uptake from the soil. The root system expands, the shoot elongates, and the plant
establishes itself as a young seedling capable of independent growth.
1.3 Germination Rate : The germination rate is a measure of the proportion of seeds that
successfully sprout and begin to grow over a specific period. It is an important indicator
of seed viability and quality, influencing agricultural productivity and efficiency. The
percentage of seeds that successfully germinate in a given sample, typically expressed
over a specific time frame.
Germination rate can calculate as suppose 100 seed is taken for germination test from a
batch of 1000 seeds and that 100 seeds are put to germination. After germination process
from that 100 seeds 70 seed are germinated and 30 are not germination , so the
germination rate of that particular batch will be 70%.
1.4 Background of Project : The assessment of seed quality is typically performed by
professional technicians, relying on their individual expertise and taking considerable
time. To simplify seed grading for producers, researchers have developed machine vision-
oriented methods to monitor seed quality attributes such as texture, color, and form. These
models enable automated and rapid acquisition of relevant variables. However, the results
of seed grading, whether through automated 5computer classification or technician
assessment, must be validated by the conventional germination test.
This germination test usually takes two to ten days, depending on the seed type, and
requires expert personnel to manually count germinated seeds. If the results of the
samples fall outside the acceptable range, the test must be repeated, demanding significant
effort from skilled experts. Due to the unique nature of germination tests and their
dependence on skilled professionals for accurate classification, there is a pressing need for
a reliable, automated system to examine seed germination with greater stability,
independence, and accuracy across species.
The automated germination test monitoring system in the study now employs traditional
image analysis techniques. The conventional method for testing seed vigor3 is labor-
intensive and challenging, leading to its infrequent use. Additionally, many seed tests
established by the International Seed Testing Association (ISTA) are manually assessed
using protocols that vary by crop. Researchers have developed automatic methods to
5
minimize errors and reduce manual steps in the seed testing process. Modern image
analysis techniques, being automated, produce quantitative, unbiased measures with
minimal errors and have thus been utilized to detect seeds.
1.5 Objectives :
3
1. To reduce the number of manual steps in seed testing, which is highly error-prone.
9
2. To automate the process of seed germination testing.
2. Literature Survey :
5
In this chapter, a brief survey of the present germination classification model has
been mentioned.
1. Here Nikita Genze et al. [1] introduce a machine learning approach for high-
3
throughput seed germination studies and reliable detection using modern artificial
neural networks with region proposals. They compiled tagged image data on the
3
germination stages of over 2,400 seeds from three different crops: Zea mays (maize),
Secale cereale (rye), and Pennisetum glaucum (pearl millet), utilizing approximately
3
23,000 images in total. Transfer learning was employed to train various state-of-the-
9
art convolutional neural network (CNN) architectures with region proposals to
automatically identify seedlings in petri dishes and predict their germination
potential.
There suggested machine learning-based approach can facilitate the evaluation of
seed germination. Experiments with several seed varieties. In comparison to
traditional and manual approaches, it has reduced error rates and higher performance,
resulting in more recise germination indices and seed quality evaluations.
2. In this study Srinath Yasam , et al. [2] the creation of the Automated Germination
Classification utilising ORB Feature Extractor with Machine Learning
(AGCORBML) model is the main emphasis of this study, according to Srinath
Yasam, et al. [2]. For the purposes of determining seed quality and detecting
germination, the proposed AGCORB-ML approach employs hand-crafted features
and ML classifiers. The recently developed AGCORB-ML approach uses the ORB
technique to extract a suitable set of feature vectors from8the germination images. To
categorize germination, two machine learning models—the support vector machine
(SVM) and the extreme learning machine (ELM)—are used. An comprehensive
range of trials were conducted to show the AGCORB-ML technique’s effectiveness
in germination categorization. According on the simulation results, the AGCORB-
ML approach outperforms existing ML models in a promising manner.
5. Nehoshtan, Y., et al. present the first-ever general germination prediction method
here [5]. It makes it simpler to categorize seeds based on their usefulness and
capacity to germinate—two characteristics that influence the outcome of
2
germination—and is based on deep learning and RGB image data. By making
2
hundreds of rejected seed batches from seven different vegetable crops—representing
different genotypes and production pipelines—industrially acceptable, they exhibit
their technological expertise.
We also demonstrate the ability to accurately categories lots using crop-level picture
2
data rather than lot-specific data. These successes mark a significant turning point in
the application of this technique for industrial seed sorting by germination outcome
for various crops. There suggested method builds on prior attempts to use optics to
forecast seed germination efficiently and without harm,2 but unlike them, it makes use
of colour imaging and neural network classification. In a cohort of seven distinct
crops, dispersed across a wide phylogeny and represented by at least two lots, bred
7
and produced by twelve distinct enterprises, we used it to robustly enrich seed
germinability and usability, two distinct features of germination, to 90\% precision.
2
This new capability is anticipated to allow seed companies to 2separate seeds of
various crops based on their germination potential for the first time in high-fidelity, in
a single separation process, and without obligatory seed priming. This will streamline
2
the process of seed germination enrich- ment and make lots that, using modern
separation techniques, would have failed enrichment attempts, such as the lots used in
this study, industrially useful. Unfortunately, they only know the cause of poor
germination for a small number of the lots, which was either genetics production, or
prolonged storage. Those lots encompass the potential causes of mal-germination,
and their classification satisfied the requirements of 90% precision and 80% recall,
proving the method’s adaptability to various causes of mal-germination.
3. Methodology
To predict seed germination using machine learning, we integrate advanced hardware
and software tools to create a comprehensive and efficient methodology. The
hardware setup includes an infrared thermal camera to monitor temperature variations
during germination, germination paper to provide a consistent medium, and a
controlled germination chamber to maintain optimal environmental conditions. These
tools enable the precise collection of thermal images and environmental data, which
are essential for accurate predictions. The software aspect involves using Jupyter
Notebook for data preprocessing, feature extraction, and model training, leveraging
Python libraries like OpenCV, pandas, and scikit-learn. Additionally, Streamlit is
employed to develop an interactive web application, allowing users to input new data
and visualize predictions in real-time.
10
TFT full view color screen display.
Dual MCU processors, refresh faster.
Users may navigate to the hot and cold spots in a thermal picture by using the hot and
cold point temperature indicators.
Selectable palette
hybrid imaging using both infrared and visible data.
6
Images capture saved in SD.
Connecting PC with USB can read the image stored in instrument directly.
The primary criterion used to assess a seed's viability, vigour, and outward
appearance is its germination count. With seed germination paper, the optimal
moisture content is ensured for the greatest range of seeds and germination forms.
3.2.2. Streamlit
With the help of the open-source Python module Streamlit, developers can quickly
create eye-catching user interfaces. The simplest method for integrating code into a
web site, particularly for those without any prior front-end experience, is to use
Streamlit: There is no need for prior front-end (html, js, css) expertise or experience.
A free and open-source framework called Streamlit makes it easy to create and
distribute stunning online applications for data science and machine learning. It is a
Python-based library created with machine learning developers in mind. online
developers are neither data scientists or machine learning engineers, and they have
little interest in spending weeks learning how to create online applications using
these frameworks. As long as the tool can present data and gather the necessary
parameters for modelling, they would prefer something simpler to understand and
operate. Streamlit helps you to develop a stunning-looking application with only a
few lines of code.
4. Block Diagram:
The block diagram for seed germination prediction using machine learning outlines a
systematic workflow integrating both hardware and software components.
Initially, the hardware setup includes an infrared thermal camera capturing
temperature profiles of seeds on germination paper within a controlled germination
chamber, ensuring consistent environmental conditions. The captured thermal images
and environmental data are then fed into the preprocessing block, where image
processing techniques extract relevant features and synchronize them with
environmental parameters. These preprocessed features are input into the machine
learning model development block, where algorithms like decision trees, random
forests, and neural networks are trained and validated using labelled data. The final
block involves deploying the trained model via a Streamlit-based interactive web
application, allowing users to input new data, visualize predictions, and make
informed decisions about seed germination. This comprehensive diagram illustrates
the seamless integration of hardware data acquisition with advanced machine
learning techniques and user-friendly software interfaces.
To conduct the seed germination process, the methodology involves several steps.
Firstly, three germination papers are acquired and thoroughly wetted to ensure
adequate moisture for seed germination. Next, soybean seeds are carefully placed
onto two of the wetted germination papers, with appropriate spacing between each
seed. These seeds are then covered with the third germination paper, forming a
sandwich-like structure to promote germination. Subsequently, the prepared seeds are
left undisturbed for a duration of 8 days to initiate the germination process under
controlled environmental conditions. After the incubation period, thermal images of
the soybean seeds are captured using specialised thermal imaging equipment. These
thermal images serve as valuable data to discern the germination status of the seeds.
The next step involves organising the thermal images into a comprehensive dataset,
meticulously labelling each image according to the germination status of the
corresponding seed—whether germinated or dead. This dataset compilation is crucial
for training and testing deep learning algorithms. Finally, deep learning algorithms,
such as convolutional neural networks (CNNs), are employed to classify the seeds
based on the thermal images. Through meticulous training, validation, and testing
processes, the deep learning model can effectively distinguish between germinated
and dead seeds with high accuracy. This comprehensive methodology ensures a
systematic approach to analyse and classify soybean seeds' germination status,
offering valuable insights for agricultural research and practice.
5. Design and Implementation:
The design and implementation of a seed germination prediction system using
machine learning involves integrating thermal imaging systems, both active and
passive, to capture comprehensive temperature profiles of seeds during the
germination process. In the design phase, we employ a thermal imaging system
where infrared cameras continuously monitor the seeds on germination paper within a
controlled environment. Both active thermal imaging, which involves external heat
sources, and passive thermal imaging, relying on natural heat emission from the
seeds, are utilized to ensure accurate data collection. The processing of thermal
images includes extracting relevant features that reflect germination activity, such as
temperature patterns and heat distribution. Factors affecting the quality of thermal
images, such as camera resolution, environmental stability, and seed placement, are
meticulously managed to maintain data integrity. In the implementation phase, these
processed thermal images, along with environmental parameters, are fed into
machine learning models to predict germination outcomes. The system leverages
software tools like Jupyter Notebook for data processing and model training, and
Streamlit for developing an interactive application, thus providing a robust and user-
friendly platform for accurate seed germination prediction.
5.1 Design :
In order for thermal imaging systems to be used practically, the target object's
temperature has to vary from its surroundings. This means that before the imaging
begins, the item must be either an energy source or a sink, or it must be heated or
cooled. Furthermore, it is crucial
To take into account the possibility that the target items' temperature may fluctuate
while being observed. Essentially, there are two methods for employing thermal
imaging to discern objects: either using the varying heat conductivities or capacities
of the object materials, or using differing emissivities.
Infrared light emitted from two objects can be used to directly differentiate them if
their inherent emissivity coefficients differ. As an alternative, covering the items
with a heat pulse and because of the variations in their heat conductivities, it is
possible to distinguish between things by monitoring the radiation emitted from
them.
The true potential of any imaging system lies in its ability to provide quantitative data
on the spatial variation of the measured quantity. Thermographs can independently
describe various characteristics of the imaged objects or indirectly link thermal data
to hidden attributes. To accurately evaluate surface temperature data captured in
thermal imaging, one must understand all the processes that can affect an object's
temperature. A thermograph taken by a thermal camera can be analyzed using various
image processing techniques. The choice of technique for thermal imaging data
analysis depends on the application's primary goal and the required speed of decision-
making. In industrial settings, the processing method for thermal image analysis must
be quick and straightforward to enable real-time task completion. Current thermal
imaging cameras come pre-programmed with processing methods to estimate the
temperature of target objects in the scene.
5.2 Implementation
The thick paper towel and the thin, moist paper towel are layered over the seeds, and
the edges of the paper towels are squeezed together to seal them.To prevent the seed
from rolling out, the towels' bottoms are folded. I've rolled up the towel. In addition
to making handling easier, this prevents the towel from drying out. For identifying
purposes, the towel has the seed lot number in writing. Pencils designed specifically
to write on damp paper are used.A germination chamber, equipped with a humidifier
to control relative humidity, regulates light, temperature, and humidity levels. The
temperature settings within the germination chamber are adjusted based on the
optimal germination temperatures for the specific plant species being tested.
Temperature and humidity levels inside the chamber are monitored and recorded
throughout the germination process.
The rolled paper towels are placed plastic sacks to prevent drying, placed in a
germination chamber with the proper temperature, humidity and light.The towels are
taken out of the chamber after the designated period of time, which is usually one
week.The paper towels are unrolled, completely uncovering the germinating
seedlings. Ungerminated seeds and germinated seedlings are separated and counted,
and the germination percentage is calculated.
6.1 Experimentation :
Soybean seeds were positioned on germination paper so that various germination
stages could be photographed with infrared thermal imaging. To provide a sharp
contrast between the developing radicle and the background, the seeds were put on a
black fabric during the thermal picture capture. To lessen the impact of water
evaporation and the consequent drying out of the seeds, the cloth was moistened with
tap water and the germination paper was covered with butter paper. Reflections from
the camera setup were among the reflections that the lid occasionally produced. Each
picture was taken in an office at room temperature using the same artificial light
source. For this purpose, in a single batch of germination 10 seed samples were put
for germination. Seed Germination required total 8 days to get fully germinated but
on-wards 6th day we are able to see that seed is germinating. Seed Germination was
captured over a time span of 1 days with a resolution of 33x33 (1089 pixels). First we
have captured initial images of seed on day 1 before putting for germination than on-
wards 6th day we have captured images everyday till 9th day.
For this purpose, in a single batch of germination 10 seed samples were put for
germination. Seed Germination required total 8 days to get fully germinated but on-
wards 6th day we are able to see that seed is germinating. Seed Germination was
captured over a time span of 1 days with a resolution of 33x33 (1089 pixels). First we
have captured initial images of seed on day 1 before putting for germination than on-
wards 6th day we have captured images everyday till 9th day.
6.2 Results:
6.2.1 Experiment Analysis:
Total numbers of Seeds Germinated Dead Seeds
360 350 10
Preprocessing:
Data Splitting: The dataset is divided into training and testing sets, ensuring a
balanced representation of both classes.
Model Training:
1. Support Vector Machine (SVM):
Initialized an SVM model with suitable hyperparameters (e.g., linear or RBF kernel).
Trained the SVM model on the preprocessed training data.
2. Random Forest:
Evaluation:
7. Conclusion :
9% Overall Similarity
Top sources found in the following databases:
8% Internet database 8% Publications database
Crossref database Crossref Posted Content database
0% Submitted Works database
TOP SOURCES
The sources with the highest number of matches within the submission. Overlapping sources will not be
displayed.
hdl.handle.net
1 4%
Internet
ncbi.nlm.nih.gov
2 1%
Internet
link.springer.com
3 1%
Internet
ijisae.org
5 <1%
Internet
mextech.co.in
6 <1%
Internet
Yuval Nehoshtan, Elad Carmon, Omer Yaniv, Sharon Ayal, Or Rotem. "R...
7 <1%
Crossref
mdpi.com
8 <1%
Internet
Sources overview
Similarity Report ID: oid:3618:59919957
htcinstrument.com
10 <1%
Internet
Sources overview
Powerpoint Presenatation
156
157
158
159
160
161
CD
162