0% found this document useful (0 votes)

36 views18 pages

Rudransh Lam X - C Palmers Penguins Case Study

Uploaded by

rudranshlamba2020

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

36 views18 pages

Rudransh Lam X - C Palmers Penguins Case Study

Uploaded by

rudranshlamba2020

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

ORANGE DATA MINING NOCODE

TOOL

CASE STUDY PALMER PENGUINS MODEL

Rudransh lamba
X-C
Roll no. 26

1. Problem Scoping
Who
Who are the stakeholders?
Research communities and data scientists of Antarctica.

What do you know about them?

They collect data on different species of penguins.

What
What is the problem?
It is difficult to identify the species of some Palmer
Penguins.
How do you know that it is a problem?
Environmental – Harsh climate and remote, icy terrain.
Biological – Migration, nesting behaviour, and stress
from human contact.
Logistical – Limited access to islands and short research
seasons.
Data gaps – Missing values in the dataset itself point to
real-world collection difficulties and limits.

Where
What is the context/situation in which the stakeholders
experience the problem?
Collecting data from the remote continent of Antarctica.
Where is the problem located?
In Antarctica.

Why
Why will this solution be of value to the stakeholders?
The solution will help predict the species of Palmer
Penguins from the collected data.

How will the situation improve their situation?

They can study the data without being in the harsh
climatic conditions on Antarctica.

Our Research Who

Community
Has a problem It is difficulty to What
that identify the
species of some
Palmer Penguins
When / while Collecting data Where
from the remote
continent of
Antarctica
An ideal solution Predict the Why
would species of
Palmer Penguins
from the
collected data

2. Data Acquisition
Data acquired from
[Link]
ilTyUhmUv4DWT1BFsaCoQ2BmF
[Link]
study-palmer-penguins

3. Data Exploration
- Opening ODM tool
-Insert training and testing data files from google drive
link
- Notice Missing values

- Insert Feature Statistics Widget and connect output of

Train Data to Input of Feature statistics
- Insert Impute widget and connect to Train data

- Remove instances with unknown values

- Connect Feature Statistics to Impute widget

Now the data is clean and without any missing values.

- We need to change the Feature type for species, from
Categorical Feature to Categorical Label.
Add Select columns and connect to Impute widget
- Drag species feature to Target box

- Splitting the data

Insert Data Sampler and connect to Select Columns
- Insert Data info and connect to Data Sampler

- Insert another Data info and connect to Data Sampler

- Double click on Data info(1)

The data has been split now.

4. Modelling and Evaluation
- Insert Test and Score and connect to Data Sampler

- Insert Tree widget and connect to the input of Test and

Score
- Connect Widget Data Sampler to Test and Score again

-Double click on the connection made and Disconnect

Test data from Data sample and connect it with
Remaining Data.
Evaluation
These are the evaluation results for the model.
- Evaluating using another model, Insert Random Forest
widget and connect to Test and Score.

5. Prediction
- Connect Predictions widget with Test Data Output

- Connect Random Forest widget to Prediction widget

(The connection is dotted because we are not feeding it
data yet)
- Connect Data sampler to Random Forest model(The
connecting is now normal)

These are all the predictions made by Random Forest

- We can also connect Data Sampler to Tree model and
connect Tree model to Predictions (Using two models at
the same time)
Final predictions made by both models (Random Forest
and Tree)

ORANGE DATA MINING Steps
No ratings yet
ORANGE DATA MINING Steps
36 pages
Statistical Data Project - Palmer Penguin Analysis
100% (3)
Statistical Data Project - Palmer Penguin Analysis
23 pages
Penguin Species Classification App
No ratings yet
Penguin Species Classification App
27 pages
Palmer Penguin
No ratings yet
Palmer Penguin
50 pages
Project Report - Penguins
No ratings yet
Project Report - Penguins
19 pages
Random Forest Classifier in Python
No ratings yet
Random Forest Classifier in Python
15 pages
Visualization Project in R
No ratings yet
Visualization Project in R
15 pages
Data Mining
No ratings yet
Data Mining
31 pages
Second Practical Assignment 2024
No ratings yet
Second Practical Assignment 2024
5 pages
Forest Cover Prediction Report
No ratings yet
Forest Cover Prediction Report
21 pages
Ai Final Project File by Yogesh Xii - B.docx Reedited
100% (2)
Ai Final Project File by Yogesh Xii - B.docx Reedited
28 pages
Kaylan Cemelli A Tangled Web Workbook
No ratings yet
Kaylan Cemelli A Tangled Web Workbook
26 pages
Kunal DSML Laboratory Record - Format
No ratings yet
Kunal DSML Laboratory Record - Format
58 pages
Lecture Slides Slides 9
No ratings yet
Lecture Slides Slides 9
2 pages
Image Classification for Rare Animals
No ratings yet
Image Classification for Rare Animals
8 pages
AI Practical File Yogesh1
No ratings yet
AI Practical File Yogesh1
25 pages
Simulating Salamander Data with JAGS
No ratings yet
Simulating Salamander Data with JAGS
4 pages
Ai Record Programs
No ratings yet
Ai Record Programs
34 pages
Programs
No ratings yet
Programs
18 pages
ENM Tutorial
No ratings yet
ENM Tutorial
64 pages
Lab 20
No ratings yet
Lab 20
4 pages
Data Science Practicals
No ratings yet
Data Science Practicals
47 pages
Bi 5to 8
No ratings yet
Bi 5to 8
6 pages
Llewelyn Et Al., 2023
No ratings yet
Llewelyn Et Al., 2023
17 pages
Random Forest Thesis
100% (3)
Random Forest Thesis
6 pages
Bird Species
No ratings yet
Bird Species
60 pages
Logistic Regression on Iris Dataset
No ratings yet
Logistic Regression on Iris Dataset
7 pages
Mythika Project
No ratings yet
Mythika Project
13 pages
L3 - Classification - RandomForest - Jupyter Notebook
No ratings yet
L3 - Classification - RandomForest - Jupyter Notebook
6 pages
Data Mining Lab Manual
No ratings yet
Data Mining Lab Manual
8 pages
Xii STD Practical 1 (1) 1
No ratings yet
Xii STD Practical 1 (1) 1
22 pages
Powdery Mildew Prediction in Sandalwood Trees
No ratings yet
Powdery Mildew Prediction in Sandalwood Trees
11 pages
Decisiontree 1
No ratings yet
Decisiontree 1
10 pages
3 Text
No ratings yet
3 Text
2 pages
English Boss
No ratings yet
English Boss
4 pages
Estimating Species Richness Methods
No ratings yet
Estimating Species Richness Methods
92 pages
Animal Species Prediction Using Machine Learning
No ratings yet
Animal Species Prediction Using Machine Learning
10 pages
Package Spaa': R Topics Documented
No ratings yet
Package Spaa': R Topics Documented
32 pages
Untitled Document
No ratings yet
Untitled Document
8 pages
Bird Species ID Using Deep Learning
No ratings yet
Bird Species ID Using Deep Learning
47 pages
Python Programming Lab Assignments
No ratings yet
Python Programming Lab Assignments
18 pages
Python Feature Engineering Guide
No ratings yet
Python Feature Engineering Guide
27 pages
Datamining 2
No ratings yet
Datamining 2
54 pages
Zipkin al2010.MultiSpOccurrModelEvaluatEffectConservManagActions s1
No ratings yet
Zipkin al2010.MultiSpOccurrModelEvaluatEffectConservManagActions s1
3 pages
Penguin Prey Selection Analysis Lab
No ratings yet
Penguin Prey Selection Analysis Lab
8 pages
Ds 1 DW1
No ratings yet
Ds 1 DW1
2 pages
Day 2
No ratings yet
Day 2
5 pages
Assign Men 4
No ratings yet
Assign Men 4
12 pages
Ecosystems Module: Interactive Learning
No ratings yet
Ecosystems Module: Interactive Learning
68 pages
Hepinstall and Sader 1997. Photogrammetric Engineering & Remote Sensing
No ratings yet
Hepinstall and Sader 1997. Photogrammetric Engineering & Remote Sensing
8 pages
DATAMINING
No ratings yet
DATAMINING
24 pages
DM Lab Manual
No ratings yet
DM Lab Manual
5 pages
CS102 Final Exam Overview
No ratings yet
CS102 Final Exam Overview
19 pages
Introduction to Machine Learning with Scikit-Learn
No ratings yet
Introduction to Machine Learning with Scikit-Learn
2 pages
Tutorial 6
No ratings yet
Tutorial 6
8 pages
Ecology Habitable Planet Lab: Directions
75% (4)
Ecology Habitable Planet Lab: Directions
12 pages
Pandas - Basics - Practice: Consider The Following Python Dictionary Data and Python List Labels
No ratings yet
Pandas - Basics - Practice: Consider The Following Python Dictionary Data and Python List Labels
7 pages
Datamining
No ratings yet
Datamining
20 pages
SpadeR User's Guide for Biodiversity Analysis
No ratings yet
SpadeR User's Guide for Biodiversity Analysis
89 pages
Chapter 7 - Estimation Single Population
No ratings yet
Chapter 7 - Estimation Single Population
43 pages
NACLIN 2022 25 National Convention On Knowledge, Library and Information Networking December 14-16, 2022
No ratings yet
NACLIN 2022 25 National Convention On Knowledge, Library and Information Networking December 14-16, 2022
19 pages
Approval Sheet
No ratings yet
Approval Sheet
8 pages
Marketing Management Book 1ST Sem Mba
No ratings yet
Marketing Management Book 1ST Sem Mba
271 pages
Dr. Yashashwini and Neelima Scopus Paper Publication 1
No ratings yet
Dr. Yashashwini and Neelima Scopus Paper Publication 1
16 pages
Glossary of Terms and Symbols Used in Pharmacology
No ratings yet
Glossary of Terms and Symbols Used in Pharmacology
55 pages
WBS Undergraduate Exchange Module Guide
No ratings yet
WBS Undergraduate Exchange Module Guide
185 pages
Peatland Survey & Conservation Plan
0% (1)
Peatland Survey & Conservation Plan
32 pages
Legaspina - Thesis Research - Muhon
No ratings yet
Legaspina - Thesis Research - Muhon
8 pages
Analyzing Arguments in Manifestoes
No ratings yet
Analyzing Arguments in Manifestoes
4 pages
Glenn Parker Team Player Survey Guide
100% (1)
Glenn Parker Team Player Survey Guide
29 pages
3857 7309 1 SM PDF
No ratings yet
3857 7309 1 SM PDF
11 pages
Statistical Methods: Hypothesis Testing Assignment
No ratings yet
Statistical Methods: Hypothesis Testing Assignment
8 pages
Platz 2022 Learning With Serious Games in Economics Education A Systematic Review of The
No ratings yet
Platz 2022 Learning With Serious Games in Economics Education A Systematic Review of The
14 pages
Women's Experiences With Postp
No ratings yet
Women's Experiences With Postp
14 pages
Gould Voelker 2010 J Spa
No ratings yet
Gould Voelker 2010 J Spa
16 pages
Grade 8 Dragon's Den Business Project
No ratings yet
Grade 8 Dragon's Den Business Project
10 pages
5 Mcqs Sampling
No ratings yet
5 Mcqs Sampling
2 pages
Omond Solandt: Canadian Scientist and Defence Leader
No ratings yet
Omond Solandt: Canadian Scientist and Defence Leader
3 pages
Experience With Wheat Flour Reference Material
No ratings yet
Experience With Wheat Flour Reference Material
4 pages
HRM Insights for Project Leaders
No ratings yet
HRM Insights for Project Leaders
19 pages
It Appendices
No ratings yet
It Appendices
7 pages
Applied Theatre's Impact on Seniors
No ratings yet
Applied Theatre's Impact on Seniors
5 pages
Megersa
No ratings yet
Megersa
13 pages
UpGrad Live: Advanced Regression Techniques
No ratings yet
UpGrad Live: Advanced Regression Techniques
18 pages
Landslide Assessment on Gohatsion Road
No ratings yet
Landslide Assessment on Gohatsion Road
27 pages
CATC DL Ch02 Digital Skills
No ratings yet
CATC DL Ch02 Digital Skills
14 pages
Rguhs Thesis Topics in Paediatrics
100% (3)
Rguhs Thesis Topics in Paediatrics
7 pages
Derived Relational Responding Applications For Learners With Autism and Other Developmental Disabilities: A Progressive Guide To Change
100% (19)
Derived Relational Responding Applications For Learners With Autism and Other Developmental Disabilities: A Progressive Guide To Change
23 pages
MPH Course Admission Notification 2019
No ratings yet
MPH Course Admission Notification 2019
2 pages

Rudransh Lam X - C Palmers Penguins Case Study

Uploaded by

Rudransh Lam X - C Palmers Penguins Case Study

Uploaded by

ORANGE DATA MINING NOCODE

CASE STUDY PALMER PENGUINS MODEL

What do you know about them?

How will the situation improve their situation?

Our Research Who

- Insert Feature Statistics Widget and connect output of

- Remove instances with unknown values

Now the data is clean and without any missing values.

- Splitting the data

- Insert another Data info and connect to Data Sampler

The data has been split now.

- Insert Tree widget and connect to the input of Test and

-Double click on the connection made and Disconnect

- Connect Random Forest widget to Prediction widget

These are all the predictions made by Random Forest

You might also like