RANDOM FOREST CLASSIFIER
QUESTION 1:
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
# Generate a synthetic binary classification dataset with 3 informative features
X, y = make_classification(n_samples=1000, n_features=3,
                           n_informative=3, n_redundant=0,
                           n_classes=2, random_state=42)
# Hold out 30% of the samples for testing
X_train, X_test, y_train, y_test = train_test_split(X, y,
                                                    test_size=0.3,
                                                    random_state=42)
# Train a Random Forest with 100 trees and evaluate on the test set
clf = RandomForestClassifier(n_estimators=100, random_state=42)
clf.fit(X_train, y_train)
y_pred = clf.predict(X_test)
print("Accuracy on Test Set: ", accuracy_score(y_test, y_pred))
OUTPUT: Accuracy on Test Set: 0.9333333333333333
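Accuracy alone can hide per-class behaviour. As an optional follow-up (not part of the original exercise), a minimal sketch that assumes the same synthetic data, split, and model settings as above prints per-class precision and recall with sklearn's classification_report:
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import classification_report
# Recreate the same synthetic data and split as in the exercise above (assumed settings)
X, y = make_classification(n_samples=1000, n_features=3,
                           n_informative=3, n_redundant=0,
                           n_classes=2, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y,
                                                    test_size=0.3,
                                                    random_state=42)
clf = RandomForestClassifier(n_estimators=100, random_state=42)
clf.fit(X_train, y_train)
# Per-class precision, recall and F1 on the held-out test set
print(classification_report(y_test, clf.predict(X_test)))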
QUESTION 2:
from sklearn.datasets import load_iris
import pandas as pd
# Load the Iris dataset
data = load_iris()
df = pd.DataFrame(data.data, columns=data.feature_names)
df['species'] = data.target
print(df.head())
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
# Load the Iris dataset and split it 80/20 into train and test sets
data = load_iris()
X = data.data
y = data.target
X_train, X_test, y_train, y_test = train_test_split(X, y,
                                                    test_size=0.2,
                                                    random_state=42)
# Train a Random Forest with 100 trees
clf = RandomForestClassifier(n_estimators=100, random_state=42)
clf.fit(X_train, y_train)
y_pred = clf.predict(X_test)
# Evaluate the model
print("Accuracy on Test Set: ", accuracy_score(y_test, y_pred))
OUTPUT:
   sepal length (cm)  sepal width (cm)  petal length (cm)  petal width (cm)  species
0                5.1               3.5                1.4               0.2        0
1                4.9               3.0                1.4               0.2        0
2                4.7               3.2                1.3               0.2        0
3                4.6               3.1                1.5               0.2        0
4                5.0               3.6                1.4               0.2        0
Accuracy on Test Set: 1.0
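As an optional extra (not part of the original exercise), the fitted forest exposes impurity-based feature importances through its feature_importances_ attribute, which shows which Iris measurements drive the predictions. A minimal sketch, assuming the same Iris split and model settings as above:
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
# Recreate the same Iris split as in the exercise above (assumed settings)
data = load_iris()
X_train, X_test, y_train, y_test = train_test_split(data.data, data.target,
                                                    test_size=0.2,
                                                    random_state=42)
clf = RandomForestClassifier(n_estimators=100, random_state=42)
clf.fit(X_train, y_train)
# Impurity-based importance of each feature; the values sum to 1.0
for name, importance in zip(data.feature_names, clf.feature_importances_):
    print(f"{name}: {importance:.3f}")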
QUESTION 3:
from sklearn.datasets import load_breast_cancer
import pandas as pd
# Load the Breast Cancer dataset
data = load_breast_cancer()
df = pd.DataFrame(data.data, columns=data.feature_names)
df['target'] = data.target
print(df.head())
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
# Load the Breast Cancer dataset and split it 80/20 into train and test sets
data = load_breast_cancer()
X = data.data
y = data.target
X_train, X_test, y_train, y_test = train_test_split(X, y,
                                                    test_size=0.2,
                                                    random_state=42)
# Train a Random Forest with 100 trees
clf = RandomForestClassifier(n_estimators=100, random_state=42)
clf.fit(X_train, y_train)
y_pred = clf.predict(X_test)
# Evaluate the model
print("Accuracy on Test Set: ", accuracy_score(y_test, y_pred))
OUTPUT:
   mean radius  mean texture  mean perimeter  mean area  mean smoothness
0        17.99         10.38          122.80     1001.0          0.11840
1        20.57         17.77          132.90     1326.0          0.08474
2        19.69         21.25          130.00     1203.0          0.10960
3        11.42         20.38           77.58      386.1          0.14250
4        20.29         14.34          135.10     1297.0          0.10030

   mean compactness  mean concavity  mean concave points  mean symmetry
0           0.27760          0.3001              0.14710         0.2419
1           0.07864          0.0869              0.07017         0.1812
2           0.15990          0.1974              0.12790         0.2069
3           0.28390          0.2414              0.10520         0.2597
4           0.13280          0.1980              0.10430         0.1809

   mean fractal dimension  ...  worst texture  worst perimeter  worst area
0                 0.07871  ...          17.33           184.60      2019.0
1                 0.05667  ...          23.41           158.80      1956.0
2                 0.05999  ...          25.53           152.50      1709.0
3                 0.09744  ...          26.50            98.87       567.7
4                 0.05883  ...          16.67           152.20      1575.0

   worst smoothness  worst compactness  worst concavity  worst concave points
0            0.1622             0.6656           0.7119                0.2654
1            0.1238             0.1866           0.2416                0.1860
2            0.1444             0.4245           0.4504                0.2430
3            0.2098             0.8663           0.6869                0.2575
4            0.1374             0.2050           0.4000                0.1625

   worst symmetry  worst fractal dimension  target
0          0.4601                  0.11890       0
1          0.2750                  0.08902       0
2          0.3613                  0.08758       0
3          0.6638                  0.17300       0
4          0.2364                  0.07678       0

[5 rows x 31 columns]
Accuracy on Test Set: 0.9649122807017544
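A confusion matrix shows how the remaining errors split between malignant and benign cases. A minimal sketch (an addition, assuming the same Breast Cancer split and model settings as above):
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import confusion_matrix
# Recreate the same split as in the exercise above (assumed settings)
data = load_breast_cancer()
X_train, X_test, y_train, y_test = train_test_split(data.data, data.target,
                                                    test_size=0.2,
                                                    random_state=42)
clf = RandomForestClassifier(n_estimators=100, random_state=42)
clf.fit(X_train, y_train)
# Rows are true classes (0 = malignant, 1 = benign), columns are predictions
print(confusion_matrix(y_test, clf.predict(X_test)))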
QUESTION 4:
from sklearn.datasets import load_wine
import pandas as pd
# Load the Wine dataset
data = load_wine()
df = pd.DataFrame(data.data, columns=data.feature_names)
df['target'] = data.target
print(df.head())
from sklearn.datasets import load_wine
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
# Load the Wine dataset and split it 80/20 into train and test sets
data = load_wine()
X = data.data
y = data.target
X_train, X_test, y_train, y_test = train_test_split(X, y,
                                                    test_size=0.2,
                                                    random_state=42)
# Train a Random Forest with 100 trees and evaluate on the test set
clf = RandomForestClassifier(n_estimators=100, random_state=42)
clf.fit(X_train, y_train)
y_pred = clf.predict(X_test)
print("Accuracy on Test Set: ", accuracy_score(y_test, y_pred))
OUTPUT:
   alcohol  malic_acid   ash  alcalinity_of_ash  magnesium  total_phenols
0    14.23        1.71  2.43               15.6      127.0           2.80
1    13.20        1.78  2.14               11.2      100.0           2.65
2    13.16        2.36  2.67               18.6      101.0           2.80
3    14.37        1.95  2.50               16.8      113.0           3.85
4    13.24        2.59  2.87               21.0      118.0           2.80

   flavanoids  nonflavanoid_phenols  proanthocyanins  color_intensity   hue
0        3.06                  0.28             2.29             5.64  1.04
1        2.76                  0.26             1.28             4.38  1.05
2        3.24                  0.30             2.81             5.68  1.03
3        3.49                  0.24             2.18             7.80  0.86
4        2.69                  0.39             1.82             4.32  1.04

   od280/od315_of_diluted_wines  proline  target
0                          3.92   1065.0       0
1                          3.40   1050.0       0
2                          3.17   1185.0       0
3                          3.45   1480.0       0
4                          2.93    735.0       0
Accuracy on Test Set: 1.0
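A perfect hold-out score can depend on the particular train/test split. As an optional check (not part of the original exercise), 5-fold cross-validation gives a more robust estimate; a minimal sketch using the same model settings:
from sklearn.datasets import load_wine
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
data = load_wine()
clf = RandomForestClassifier(n_estimators=100, random_state=42)
# 5-fold cross-validated accuracy over the whole Wine dataset
scores = cross_val_score(clf, data.data, data.target, cv=5)
print("Cross-validated accuracy:", scores.mean())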