CART Regression Model: Detailed Explanation
1. Introduction
CART (Classification and Regression Trees) is a decision tree algorithm used for both
classification and regression tasks. For regression, CART builds a binary tree that
recursively splits the data into subsets, choosing each split to minimize prediction error
as measured by Mean Squared Error (MSE).
2. Important Terms
• Node: A point in the tree where the data is split.
• Root Node: The topmost node representing the entire dataset.
• Leaf Node: A terminal node that outputs a prediction: the mean of the target values of the samples that reach it.
• Split Point: The feature threshold used to divide the data at a node.
• MSE (Mean Squared Error): The average squared difference between actual and predicted values.
• MSR (Mean Squared Residual): Equivalent to MSE in the context of CART regression; the error incurred by using the node's mean as the prediction.
• Weighted MSE: The MSEs of the child nodes, averaged with weights proportional to the number of samples in each child.
3. Steps to Build a CART Regression Tree
1. Start with all data at the root.
2. For each feature, evaluate all candidate split points of the form feature ≤ threshold (typically the midpoints between consecutive sorted values).
3. For each split, calculate the MSE for left and right nodes.
4. Compute the weighted MSE of the split:
Weighted MSE = (n_left / n_total) * MSE_left + (n_right / n_total) * MSE_right
5. Choose the split with the lowest weighted MSE.
6. Repeat recursively for each child node until a stopping condition is met (e.g., a minimum
number of samples per leaf or a maximum depth).
7. The prediction at each leaf is the mean of the target values in that leaf. A code sketch of this procedure follows.
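The split search in steps 2-5 can be written in a few lines of Python. This is a minimal sketch for a single feature, not a full CART implementation; the helper names node_mse and best_split are illustrative, not from any library:

    import numpy as np

    def node_mse(y):
        # Error of a node when it predicts the mean of its targets (steps 3 and 7).
        return float(np.mean((y - y.mean()) ** 2)) if len(y) else 0.0

    def best_split(x, y):
        # Greedy search over candidate thresholds for one feature (steps 2-5).
        # Candidates are midpoints between consecutive distinct sorted x values.
        order = np.argsort(x)
        x, y = x[order], y[order]
        n = len(y)
        best_t, best_score = None, float("inf")
        for i in range(1, n):
            if x[i] == x[i - 1]:
                continue  # identical values cannot be separated by a threshold
            t = (x[i] + x[i - 1]) / 2
            left, right = y[:i], y[i:]
            score = (i / n) * node_mse(left) + ((n - i) / n) * node_mse(right)
            if score < best_score:
                best_t, best_score = t, score
        return best_t, best_score

    x = np.array([1, 2, 3, 4, 5], dtype=float)
    y = np.array([50, 55, 65, 70, 75], dtype=float)
    print(best_split(x, y))  # best threshold 2.5 with weighted MSE ≈ 12.5 (see Section 4)

Applying a full CART build would simply call best_split recursively on the left and right subsets until the stopping condition in step 6 is reached.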
4. Example
Dataset:
Hours Studied: [1, 2, 3, 4, 5]
Test Scores: [50, 55, 65, 70, 75]
Candidate Split at hours ≤ 2.5:
Left Node: [50, 55], Mean = 52.5, MSE = ((50 - 52.5)² + (55 - 52.5)²) / 2 = 6.25
Right Node: [65, 70, 75], Mean = 70, MSE = ((65 - 70)² + (70 - 70)² + (75 - 70)²) / 3 ≈ 16.67
Weighted MSE = (2/5) * 6.25 + (3/5) * 16.67 = 2.5 + 10 = 12.5
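These numbers can be double-checked with a few lines of NumPy (a quick sanity check, not part of the algorithm itself):

    import numpy as np

    left = np.array([50.0, 55.0])
    right = np.array([65.0, 70.0, 75.0])
    mse_left = np.mean((left - left.mean()) ** 2)     # 6.25
    mse_right = np.mean((right - right.mean()) ** 2)  # 50/3 ≈ 16.67
    weighted = (2 / 5) * mse_left + (3 / 5) * mse_right
    print(weighted)  # 12.5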
5. Prediction for Exact Match (e.g., 2.5 hours)
If the input value is exactly equal to the split point (e.g., 2.5), it goes to the left node
because of the ≤ condition.
So an input of 2.5 hours yields a prediction of 52.5, the left node's mean.
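scikit-learn's DecisionTreeRegressor follows the same feature ≤ threshold convention, so this behavior can be confirmed directly (assuming scikit-learn is installed):

    import numpy as np
    from sklearn.tree import DecisionTreeRegressor

    X = np.array([[1], [2], [3], [4], [5]], dtype=float)
    y = np.array([50, 55, 65, 70, 75], dtype=float)

    # A depth-1 tree keeps only the single best split, hours <= 2.5 here.
    tree = DecisionTreeRegressor(max_depth=1).fit(X, y)
    print(tree.tree_.threshold[0])  # 2.5
    print(tree.predict([[2.5]]))    # [52.5] -- 2.5 satisfies <= 2.5, so it goes left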
6. Drawbacks of CART Regression
• Greedy Algorithm: Chooses the locally optimal split at each node, so it may miss a better global tree structure.
• High Variance: Small changes in the data can produce very different trees.
• Overfitting: May grow complex trees that fit noise in the training data.
• Stepwise Prediction: Predictions are piecewise constant, so smooth relationships are modeled poorly.
• Bias Toward Features with Many Values: Features with more unique values offer more candidate splits and are therefore favored.
• Complexity: Large trees can be hard to interpret.