Python Linear Regression Guide

This document provides a cheat sheet for linear regression modeling in Python. It outlines the key steps, which include importing data and modeling libraries, preparing and visualizing the data, splitting the data into training and test sets, fitting a linear regression model to the training data, making predictions on the test data, and evaluating the model's performance using metrics like MAE, MSE, and RMSE. Code examples are provided for each step to demonstrate how to implement linear regression in Python.

Uploaded by

himtajay

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views1 page

Python Linear Regression Guide

Uploaded by

himtajay

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 1

Python - Linear Regression Model Cheat Sheet

by DarioPittera (aggialavura) via cheatography.com/83764/cs/19917/

TO START TRAIN MODEL (cont)

# IMPORT DATA LIBRARIES lm.coef_ show coefficients

import pandas as pd coeff_df = pd.DataFrame create coeff df
import numpy as np (lm.coef_,X.columns,columns=['Coeff'])*
# IMPORT VIS LIBRARIES
pd.DataFrame: pd.DataFrame(data=None, index=None, column‐
import matplotlib.pyplot as plt
s=None, dtype=None, copy=False). data = values, index= name
import seaborn as sns index, columns= name column. This could be useful just to interpret
%matplotlib inline the coefficient of the regression.
# IMPORT MODELLING LIBRARIES
from sklearn.model_selection import MAKE PREDICTIONS
train_test_split
predictions = lm.predict(X_test) create predictions
from sklearn.linear_model import LinearRegression
plt.scatter(y_test,predictions)* plot predictions
from sklearn import metrics
sns.distplot((y_test-predictions),bins=50)* distplot of residuals

PRELIMINARY OPERATIONS scatter: this graph show the difference between actual values and
the values predicted by the model we trained. It should resemble as
df = pd.read_csv('data.csv') read data
much as possible a diagonal line .
df.head() check head df
distplot: this graph shows the distributions of the residual errors, that
df.info() check info df is, the difference between the actual values minus the predicted
df.describe() check stats df values; it should result in an as much as possible normal distribution.
df.columns check col names If not, maybe change model!

VISUALISE DATA EVALUATION METRICS

sns.pairplot(df) pairplot print('MAE:', metrics.mean_absolute_error(y_test, predictions))

sns.distplot(df['Y']) distribution plot print('MSE:', metrics.mean_squared_error(y_test, predictions))

sns.heatmap(df.corr(), annot=True) heatmap with values print('RMSE:', np.sqrt(metrics.mean_squared_error(y_test, predictions))

MAE is the easiest to understand, because it's the average error.

TRAIN MODEL MSE is more popular than MAE, because MSE "punishes" larger
CREATE X and y --------------- errors, which tends to be useful in the real world.
RMSE is even more popular than MSE, because RMSE is interp‐
X = df[['col1','col2',etc.]] create df features
retable in the "y" units.
y = df['col'] create df var to predict
 SPLIT DATASET ---------------
X_train, X_test, y_train, y_test = split df in train and test df
train_test_split(
X,
y,
test_size=0.3)
 FIT THE MODEL ---------------
lm = LinearRegression() instatiate model
lm.fit(X_train, y_train) train/fit the model
 SHOW RESULTS ---------------
lm.intercept_ show intercept

By DarioPittera (aggialavura) Not published yet. Sponsored by CrosswordCheats.com

Last updated 24th June, 2019. Learn to solve cryptic crosswords!
Page 1 of 1. http://crosswordcheats.com

cheatography.com/aggialavura/
www.dariopittera.com

Regression Model
No ratings yet
Regression Model
6 pages
Linear Regression - Cheatsheet
No ratings yet
Linear Regression - Cheatsheet
8 pages
Data Science for Beginners
No ratings yet
Data Science for Beginners
98 pages
Linear Regression - Numpy and Sklearn
No ratings yet
Linear Regression - Numpy and Sklearn
7 pages
Lecture-2 Unit 2
No ratings yet
Lecture-2 Unit 2
56 pages
Zerox Ready
No ratings yet
Zerox Ready
21 pages
Supervised Learning For Data Science...
No ratings yet
Supervised Learning For Data Science...
14 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
30 pages
Day 3 ML
No ratings yet
Day 3 ML
4 pages
Linear Regression Explained
No ratings yet
Linear Regression Explained
8 pages
Python Simple Linear Regression Guide
No ratings yet
Python Simple Linear Regression Guide
14 pages
Simple Linear Regression: Math Behind
0% (1)
Simple Linear Regression: Math Behind
6 pages
ml1 PRG
No ratings yet
ml1 PRG
2 pages
ML LN 3
No ratings yet
ML LN 3
44 pages
Machine Learning 2
No ratings yet
Machine Learning 2
45 pages
22UCS303 DS-Unit IV-LINEAR REGRESSION
No ratings yet
22UCS303 DS-Unit IV-LINEAR REGRESSION
19 pages
Exp 4 - LM
No ratings yet
Exp 4 - LM
5 pages
ML Combined
No ratings yet
ML Combined
254 pages
Linear Regression
No ratings yet
Linear Regression
11 pages
Class Xii PDF For Practical
No ratings yet
Class Xii PDF For Practical
24 pages
Practical # 10
No ratings yet
Practical # 10
5 pages
Generative AI For Models Development
No ratings yet
Generative AI For Models Development
8 pages
Linear Regression
No ratings yet
Linear Regression
6 pages
Practical 5
No ratings yet
Practical 5
8 pages
Simple Linear Regression in Machine Learning
No ratings yet
Simple Linear Regression in Machine Learning
7 pages
Multiple Linear Regression 3
No ratings yet
Multiple Linear Regression 3
68 pages
Machine Learning
No ratings yet
Machine Learning
10 pages
ML Unit
No ratings yet
ML Unit
23 pages
Regression Analysis and Equations
No ratings yet
Regression Analysis and Equations
16 pages
ML Regression for Data Scientists
No ratings yet
ML Regression for Data Scientists
7 pages
Linear Regression - Jupyter Notebook
100% (3)
Linear Regression - Jupyter Notebook
56 pages
EE2211 CheatSheet
No ratings yet
EE2211 CheatSheet
15 pages
ML Lab-3
No ratings yet
ML Lab-3
14 pages
Python Linear Regression Guide
No ratings yet
Python Linear Regression Guide
23 pages
Linear Regression
No ratings yet
Linear Regression
20 pages
Linear Regression3.0
No ratings yet
Linear Regression3.0
24 pages
Cheat Sheet Linear and Logistic Regression
No ratings yet
Cheat Sheet Linear and Logistic Regression
2 pages
ml2020 Pythonlab02
No ratings yet
ml2020 Pythonlab02
3 pages
2.1 ML (Implementation of Simple Linear Regression in Python)
No ratings yet
2.1 ML (Implementation of Simple Linear Regression in Python)
8 pages
Python Pandas and Machine Learning Guide
No ratings yet
Python Pandas and Machine Learning Guide
21 pages
Machine Learning With Python Algorithms
No ratings yet
Machine Learning With Python Algorithms
28 pages
Deepak Data Analysis 1
No ratings yet
Deepak Data Analysis 1
31 pages
CL IV Manual
No ratings yet
CL IV Manual
108 pages
Intro to Machine Learning Basics
No ratings yet
Intro to Machine Learning Basics
132 pages
Regression
No ratings yet
Regression
16 pages
MLR Example 2predictors
No ratings yet
MLR Example 2predictors
5 pages
Lesson 3
No ratings yet
Lesson 3
5 pages
Python Data Analysis Guide
No ratings yet
Python Data Analysis Guide
171 pages
Assignment No.4 - (20-Ele-68)
No ratings yet
Assignment No.4 - (20-Ele-68)
17 pages
R and Python Programming Exercises
100% (1)
R and Python Programming Exercises
24 pages
DS
No ratings yet
DS
31 pages
Import Pandas As PD
No ratings yet
Import Pandas As PD
3 pages
Lecture 3
No ratings yet
Lecture 3
42 pages
C1 W1 Lab03 Model Representation Soln-Copy1
No ratings yet
C1 W1 Lab03 Model Representation Soln-Copy1
7 pages
19BCS2059 DL1
No ratings yet
19BCS2059 DL1
4 pages
An Introduction To Stadistical Learning-129-140-1-8
No ratings yet
An Introduction To Stadistical Learning-129-140-1-8
8 pages
? What Is Regression
No ratings yet
? What Is Regression
12 pages
Linear Regression Model Insights
No ratings yet
Linear Regression Model Insights
5 pages
A Worn Path: Phoenix Jackson's Journey
No ratings yet
A Worn Path: Phoenix Jackson's Journey
4 pages
Effects of La Addition On The Microstructure and Mechanical Properties of Aluminum and Aluminum Alloys
No ratings yet
Effects of La Addition On The Microstructure and Mechanical Properties of Aluminum and Aluminum Alloys
10 pages
Grade 6 Math Daily Lesson Log
No ratings yet
Grade 6 Math Daily Lesson Log
10 pages
Transporting Companies
No ratings yet
Transporting Companies
3 pages
Close-Up B1 Unit 1 Book
100% (1)
Close-Up B1 Unit 1 Book
14 pages
Aloha Boeing 737 Accident
No ratings yet
Aloha Boeing 737 Accident
3 pages
Hecuba: A Tragic Plea for Justice
No ratings yet
Hecuba: A Tragic Plea for Justice
1 page
Y-7 AC/DC Electromagnetic Yoke Data
No ratings yet
Y-7 AC/DC Electromagnetic Yoke Data
2 pages
Carboguard 1209: Selection & Specification Data
100% (2)
Carboguard 1209: Selection & Specification Data
4 pages
Orozco Promethues Sumation
100% (1)
Orozco Promethues Sumation
18 pages
Dungeon Master Adventurer's Handbook
90% (10)
Dungeon Master Adventurer's Handbook
62 pages
BIOLOGY Project Class 12
No ratings yet
BIOLOGY Project Class 12
12 pages
Rich-Mar Autosound 9.6 Operation Manual
No ratings yet
Rich-Mar Autosound 9.6 Operation Manual
28 pages
WireSize Vs Resistance
No ratings yet
WireSize Vs Resistance
1 page
Iom 090
No ratings yet
Iom 090
14 pages
Panasonic Battery Catalogue
No ratings yet
Panasonic Battery Catalogue
7 pages
Cluster Overview: Table 3.1: Basic Information of Nashik District
No ratings yet
Cluster Overview: Table 3.1: Basic Information of Nashik District
5 pages
Turnaround
50% (2)
Turnaround
5 pages
Corporate Story On Tata Motors LTD
No ratings yet
Corporate Story On Tata Motors LTD
27 pages
Daily News
No ratings yet
Daily News
11 pages
Super 1100 Original IIM IPM Interview Questions Percentile Classes
No ratings yet
Super 1100 Original IIM IPM Interview Questions Percentile Classes
123 pages
The 5 Rules of Design Composition and Layout
No ratings yet
The 5 Rules of Design Composition and Layout
4 pages
Transformer Testing Voltage Guide
100% (1)
Transformer Testing Voltage Guide
21 pages
Paul Inspiration and Model Chapter 3
No ratings yet
Paul Inspiration and Model Chapter 3
28 pages
What Is Machine Design
No ratings yet
What Is Machine Design
54 pages
2-Meter Vertical Dipole Array Design
No ratings yet
2-Meter Vertical Dipole Array Design
4 pages
Towards A Complete Validation of The Lattice Scheme in The Hybrid Stress Blasting Model (HSBM)
No ratings yet
Towards A Complete Validation of The Lattice Scheme in The Hybrid Stress Blasting Model (HSBM)
10 pages
2A201 - 474E Spec Test
No ratings yet
2A201 - 474E Spec Test
11 pages
Ch-3 The Making of Global World Notes (History)
No ratings yet
Ch-3 The Making of Global World Notes (History)
6 pages
The 9 Houses of I Ching Astrology
0% (1)
The 9 Houses of I Ching Astrology
2 pages

Python Linear Regression Guide

Uploaded by

Python Linear Regression Guide

Uploaded by

Python - Linear Regression Model Cheat Sheet

by DarioPittera (aggialavura) via cheatography.com/83764/cs/19917/

TO START TRAIN MODEL (cont)

# IMPORT DATA LIBRARIES lm.coef_ show coeffi​cients

VISUALISE DATA EVALUATION METRICS

sns.pa​irp​lot(df) pairplot print(​'MAE:', metric​s.m​ean​_ab​sol​ute​_er​ror​(y_​test, predic​tions))

sns.di​stp​lot​(df​['Y']) distri​bution plot print(​'MSE:', metric​s.m​ean​_sq​uar​ed_​err​or(​y_test, predic​tions))

sns.he​atm​ap(​df.c​orr(), annot=​True) heatmap with values print('RMSE:', np.sqrt(metrics.mean_squared_error(y_test, predictions))

MAE is the easiest to unders​tand, because it's the average error.

By DarioPittera (aggialavura) Not published yet. Sponsored by CrosswordCheats.com

You might also like

# IMPORT DATA LIBRARIES lm.coef_ show coefficients

sns.pairplot(df) pairplot print('MAE:', metrics.mean_absolute_error(y_test, predictions))

sns.distplot(df['Y']) distribution plot print('MSE:', metrics.mean_squared_error(y_test, predictions))

sns.heatmap(df.corr(), annot=True) heatmap with values print('RMSE:', np.sqrt(metrics.mean_squared_error(y_test, predictions))

MAE is the easiest to understand, because it's the average error.