How Shapley Values Work - A Simple Guide

This article explains Shapley values and their application in machine learning explainability, particularly using the Boston housing dataset. It details the mechanics of calculating Shapley values, including the use of marginal contributions from features, and compares traditional Shapley values with the SHAP technique. The article also includes a worked example, visualizations, and insights derived from the Shapley values to enhance understanding of model predictions.

Uploaded by

kahina.boukaci

Impromptu Engineer

DEC 31, 2022 • 10 MIN READ • EXPLAINABLE AI

How Shapley Values Work

In this article, we will explore how Shapley values work - not using cryptic formulae, but by
way of code and simplified explanations

Shapley values - and their popular extension, SHAP - are machine learning explainability
techniques that are easy to use and interpret. However, trying to make sense of their theory
can be intimidating. In this article, we will explore how Shapley values work - not using
cryptic formulae, but by way of code and simplified explanations.

Introducing the Dataset

Before we dive into things, we'll briefly examine the dataset that will be used throughout
this post.

We'll calculate Shapley values from scratch using a simplified version of the Boston housing
dataset. This dataset contains the prices of 506 houses, accompanied by three predictive
features (Table 1).

Variable Name        Description
% working class      Percentage of the population that is working class.
number of rooms      The average number of rooms per house in the housing unit.
NOX concentration    Nitric oxides concentration (parts per 10 million).

Table 1. The model input variables used to predict house prices.

Shapley values will enable us to understand the house price predictions of a machine
learning model trained on these three features.

import pandas as pd

features = ["% working class", "number of rooms", "NOX concentration"]

df = pd.read_csv("[Link]")
y = df["y"].values
print(f"{len(y)} rows")
print(df[features + ["y"]].sample(5, random_state=0))

# returns:
# 506 rows
#      % working class  number of rooms  NOX concentration        y
# 329            14.68            6.333              0.460  22600.0
# 371            19.06            6.216              0.631  50000.0
# 219            21.00            6.373              0.550  23000.0
# 403            39.54            5.349              0.693   8300.0
# 78             24.68            6.232              0.437  21200.0

The Mechanics of Shapley Values

Suppose we have a machine learning model, model(A, B, C), which has been trained on
the above dataset to predict house prices based on three features, A, B, and C. We select
an instance from the dataset, house1, with feature values A = 10, B = 6.5, and C = 0.5.
The model predicts a price of $24,200 for house1.

The Shapley values for house1 will quantify how much each of its features contributes to
the predicted price, p(house1), of $24,200. This is expressed by the following relationship,
where the base value, b, is a fixed constant that is the same across all instances in the
dataset:

$$p(\text{house1}) = b + \text{shapley}_A(\text{house1}) + \text{shapley}_B(\text{house1}) + \text{shapley}_C(\text{house1})$$

To calculate these Shapley values for features A, B, and C, we must consider the power
set of all possible feature coalitions. This is a fancy way of saying: consider all possible
combinations of the three features, spanning 0, 1, 2, and all 3 features. For each coalition,
we retrain the machine learning model using the features included in the coalition. This
power set of models is shown in Figure 1, with the models connected from top to bottom by
the incremental addition of features. For the model trained on no features, the prediction is
always the average house price in the dataset. This constant prediction is the previously
mentioned base value (in this example, $22,500).
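As a minimal sketch of what "the power set of feature coalitions" means in practice, the snippet below enumerates every coalition for three features. The feature names A, B, and C follow the text; the helper name power_set is our own and purely illustrative.

```python
from itertools import combinations

features = ["A", "B", "C"]


def power_set(items):
    """Return every coalition of `items`, from the empty set up to the full set."""
    return [
        frozenset(combo)
        for size in range(len(items) + 1)
        for combo in combinations(items, size)
    ]


coalitions = power_set(features)
for coalition in coalitions:
    # one model would be trained per coalition printed here
    print(sorted(coalition))

print(len(coalitions), "coalitions")
```

The empty coalition corresponds to the no-feature model that always predicts the base value, and the full coalition corresponds to the model we ultimately want to explain.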

The Shapley values are based on the marginal contributions of each feature to the models'
predictions. That is to say, for house1, what is the change in predicted price when a feature
is added to the model? Figure 1 illustrates the marginal contributions of feature A, denoted
in red as MC_A.

Fig 1. The power set of models using features A, B, and C, with predictions for house1.
Exemplified in red are the marginal contributions of feature A.

From these marginal contributions, we can calculate the Shapley value of feature A for
house1 as a weighted sum. Each weight is the reciprocal of the number of connections in
that layer of Figure 1: three connections join the no-feature model to the one-feature
models, six join the one-feature models to the two-feature models, and three join the
two-feature models to the full model. Feature A is added along one, two, and one of these
connections respectively, so its weights are 1/3, 1/6, 1/6, and 1/3, which sum to 1.

$$\text{shapley}_A(\text{house1}) = \tfrac{1}{3} \times 2200 + \tfrac{1}{6} \times 1200 + \tfrac{1}{6} \times 8400 + \tfrac{1}{3} \times 600$$

For feature A, this gives a Shapley value of approximately $2,550 for house1. The equivalent
calculations for features B and C produce Shapley values of $1,250 and -$2,100,
respectively. The relationship between the predicted price and Shapley values holds true:

$$24{,}200 = 22{,}500 + 2{,}550 + 1{,}250 - 2{,}100$$
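The weights used above can also be derived rather than read off the figure. The sketch below uses the standard closed form for Shapley weights, |S|! (F - |S| - 1)! / F!, where |S| is the number of features already in the model before the new feature is added and F is the total number of features. This formula is not spelled out in the article, and the function name shapley_weight is our own. The marginal contributions are the four figures quoted for feature A.

```python
from fractions import Fraction
from math import factorial


def shapley_weight(coalition_size, n_features):
    """Weight for adding a feature to a coalition of `coalition_size` other features."""
    return Fraction(
        factorial(coalition_size) * factorial(n_features - coalition_size - 1),
        factorial(n_features),
    )


# (features already in the model, marginal contribution of A in dollars)
marginal_contributions = [(0, 2200), (1, 1200), (1, 8400), (2, 600)]

weights = [shapley_weight(size, 3) for size, _ in marginal_contributions]
shapley_a = sum(w * mc for w, (_, mc) in zip(weights, marginal_contributions))

print([str(w) for w in weights])  # the weights 1/3, 1/6, 1/6, 1/3
print(sum(weights))               # the weights sum to 1
print(float(shapley_a))           # weighted sum of the quoted contributions
```

With the four contributions exactly as quoted, the weighted sum comes to about $2,533 rather than $2,550, so the quoted figures are presumably rounded. The check on the final relationship is unaffected: 22,500 + 2,550 + 1,250 - 2,100 is exactly 24,200.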

From these Shapley values, we can see that house1's feature values of A = 10 and B =
6.5 are contributing towards a higher than average predicted house price, whereas C =
0.5 pushes the prediction downwards. By calculating the three features' Shapley values for
all houses in the dataset, we can reveal insights into how the machine learning model
makes its predictions.

A Note on Shapley Values versus SHAP Values

Shapley values and SHAP values are often conflated, but they aren't exactly the same.
SHAP encompasses a range of techniques for efficiently approximating Shapley values, by
combining them with local interpretability methods such as LIME. This complicates the
mechanics of SHAP, but it provides some major advantages in return:

1. To calculate Shapley values, you must retrain your machine learning model 2^F times
(where F is the number of features). SHAP avoids this by using approximation methods,
making it viable for applications where obtaining Shapley values would be
computationally infeasible.

2. Although SHAP can be applied to any machine learning model, for tree-based models,
SHAP uses an algorithm that is especially fast at estimating Shapley values.
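To get a feel for why exact Shapley values become infeasible, here is a small sketch that counts the models needed for a few feature counts. The function name and the chosen feature counts are illustrative only.

```python
def n_models_to_train(n_features):
    """Number of feature coalitions, and hence retrained models, for exact Shapley values."""
    return 2 ** n_features


for n in (3, 10, 20, 30):
    print(f"{n:>2} features -> {n_models_to_train(n):,} models")
```

Three features need only 8 models, but 20 features already need over a million, which is why approximation methods matter for realistic datasets.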

For a gentle introduction to SHAP analyses, I recommend checking out a previous post of
mine, "Explaining Machine Learning Models: A Non-Technical Guide to Interpreting SHAP
Analyses".

Although standard Shapley values have largely been superseded by those produced by SHAP
for most use cases, the theory carries over, so it's still useful to understand. I'll explain
how SHAP works in a future post, so subscribe if that's something you'd like to see.

Calculating Shapley Values: A Worked Example

This worked example has three stages:

1. Retraining the machine learning model for each feature coalition.

2. Calculating the Shapley values from each feature's marginal contributions.

3. Visualising the Shapley values to generate insights.

Train the Machine Learning Models and Make Predictions

We want to obtain Shapley values for the machine learning model trained on all three
features. As we saw in the theory section, this requires us to retrain the model 2^3 = 8
times, for the full power set of possible feature coalitions. This is shown in the code sample
below, where a model is trained for each possible combination of features and its predictions
are stored in the models dictionary. This includes the full feature model, models["all"],
that we're ultimately interested in.

from sklearn.ensemble import RandomForestRegressor

# predictions for every house, keyed by the feature coalition used for training
models = {}

# model with no features: always predicts the mean house price (the base value)
models["none"] = [y.mean()] * len(y)

# models with one feature
for feature in features:
    X = df[feature].values.reshape(-1, 1)
    m = RandomForestRegressor(random_state=0).fit(X, y)
    models[feature] = m.predict(X)

# models with two features
for i, feature1 in enumerate(features):
    for feature2 in features[i+1:]:
        X = df[[feature1, feature2]].values
        m = RandomForestRegressor(random_state=0).fit(X, y)
        models[f"{feature1}, {feature2}"] = m.predict(X)

# model with all features
X_all = df[features]
m = RandomForestRegressor(random_state=0).fit(X_all, y)
models["all"] = m.predict(X_all)

In this example, we're using a random forest model with scikit-learn's default
hyperparameters. In most real-world applications, you would arrive at a set of custom
hyperparameters for your model trained on all the features via tuning. These
hyperparameters would then be held constant during the retraining of the model.

Calculate the Shapley Values

Now that we have our power set of models, we can calculate the Shapley values as the
weighted average of the marginal contributions for each feature.

# Shapley values for % working class
sv_pwc = (
    1/3 * (models["% working class"] - models["none"])
    + 1/6 * (models["% working class, number of rooms"] - models["number of rooms"])
    + 1/6 * (models["% working class, NOX concentration"] - models["NOX concentration"])
    + 1/3 * (models["all"] - models["number of rooms, NOX concentration"])
)

# Shapley values for number of rooms
sv_nor = (
    1/3 * (models["number of rooms"] - models["none"])
    + 1/6 * (models["% working class, number of rooms"] - models["% working class"])
    + 1/6 * (models["number of rooms, NOX concentration"] - models["NOX concentration"])
    + 1/3 * (models["all"] - models["% working class, NOX concentration"])
)

# Shapley values for NOX concentration
sv_nc = (
    1/3 * (models["NOX concentration"] - models["none"])
    + 1/6 * (models["% working class, NOX concentration"] - models["% working class"])
    + 1/6 * (models["number of rooms, NOX concentration"] - models["number of rooms"])
    + 1/3 * (models["all"] - models["% working class, number of rooms"])
)

Here, sv_pwc, sv_nor, and sv_nc are arrays of 506 elements, each holding the corresponding
feature's Shapley value for one instance in the dataset.
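The three hand-written formulas follow one pattern, so they can be generalised to any number of features. The sketch below is one way to do that for a single instance. It assumes predictions are stored in a dictionary keyed by frozenset of feature names (unlike the string keys used in this article), and the function name shapley_values and the toy numbers are hypothetical.

```python
from itertools import combinations
from math import factorial


def shapley_values(predictions, features):
    """Exact Shapley values for one instance.

    `predictions` maps each coalition (a frozenset of feature names) to the
    prediction of the model trained on exactly those features.
    """
    n = len(features)
    values = {}
    for feature in features:
        others = [f for f in features if f != feature]
        total = 0.0
        for size in range(n):
            # same weights as above: 1/3, 1/6, 1/3 when n == 3
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                without = frozenset(subset)
                with_feature = without | {feature}
                total += weight * (predictions[with_feature] - predictions[without])
        values[feature] = total
    return values


# Toy, purely additive "models": each feature shifts the prediction by a fixed amount.
effects = {"A": 10.0, "B": 20.0, "C": -5.0}
toy_predictions = {
    frozenset(combo): 100.0 + sum(effects[f] for f in combo)
    for size in range(len(effects) + 1)
    for combo in combinations(effects, size)
}

sv = shapley_values(toy_predictions, list(effects))
print(sv)
```

Because the toy predictions are additive, each feature's Shapley value should simply recover its own effect, which makes this a convenient sanity check for the function.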

Create a Waterfall Chart for a Single Instance

We'll now visualise these Shapley values to generate insights into the workings of our
machine learning model.

A common SHAP plot for understanding local feature importance (i.e. how features
contribute to the prediction of a single instance in the dataset) is the waterfall chart. This
deconstructs a single house's prediction into the base value, plus the Shapley values for
each feature of that house. We can create waterfall charts using plotly.

import plotly.graph_objects as go

i = 0  # index of the house to explain
fig = go.Figure(
    go.Waterfall(
        name="waterfall",
        orientation="h",
        y=features,
        x=[sv_pwc[i], sv_nor[i], sv_nc[i]],
        base=models["none"][i],
        decreasing=dict(marker=dict(color="#008bfb")),
        increasing=dict(marker=dict(color="#fb0655")),
    )
)
fig.update_layout(
    title=f"Base: ${models['none'][i]:,.0f}, Prediction: ${models['all'][i]:,.0f}",
    width=1000, height=500, font=dict(size=14),
)
fig.write_image("plots/[Link]")

For this particular example using the first instance in the dataset (i = 0), we see that the
% working class and number of rooms features "push" the house price prediction higher to
~$26,300 from the starting point (base value) of $22,533. The NOX concentration feature,
however, reduces it to a final value of $24,203. The base value is the mean house price in
the dataset, which is stored in models["none"].

We can verify that the sum of the base value and Shapley values equals the house price
predictions for all houses in the dataset using np.all(np.isclose(models["none"] +
sv_pwc + sv_nor + sv_nc, models["all"])), which returns True.
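This check is not a coincidence of the dataset. When the three weighted sums are added together, every intermediate model's prediction cancels, leaving only the full model minus the no-feature model. The sketch below illustrates this with arbitrary random numbers standing in for the eight models' predictions for one house; the short keys A, B, and C are placeholders for the three features.

```python
import random

random.seed(0)
names = ["none", "A", "B", "C", "A, B", "A, C", "B, C", "all"]
# arbitrary stand-in predictions; no real model is involved
p = {name: random.uniform(0, 50_000) for name in names}

sv_a = ((p["A"] - p["none"]) / 3 + (p["A, B"] - p["B"]) / 6
        + (p["A, C"] - p["C"]) / 6 + (p["all"] - p["B, C"]) / 3)
sv_b = ((p["B"] - p["none"]) / 3 + (p["A, B"] - p["A"]) / 6
        + (p["B, C"] - p["C"]) / 6 + (p["all"] - p["A, C"]) / 3)
sv_c = ((p["C"] - p["none"]) / 3 + (p["A, C"] - p["A"]) / 6
        + (p["B, C"] - p["B"]) / 6 + (p["all"] - p["A, B"]) / 3)

total = p["none"] + sv_a + sv_b + sv_c
print(abs(total - p["all"]) < 1e-6)  # True: base value + Shapley values = full prediction
```

Since the identity holds for any eight numbers, the base value plus the Shapley values always reconstructs the full model's prediction, up to floating-point error.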

Create a Bar Chart of the Mean Absolute Shapley Values

To understand the global importance of the model's features, we'll create a bar chart of the
mean absolute Shapley values (MASVs). This is a measure of how much each feature
influences the model's house price predictions, on average. The MASVs are calculated and
plotted using the code below.

import matplotlib.pyplot as plt
import numpy as np

fig, ax = plt.subplots()
ax.barh(
    ["NOX concentration", "number of rooms", "% working class"],
    [np.abs(sv_nc).mean(), np.abs(sv_nor).mean(), np.abs(sv_pwc).mean()],
    height=0.6,
)
ax.grid(axis="x")
ax.set_xlabel("mean absolute shapley value")

We see that on average, % working class, number of rooms, and NOX concentration
contribute ±$2,641, ±$2,474, and ±$2,247 to each house price prediction, respectively.
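The absolute value matters here because Shapley values are signed, so positive and negative contributions would otherwise cancel out. The sketch below uses made-up Shapley values for two hypothetical features to show the difference between the plain mean and the mean absolute value.

```python
def mean_absolute(values):
    """Mean of the absolute values: the MASV for one feature."""
    return sum(abs(v) for v in values) / len(values)


# hypothetical Shapley values for four houses
toy = {
    "feature X": [2000.0, -2000.0, 1000.0, -1000.0],
    "feature Y": [300.0, 200.0, 100.0, 400.0],
}

masv = {name: mean_absolute(vals) for name, vals in toy.items()}
signed_mean = {name: sum(vals) / len(vals) for name, vals in toy.items()}
ranking = sorted(masv, key=masv.get, reverse=True)

print(masv)
print(signed_mean)
print(ranking)
```

In this toy data, feature X has a plain mean of zero even though it moves individual predictions far more than feature Y, and only the mean absolute value ranks it first.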

Create a Beeswarm Plot of the Shapley Values

Another useful representation of Shapley values is the beeswarm plot. This allows us to see
the full distribution of Shapley values for each feature, and understand their relationships
with the actual feature values. The code below uses the seaborn library to produce the
beeswarm plot.

import numpy as np
import seaborn as sns

# shape data for beeswarm plot: one row per (house, feature) pair
df_sv = pd.DataFrame()
df_sv["feature"] = (
    ["% working class"] * len(y)
    + ["number of rooms"] * len(y)
    + ["NOX concentration"] * len(y)
)
df_sv["shapley value"] = np.concatenate([sv_pwc, sv_nor, sv_nc])
# standardise each feature so that one colour scale works for all three
df_sv["hue"] = np.concatenate(
    [
        (df["% working class"].values - df["% working class"].mean())
        / df["% working class"].std(),
        (df["number of rooms"].values - df["number of rooms"].mean())
        / df["number of rooms"].std(),
        (df["NOX concentration"].values - df["NOX concentration"].mean())
        / df["NOX concentration"].std(),
    ]
)

# beeswarm plot
fig, ax = plt.subplots()
ax.axvline(0, c="grey", alpha=0.8)
ax = sns.swarmplot(
    x=df_sv["shapley value"],
    y=df_sv["feature"],
    hue=df_sv["hue"],
    palette="coolwarm",  # red = higher raw value; blue = lower raw value
    size=5,
    legend=False,
)
[Link].set_visible(False)
ax.grid(axis="x")
ax.set_ylabel("")
plt.savefig("plots/[Link]")
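The hue column in the code above is a z-score: each feature value minus the feature's mean, divided by its standard deviation. This puts features with very different units on a shared colour scale. The sketch below reproduces that calculation in plain Python on made-up room counts; it uses the sample standard deviation, which is also the default for pandas' std().

```python
from statistics import mean, stdev


def standardise(values):
    """Return z-scores: (value - mean) / sample standard deviation."""
    mu, sigma = mean(values), stdev(values)
    return [(v - mu) / sigma for v in values]


rooms = [5.3, 6.2, 6.4, 7.1]  # hypothetical feature values
z = standardise(rooms)
print([round(v, 2) for v in z])
```

Values above the feature's mean get positive scores (towards red in the coolwarm palette) and values below it get negative scores (towards blue).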

Here are two insights we can infer from the beeswarm plot:

1. Lower values of % working class (blue dots) have positive Shapley values (i.e.
contribute towards predictions of higher house prices). The same is (generally) true of
NOX concentration, and the opposite is true of number of rooms.

2. The largest Shapley values (~$14,000) are seen for high values (red dots) of the number
of rooms feature.

Conclusion

In this article, we've calculated Shapley values from scratch for a three-feature random
forest regressor. We've also seen how these Shapley values can be visualised to generate
insights into the predictions of the model.

In future posts, we'll explore how Shapley values are extended for practicable application
to machine learning using SHAP. The next article in this series, "Approximating Shapley
Values for Machine Learning", explains how Shapley values can be approximated for machine
learning models, instead of using the computationally intensive exact calculation outlined
in this post.

Code samples for this article can be found on GitHub: AidanCooper/shapley-values-from-scratch
