WHITE PAPER
Data Masking 101: A Comprehensive Guide
Table of Contents
Introduction
What is Data Masking and Why is it Important?
Approaches for the Application of Data Masking
Popular Data Masking Techniques
Best Practices for Masking Sensitive Data
How to Implement Secure and Lasting Data Masking
Introduction
Sensitive data is everywhere, generated and collected constantly in a world that
is always online. This data can include personally identifiable information (PII)
like credit card numbers, personal health information (PHI) like biometric data,
and other forms of data that should remain private. When an organization stores
that data for use, it has both a legal and ethical responsibility to keep sensitive
information safe from a potential breach or leak.
According to Immuta’s 2022 State of Data Engineering Survey, roughly two thirds of data
professionals noted that their company already collects sensitive data. Similar recent studies
have shown that more than 64% of financial services organizations have over 1,000 sensitive
data files accessible to their entire employee roster. Effectively protecting sensitive data requires
balancing access and protection. Organizations can under-protect data by leaving the information
overly accessible, or over-protect by completely locking data down. In either case, the data is not
being protected and leveraged effectively.
Ensuring proper access to sensitive data, without compromising safety or accessibility,
is essential. But at an even more basic level, proactively de-identifying sensitive data within
storage and compute environments establishes a base layer of security in the event of breach.
Privacy enhancing technologies (PETs) like data masking make this foundational level of
security achievable.
In this white paper, we’ve gathered a range of essential information about data masking. We break
down basic definitions, masking types and techniques, best practices, and more. This guide will
help you answer the question:
How can I effectively use data masking to secure
my organization’s sensitive data?
What is Data Masking and Why is it Important?
Data masking, also referred to as data obfuscation, is a form of data access
control that alters existing sensitive information in a data set to make it
unidentifiable – but still potentially usable for analytics. This process allows
sensitive data to be stored and accessed, while maintaining the anonymity
and safety of the information involved.
Data masking comprises a family of obfuscation techniques for controlling information disclosure.
The choice of specific technique often depends upon the intended application. In determining
which technique to apply, one must consider:
1. Data Disclosure Risks: What fields and portions of values are sensitive? Who are the
downstream recipients? What could an attacker infer from this information? Who may
be harmed by the release of this information and how?
2. Analytical Use Cases vs Masking Characteristics: Are downstream recipients and
processes sensitive to formatting? Is there any information – such as the portion
of a credit card number that encodes the issuing bank – that must be preserved
downstream?
3. Governance: Do compliance and/or regulatory frameworks such as GDPR, CCPA, and
HIPAA apply? What restrictions, if any, do these frameworks place on the use or release
of the data? Can masking lower the operational classification of data processing
activity, thereby reducing compliance burden or allowing for broader sharing? What
masking methods are acceptable for the specified application?
Outside of its general recognition as an effective security measure, why should
you bother to implement masking techniques on sensitive data?
For one, the continued rise of data use and sharing in business and government is increasing
the risk of data breaches and leaks. According to the Identity Theft Resource Center (ITRC), 83%
of the 1,862 data breaches in 2021 involved sensitive data. In an age where data breaches can
impact organizations as large and (seemingly) secure as Facebook and LinkedIn, it is crucial
that companies incorporate masking techniques into their data storage capabilities to maintain
consumer safety.
Beyond general safety, customers also want to be able to trust the organizations with which
they interact and get assurances that their data is being used for legitimate business purposes.
There is now wide public support for online data privacy and intense legislative effort to adopt new rules or modernize existing ones in this space. Consumers now expect that the organizations to whom
they disclose their personal information will act as good stewards of their data. If this trust is
betrayed and data is not sufficiently safeguarded against misuse, it can severely damage consumer
confidence and relationships. Masking techniques are a must for both reducing the likelihood of data breaches and achieving data minimization and purpose limitation, particularly when personal information is processed for secondary purposes such as data analytics. Not only will masking personal identifiers and/or sensitive data help your organization protect consumers’ privacy through heightened confidentiality and unlinkability across processing activities, but it will also help maintain a high level of trust that your company will do right by their data.
An example of a masking policy in Immuta.
Approaches for the Application of Data Masking
The application of data masking has taken two major forms over time, each best suited to different scenarios and data environments. By examining the strengths and weaknesses of these types of data masking, you can evaluate which best suits your organization’s situation.
Static Data Masking (SDM)
Static data masking (SDM) masks data at rest rather than in active use. By creating a copy of an
existing data set and scrubbing it of all sensitive and/or personally identifiable information (PII), this
data can then be stored, shared, and accessed for use without putting sensitive information at risk.
The most important aspect of SDM is that it makes a copy of existing data. This means that the masked output of the SDM process is detached from the initial data, with no connections tying the two together. Because the copy is static, it will not receive updates unless you create another, more current copy.
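To make the static approach concrete, the following minimal sketch (in Python, using the pandas library) produces a scrubbed, detached copy of a hypothetical source table. The column names and masking choices are illustrative assumptions, not a depiction of any particular product’s implementation:

import pandas as pd

# Hypothetical source table; column names are illustrative.
source = pd.DataFrame({
    "customer_id": ["C-101", "C-102"],
    "name": ["Ada Lovelace", "Alan Turing"],
    "balance": [1523.75, 88.10],
})

masked_copy = source.copy()            # detached copy: no link back to the source
masked_copy["name"] = None             # scrub direct identifiers (replace with NULL)
masked_copy["customer_id"] = "XXXX"    # replace with a constant

# The snapshot is static: it goes stale until a new copy is produced.
masked_copy.to_csv("masked_snapshot.csv", index=False)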
When to Use Static Data Masking
Static data masking is best suited for environments where data is only used for a single purpose and does not change over time. Software and application development or training are examples of environments in which SDM is useful. When creating a new tool or application, developers need to test
their software with data that is realistic enough to be treated in the same way real
data would be.
Since static data masking scrubs real data sets of all sensitive information, it strikes the balance
between utility and safety in a testing environment. Developers can run tests that respond in a
realistic fashion without having to worry that the data could be exposed or used for the wrong
reasons. However, when large data volumes and/or combinations of different access levels are introduced, this approach is greatly hindered by an inability to scale easily. For this reason, static data masking is not recommended for analytical use cases, which require “live” data with real-time updates rather than stagnant copies.
Dynamic Data Masking (DDM)
Dynamic data masking (DDM) does not require moving or copying data. Instead, it takes the more
agile approach of applying masking techniques at query-time. DDM applies the same types of
data masking techniques as SDM, but does so without needing to separate the data from its
original source.
This maintains a single source of truth for the data set, rather than making multiple copies of
scrubbed and masked data for various uses. DDM helps teams avoid the pitfalls of confusion and
data silos that arise from creating many unnecessary copies of the data. Most importantly, since
it is never a copy, the data remains “live” and updated, which is critical for analytical use cases.
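As a rough illustration of the query-time idea, the toy sketch below applies masking rules based on the requesting user’s role each time the data is queried. The roles, rules, and helper function are hypothetical assumptions, not Immuta’s API:

import pandas as pd

# Hypothetical masking rules keyed by the requesting user's role.
MASK_RULES = {
    "analyst": {"name": lambda v: None},   # analysts never see raw names
    "admin": {},                           # admins see raw values
}

def query(source: pd.DataFrame, role: str) -> pd.DataFrame:
    view = source.copy()                   # a per-query view; stored data is untouched
    for column, mask in MASK_RULES.get(role, {}).items():
        view[column] = view[column].map(mask)
    return view

customers = pd.DataFrame({"name": ["Ada Lovelace"], "state": ["MD"]})
print(query(customers, "analyst"))         # name masked at query time
print(query(customers, "admin"))           # same live data, unmasked

Because masking happens per query, any update to the underlying table is immediately reflected in every masked view.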
When to Use Dynamic Data Masking
Of the types of data masking, dynamic data masking may be the most widely applicable. Since this type of masking is actively enforced at query time, it is not limited based on where the data is stored or copied to – it is a “live” view of the data. It also allows for more complex logic across varying sets of users, since masking is applied at runtime rather than requiring the creation of a copy for every scenario, as with SDM. Because of this, DDM supports more complex policy
scenarios and use cases, including dynamically retaining or destroying referential integrity.
Since dynamic masking maintains a single source of truth for data sets and does not require
copying data, this “live” view is perfect for analytical use cases. Compliance is also much easier to
manage because many more complex policy scenarios can be handled without making hundreds of
copies of your data. What’s more, instead of manually maintaining and auditing numerous copies of
a data set, policy is enforced and monitored/audited in a single consistent location.
Popular Data Masking Techniques
As the types of data masking developed, different methods and techniques for carrying out the process proliferated. Whether done in a static or dynamic manner, a wide range of data masking techniques can now be used to effectively protect PII in your data environment. Some of the most popular masking techniques are described below; short, illustrative code sketches for several of them follow the list:
• Replace with NULL: This function replaces any value in a column with a NULL. When this policy is applied,
the underlying data will appear to be NULL. This removes any identifiability from the column, at the cost
of removing all utility. It is useful to apply this policy to numeric or text attributes which have a high re-identification risk, but little analytic value. These can include names, personal identifiers, etc.
• Replace with Constant: This function replaces any value in a column with a specified value. When this
policy is applied, the underlying data will appear to be a constant. This masking carries the same privacy and
utility guarantees as “Replace with NULL.”
• Replace with REGEX: This function uses a regular expression replacement to replace all or a portion of an attribute. For example, it may be appropriate to reveal only the first three digits of a US zip code, reducing the identifiability of the disclosed value. REGEX replacement allows some groupings to be maintained, while providing greater ambiguity to the disclosed value. This masking technique is useful when the underlying data has some consistent structure (such as a 5-digit zip code, a 9-digit Social Security number, or a 16-digit credit card number), the masked data represents some re-identification risk, and a regular expression can be used to mask the underlying data to be less identifiable.
• Replace with Hashing: This function masks the values with an irreversible hash, which is consistent for the same value throughout the data source, so you can count or track the specific values, but not know the true raw value. This is appropriate for cases where the underlying value is sensitive, but there is a need to segment the population. Such attributes could be addresses, time segments, countries, etc. It is important to note that hashing is susceptible to inference attacks based on prior knowledge of the population distribution. For example, if “state” is hashed and the dataset is a sample across the United States, then an adversary could assume that the most frequently occurring hash value is California. As such, it is most secure to use the hashing mask on attributes that are evenly distributed across a population.
• Mask with Reversibility: This function masks in a way that an authorized user can “unmask” a value, thereby revealing the underlying value to that user. “Masking with Reversibility” is appropriate when there is a need to obscure a value while allowing an authorized user to recover it. All of the same use cases and caveats that apply to “Replace with Hashing” apply to this function, including inference attacks based on prior knowledge of the population distribution. Additionally, reversibly masked fields can leak the length of their contents, so it is important to consider whether this may be an attack vector for applications involving its use.
• Mask with Format Preserving Masking: This function also masks using a reversible function, but does so in a way that preserves the underlying structure of a given value, meaning the length and type of the value are maintained. This is appropriate when the masked value should appear in the same format as the underlying value. Examples include Social Security numbers and credit card numbers, where “Mask with Format Preserving Masking” would return masked values consistent with Social Security numbers or credit cards, respectively. The original value can also be recovered by an authorized user. There is greater overhead with this masking type, so it should only be used when format is critically valuable, such as when an engineer is building an application where downstream systems validate content. In almost all analytical use cases, format should not matter, so don’t use it solely because it “looks nicer.”
• Randomized Response: This policy randomizes the displayed value in an effort to make the true value
uncertain, but maintains some analytic utility. The randomization is applied differently to categorical
and quantitative values. In both cases, the noise is tunable, meaning that the noise can be increased to
enhance privacy or reduced to preserve more analytic value.
• Categorical Randomized Response: Categorical values are randomized by replacing a value with
some non-zero probability. Not all values are randomized, and the consumer of the data is not told
which values are randomized and which ones remain unchanged. Values are replaced by selecting
a different value uniformly at random from among the domain of possible values. For example, if a
randomized response policy were applied to a “state” column, a person’s residency could flip from
Maryland to Virginia. While this would be tragic for this person, it would provide ambiguity to the actual
state of residency. This policy is appropriate, for example, when obscuring sensitive values such as
medical diagnosis or survey responses.
• Datetime and Numeric Randomized Response: Datetime and Numeric randomized response apply
a tunable, unbiased noise term to the nominal value. This can obscure the underlying value, while
reducing the impact of the induced noise in aggregate. This can be applied to sensitive numerical
attributes such as salary, age, or treatment dates.
• Rounding: Masking via rounding rounds or truncates numeric or datetime values to a fixed precision. An
example would be mapping 3.14159 to 3. This policy is appropriate when it is important to maintain some
analytic value of a quantity, but not at its native precision. Rounding is applied differently between numerics
and datetime.
• Numeric Rounding: This policy maps the nominal value to the ceiling of some specified bandwidth.
• Date/Time Rounding: This policy truncates the precision of a datetime value to some user defined
precision. MINUTE, HOUR, DAY, MONTH, and YEAR are the supported precisions.
• k-Anonymization: Masking through k-Anonymization can be viewed as a type of masking policy, but it is in reality a measure of re-identification risk over a dataset. Rather than applying to a single attribute, k-Anonymization measures how many rows share a common set of values. By using a combination of “Rounding” and “NULL” masking policies over multiple columns, the dataset is masked so that every disclosed combination of values is shared by at least K rows, where K is a positive integer. This means that attributes will only be disclosed when there are a sufficient number of observations. This provides the anonymity of crowds, making individual rows indistinct from each other and reducing re-identification risk by making it uncertain which record corresponds to a specific person. This policy is appropriate to apply over indirect identifiers such as zip code, gender, and age. Each of these is generally not uniquely linked to an individual, but in combination they can be associated with a single person.
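The first four techniques above (NULLing, constants, REGEX replacement, and salted hashing) can be sketched in a few lines of Python. The salt, formats, and sample values below are hypothetical, and the sketch is illustrative rather than any particular product’s implementation:

import hashlib
import re

def mask_null(value):
    return None                        # Replace with NULL: no identifiability, no utility

def mask_constant(value, constant="REDACTED"):
    return constant                    # Replace with Constant: same guarantees as NULL

def mask_zip_regex(zip_code):
    # Replace with REGEX: reveal only the first three digits of a 5-digit US zip code
    return re.sub(r"^(\d{3})\d{2}$", r"\1XX", zip_code)

SALT = "org-wide-secret"               # hypothetical; manage real salts as secrets

def mask_hash(value):
    # Replace with Hashing: irreversible but consistent for equal inputs, so
    # counts and segmentation still work. Susceptible to frequency-based
    # inference on skewed attributes (see the "state" caveat above).
    return hashlib.sha256((SALT + value).encode()).hexdigest()

print(mask_zip_regex("21401"))                          # -> 214XX
print(mask_hash("Maryland") == mask_hash("Maryland"))   # -> True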
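Reversible masking can be approximated with ordinary symmetric encryption. The sketch below uses the open source cryptography package’s Fernet as one possible mechanism, paired with a toy keyed digit shift to illustrate the shape of format preservation. The digit shift is not secure format-preserving encryption (production systems use vetted schemes such as FF1), and the keys and values are placeholders:

import hashlib
from cryptography.fernet import Fernet

key = Fernet.generate_key()                # held only by authorized unmaskers
f = Fernet(key)

token = f.encrypt(b"123-45-6789")          # masked, recoverable with the key
assert f.decrypt(token) == b"123-45-6789"
# Caveat from above: the token's length varies with the input, leaking size.

def toy_format_preserving(digits, secret, unmask=False):
    # Keyed digit-wise shift: preserves length and all-digit format, and is
    # reversible with the secret. Illustrative only -- NOT real FPE.
    stream = hashlib.sha256(secret.encode()).digest()
    sign = -1 if unmask else 1
    return "".join(
        str((int(d) + sign * (stream[i % len(stream)] % 10)) % 10)
        for i, d in enumerate(digits)
    )

masked = toy_format_preserving("4111111111111111", "secret")    # still 16 digits
assert toy_format_preserving(masked, "secret", unmask=True) == "4111111111111111"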
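Randomized response and rounding can be sketched with the Python standard library alone. The domain, probabilities, and bandwidths below are illustrative assumptions, not recommendations:

import math
import random
from datetime import datetime

STATES = ["MD", "VA", "NY", "CA"]          # hypothetical categorical domain

def categorical_rr(value, p_keep=0.75):
    # Keep the true value with probability p_keep; otherwise draw uniformly
    # from the domain. Consumers cannot tell which rows were flipped.
    return value if random.random() < p_keep else random.choice(STATES)

def numeric_rr(value, scale=500.0):
    # Unbiased, tunable noise: individual values are obscured, but the
    # noise averages out in aggregate.
    return value + random.uniform(-scale, scale)

def round_numeric(value, bandwidth=10.0):
    # Map to the ceiling of the bandwidth, e.g. 3.14159 -> 10.0 at bandwidth 10
    return math.ceil(value / bandwidth) * bandwidth

def round_datetime(ts, precision="DAY"):
    # Truncate to a fixed precision (MINUTE/HOUR/DAY shown; MONTH and YEAR
    # omitted for brevity).
    fields = {
        "MINUTE": ("second", "microsecond"),
        "HOUR": ("minute", "second", "microsecond"),
        "DAY": ("hour", "minute", "second", "microsecond"),
    }
    return ts.replace(**{f: 0 for f in fields[precision]})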
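Finally, the k-Anonymization idea can be roughly sketched with pandas: measure how many rows share each combination of quasi-identifiers, and NULL out combinations shared by fewer than K rows. The column names (assumed to be already generalized via rounding or REGEX) and the choice of K are hypothetical:

import pandas as pd

QUASI_IDENTIFIERS = ["zip3", "gender", "age_decade"]
K = 5

def k_anonymize(df):
    out = df.copy()
    sizes = out.groupby(QUASI_IDENTIFIERS)[QUASI_IDENTIFIERS[0]].transform("size")
    # Disclose quasi-identifiers only where at least K rows share them, so
    # every disclosed row hides in a crowd of size >= K.
    out.loc[sizes < K, QUASI_IDENTIFIERS] = None
    return out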
Best Practices for Masking Sensitive Data
There are certain factors that must be considered in order to choose a
model that is both secure and efficient. These best practices highlight
some of the key considerations you should think about while building out
your data masking strategy.
Identify Your Sensitive Data
In order for masking to be effective, it’s integral to understand what data exists in your storage and analysis
environments. To choose the proper masking type and technique, you need to know what you’re masking. Is it
credit card numbers, addresses, or BMI data in a healthcare system’s data set? Each of these can be masked in
ways that guarantee their protection and proper compliance with the relevant laws and regulations.
The easiest way to maintain consistent, up-to-date knowledge of your data is to facilitate sensitive data
discovery and classification as data is introduced to your data stack. This gives data teams visibility and
control over the type of data in their possession, and where it is being stored and analyzed. Teams can then
better understand their data in the context of the regulations they are subject to, as well as the users who
need to access sensitive data. Aggregating this information helps determine the who/what/where/when/why
of the masking.
An example of sensitive data discovery in Immuta.
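As a rough illustration of pattern-based discovery (not a depiction of Immuta’s discovery engine), the sketch below scans string columns against hypothetical sensitive-data patterns and tags matches for downstream masking policies:

import pandas as pd

PATTERNS = {
    "credit_card": r"\d{16}",
    "ssn": r"\d{3}-\d{2}-\d{4}",
    "us_zip": r"\d{5}",
}

def classify_columns(df, threshold=0.9):
    findings = {}
    for column in df.select_dtypes(include="object"):
        sample = df[column].dropna().astype(str)
        if sample.empty:
            continue
        for label, pattern in PATTERNS.items():
            # Tag the column if most sampled values match the pattern.
            if sample.str.fullmatch(pattern).mean() >= threshold:
                findings[column] = label
    return findings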
Understand the Analytical Use Cases vs Masking Characteristics
In addition to understanding what you are protecting, you also need to understand how it’s going to be used. This is what is commonly termed the privacy vs utility tradeoff – you must decide how much utility you wish to provide from the masked data as compared to the privacy risk it entails. You should consider whether your masking methods have the following properties; a small sketch for empirically checking some of them follows the list:
• Preserve Equality and Grouping: Does the masking function preserve equality? Yes if equal values
remain equal under the masking while unequal values remain unequal. This implies that counting statistics
are also preserved. Put more simply, each value will be masked to the same value consistently without
colliding with others.
• Preserve Range Statistics: Is the number of data values falling in a particular range preserved? For strings
this can be interpreted as the number of strings falling between any two values by alphabetical order.
• Preserve Value Locality: Does the masked value need to be near the raw value? An example would be someone’s IP address. If the IP address is used to geolocate a device, it may be necessary that the masked value remain consistent with the geolocation. This property may be important for analytic purposes.
• Preserve Averages: Is avg(mask(v)) expected to be near avg(v)?
• Preserve Message Length: Is the length of the masked value equal to the length of the original value?
• Preserve Reversibility: Does there exist a process for qualified individuals to reveal the original input value (setting aside that this is always possible through policy exceptions)?
• Preserve Appearance: Does the output masked value belong to the set of valid column values? For
example, consider a masking function which outputs phone numbers when given phone numbers. Here,
NULL values are not counted against this property.
• Are Applicable to Numeric Data: The masking function can be applied to numeric values.
• Provide Deniability of Record Content: A (possibly identified) person can plausibly attribute the
appearance of the value to the masking function. This is a desirable property of masking functions which
retain analytic utility, as such functions must necessarily leak information about the original value. Fields
masked with these functions provide strong protections against value inference attacks.
• Are Suitable for De-Identification: Masking functions which can be used to obscure record identifiers,
hiding data subject identities and preventing future linking against other identified data.
• Allow for Column-Value Determinism: Repeated values in the same column mask to a common output.
• Introduce NULLs: The masking function may, under normal or irregular circumstances, return NULL values.
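A quick way to reason about these properties is to check a candidate masking function empirically. The harness below is an illustrative sketch that assumes a deterministic masking function; the checks and tolerance are hypothetical and far from exhaustive:

import statistics

def check_properties(mask, values):
    masked = [mask(v) for v in values]
    report = {
        # Equality and grouping: equal inputs stay equal, unequal stay unequal
        "preserves_equality": all(
            (a == b) == (mask(a) == mask(b)) for a in values for b in values
        ),
        "preserves_length": all(
            m is not None and len(str(m)) == len(str(v))
            for m, v in zip(masked, values)
        ),
        "introduces_nulls": any(m is None for m in masked),
    }
    if all(isinstance(x, (int, float)) for x in values + masked):
        report["preserves_average"] = abs(
            statistics.fmean(masked) - statistics.fmean(values)
        ) < 1.0                        # tolerance is application-specific
    return report

# e.g. check_properties(lambda v: round(v, -1), [3.14159, 27.0, 99.5])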
[Table: Masking Technique Characteristics. Each technique discussed above (including NULLing, Constant, REGEX, Hashing, Reversible Masking, the Format Preserving Masking variants, Categorical and Numerical Randomized Response, Rounding, k-Anonymization, and custom masking functions) is scored against the characteristics listed above, from preserving equality and grouping through introducing NULLs. Legend: Yes, the masking function has this characteristic; No, the masking function does not exhibit this characteristic; Yes, but with some caveats and assumptions – see footnotes where appropriate.]

Footnotes:
1. Custom masking functions are any function supported by the underlying database. As a result, the specific characteristics of the masking are entirely dependent on the SQL function being used to mask.
2. Regular expression masking is highly dependent on the replacement value.
3. k-Anonymization will preserve grouping, but only for groups which have sufficiently high counts. As a result, low-count groups will be NULLed out.
4, 5. Approximate value counts can be recovered via a correction factor. The error is tunable by choice of privacy parameters.
6. k-Anonymization will preserve grouping, but only for groups which have sufficiently high counts. Low-count groups will be NULLed out.
7. Approximate, with tunable error by choice of rounding parameters.
8. Yes in expectation, with tunable error by choice of privacy parameters.
9. k-Anonymization will preserve value locality, but only for groups which have sufficiently high counts. Low-count groups will be NULLed out.
10. Approximate, with tunable error by choice of rounding parameters.
11. Yes if every value matching the pattern has the same length; otherwise no.
12. Yes if the pattern consists only of numeric strings of digits; otherwise no.
13, 14, 15. Yes if values are unique to a record. Otherwise, record content may be inferrable through frequency analysis.
16, 17, 18. Yes if values are unique to a record, and the range of format values covers all possible values.
Consider Referential Integrity
Referential integrity means that two or more tables can be joined on a common column or set of columns
because the data in both sets match.
In some cases, you may want to preserve referential integrity even when data is masked. In other cases, you
may want referential integrity destroyed in order to block “toxic” combinations of data that could result in
privacy leaks. Masking techniques such as hashing and reversible masking provide the ability through salting
and encryption keys to retain or destroy referential integrity. If this is done dynamically using DDM, it can be
very powerful.
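As a brief sketch (assuming Python’s hashlib; the salts and keys are placeholders), a salt shared across tables keeps masked keys joinable, while per-table salts deliberately break the join:

import hashlib

def mask_key(value, salt):
    return hashlib.sha256((salt + value).encode()).hexdigest()

SHARED = "shared-salt"
orders_key = mask_key("customer-42", SHARED)
payments_key = mask_key("customer-42", SHARED)
assert orders_key == payments_key      # referential integrity retained: joins still work

# To block a "toxic" join, give each table its own salt:
assert mask_key("customer-42", "orders-salt") != mask_key("customer-42", "payments-salt")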
Consider Governance and its Costs
Compliance frameworks and regulation – such as GDPR, CCPA, HIPAA – may govern the handling of specific
categories of information, placing restrictions on the processing and dissemination of data. It is therefore
necessary to understand any applicable governance requirements.
This is important not only because frameworks often suggest or dictate masking approaches for governed
categories, but also because the masking of select elements may lower the operational classification of data
processing activity, thereby reducing compliance burden or allowing for broader sharing. In such cases, costly
processes such as review and audit may be reduced or eliminated, lowering operational costs and time to value,
and increasing the data’s overall availability.
Ensure Repeatability and the Ability to Scale
One could argue that this is the most essential part of creating a lasting data masking standard for your
organization. Data masking should be viewed as a long-term solution to protecting your data from breach and
ensure compliant analytics, so solutions should therefore be implemented only if they have long-term potential.
The foundations of any masking standard should be built in a way that allows for repeatability and scalability.
Masking techniques should be applicable to any new data in perpetuity, without needing to be overhauled or
greatly adjusted. As data evolves and multiplies, the techniques used to protect it must be able to keep up.
This means that masking techniques should be chosen and implemented only if they can be successful for your
data needs both now and in the future.
How to Implement Secure and Lasting Data Masking
It can seem hard to imagine a cut-and-dried masking standard that universally meets the modern range of applications and use cases. By examining data masking’s various types, techniques, and best practices against your organization’s needs, you can build a system that is best suited to your goals.
Modern data use will only increase, and masking methods must be able to maintain their
effectiveness while allowing the flexibility for organizations to evolve and grow.
Immuta’s Data Access Platform allows organizations to control access to their sensitive data,
ensuring that only the right people are given the right access at the right times. By applying
dynamic data masking at query-time, Immuta maintains privacy without slowing time-to-data or
requiring unnecessary copies. PETs like k-anonymization, randomized response, encryption, and
others, also provide the consistent, holistic, and scalable masking capabilities required for current
and future data use.
For a hands-on look at how data masking is dynamically applied through policies,
try our self-guided Immuta walkthrough demo.
About Immuta
Immuta is the market leader in Data Access, providing data teams one universal platform to control
access to analytical data sets in the cloud. Only Immuta can automate access to data by discovering,
protecting, and monitoring data. Data-driven organizations around the world trust Immuta to speed time to
data, safely share more data with more users, and mitigate the risk of data leaks and breaches. Founded in
2015, Immuta is headquartered in Boston, MA.
25 Thomson Place, 4th Floor, Boston, MA 02210 immuta.com (800) 655-0982 © 2022 Immuta, Inc. All rights reserved. 081122