SMU Fundamental of Database lecture note - Chapter Four
CHAPTER FOUR
NORMALIZATION
WHAT IS NORMALIZATION?
Normalization is the process of efficiently organizing data in a database.
There are two goals of the normalization process: eliminating redundant
data (for example, storing the same data in more than one table) and
ensuring data dependencies make sense (only storing related data in a
table). Both of these are worthy goals as they reduce the amount of space
a database consumes and ensure that data is logically stored.
WHY NORMALIZATION?
Why is normalization necessary when creating database applications?
Normalization:
serves as the basis for Physical Database Design, ensuring that
customer requirements are properly satisfied and that new
requirements are easier to accommodate,
makes the design of a modular system easier,
makes the database easier to maintain,
ensures structural stability of data,
prevents various updating anomalies which can occur in non-
normalized record structures,
enables record processing by a set of simple operators.
While performance considerations often prohibit the direct
implementation of normalized records, the process of
normalization results in a model of the "ideal" data structure and
relationships. Consequently, it is a valuable procedure whether or not a
relational database system is used.
Essentially, table optimization is accomplished through the elimination of
data redundancy and of the problems it causes, such as:
Update Anomaly
Deletion Anomaly
Insertion Anomaly
Database anomalies are really just unmatched or missing information
caused by limitations or flaws within a given database. Databases are
designed to collect data and sort or present it in specific ways to the end
user. Entering or deleting information, whether an update or a new record,
can cause issues if the database is limited or has 'bugs'.
INSERTION ANOMALY
It is a failure to place information about a new database entry into all the
places in the database where information about the new entry needs to
be stored. In a properly normalized database, information about a new
entry needs to be inserted into only one place in the database. In an
inadequately normalized database, information about a new entry may
need to be inserted into more than one place, and, human fallibility being
what it is, some of the needed additional insertions may be missed.
DELETION ANOMALY
It is a failure to remove information about an existing database entry
when it is time to remove that entry. In a properly normalized database,
information about an old, to-be-removed entry needs to be deleted
from only one place in the database. In an inadequately normalized
database, information about that old entry may need to be deleted from
more than one place.
UPDATE ANOMALY
An update of a database involves modifications that may be additions,
deletions, or both. Thus “update anomalies” can be either of the kinds
discussed above.
All three kinds of anomalies are highly undesirable, since their
occurrence constitutes corruption of the database. Properly normalized
databases are much less susceptible to corruption than are un-
normalized databases.
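To make these anomalies concrete, here is a minimal sketch in Python using
the standard sqlite3 module. The un-normalized Employee table, its columns
and its sample rows are all hypothetical illustrations, not part of the
chapter's own examples.

    import sqlite3

    # Hypothetical un-normalized table: department details are repeated on
    # every employee row.
    con = sqlite3.connect(":memory:")
    cur = con.cursor()
    cur.execute("""
        CREATE TABLE Employee (
            EmpID     INTEGER PRIMARY KEY,
            EmpName   TEXT,
            DeptName  TEXT,
            DeptPhone TEXT
        )""")
    cur.executemany(
        "INSERT INTO Employee VALUES (?, ?, ?, ?)",
        [(1, "Abebe", "Sales",   "111-2222"),
         (2, "Sara",  "Sales",   "111-2222"),   # same department data again
         (3, "Yonas", "Finance", "333-4444")])

    # Update anomaly: changing the Sales phone number must touch every
    # Sales row; missing even one row leaves the table contradicting itself.
    cur.execute("UPDATE Employee SET DeptPhone = '555-0000' "
                "WHERE DeptName = 'Sales'")

    # Insertion anomaly: a brand-new department with no employees yet cannot
    # be recorded without inventing a dummy employee row.

    # Deletion anomaly: removing the only Finance employee also erases the
    # only record of the Finance phone number.
    cur.execute("DELETE FROM Employee WHERE EmpID = 3")
    print(cur.execute("SELECT COUNT(*) FROM Employee "
                      "WHERE DeptName = 'Finance'").fetchone())  # (0,)
    con.close()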
In order to overcome the above database anomalies the database
community has developed a series of guidelines for ensuring that
databases are normalized. The stages of normalization are referred to as
normal forms and progress from the least restrictive (First Normal Form)
through the most restrictive (Fifth Normal Form). Generally, most
database designers do not attempt to implement anything higher than
Third Normal Form or Boyce-Codd Normal Form.
FIRST NORMAL FORM (1NF)
First normal form (1NF) sets the very basic rules for an organized
database:
Eliminate duplicative columns from the same table.
Create separate tables for each group of related data and identify
each row with a unique column or set of columns (the primary
key).
What do these rules mean when contemplating the practical design of a
database? It's actually quite simple.
The first rule dictates that we must not duplicate data within the same
row of a table. Within the database community, this concept is referred
to as the atomicity of a table. Tables that comply with this rule are said
to be atomic. Let's explore this principle with a classic example: a table
within a human resources database that stores the manager-subordinate
relationship. For the purposes of our example, we'll impose the business
rule that each manager may have one or more subordinates while each
subordinate may have only one manager.
Intuitively, when creating a list or spreadsheet to track this information,
we might create a table with the following fields:
Manager
Subordinate1
Subordinate2
Subordinate3
Subordinate4
However, recall the first rule imposed by 1NF: eliminate duplicative
columns from the same table. Clearly, the Subordinate1-Subordinate4
columns are duplicative. Take a moment and ponder the problems raised
by this scenario. If a manager has only one subordinate, the
Subordinate2-Subordinate4 columns are simply wasted storage space (a
precious database commodity). Furthermore, imagine the case where a
manager already has four subordinates: what happens if she takes on
another employee? The whole table structure would require modification.
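As a rough sketch of this problem, the following Python fragment (using the
built-in sqlite3 module, with hypothetical table and column names) shows how
the repeated-column design wastes space and forces a schema change as soon
as a fifth subordinate appears.

    import sqlite3

    con = sqlite3.connect(":memory:")
    cur = con.cursor()
    cur.execute("""
        CREATE TABLE ManagerWide (
            Manager      TEXT PRIMARY KEY,
            Subordinate1 TEXT,
            Subordinate2 TEXT,
            Subordinate3 TEXT,
            Subordinate4 TEXT
        )""")

    # A manager with a single subordinate leaves three columns as wasted NULLs.
    cur.execute("INSERT INTO ManagerWide "
                "VALUES ('Alice', 'Mary', NULL, NULL, NULL)")

    # A fifth subordinate cannot be stored at all without altering the schema.
    cur.execute("ALTER TABLE ManagerWide ADD COLUMN Subordinate5 TEXT")
    con.close()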
At this point, a second bright idea usually occurs to database novices:
We don't want to have more than one column and we want to allow for a
flexible amount of data storage. Let's try something like this:
Manager
Subordinates
Where the Subordinates field contains multiple entries in the form "Mary,
Bill, Joe"
This solution is closer, but it also falls short of the mark. The
subordinates column is still duplicative and non-atomic. What happens
when we need to add or remove a subordinate? We need to read and
rewrite the entire contents of the Subordinates field. That's not a big deal
in this situation, but what if one manager had one hundred employees? Also, it
complicates the process of selecting data from the database in future
queries.
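A small Python/sqlite3 sketch of that difficulty follows; apart from Mary,
Bill and Joe, the names and data are invented for illustration. Because the
Subordinates column is non-atomic, queries have to fall back on string
matching.

    import sqlite3

    con = sqlite3.connect(":memory:")
    cur = con.cursor()
    cur.execute("CREATE TABLE ManagerList (Manager TEXT, Subordinates TEXT)")
    cur.executemany("INSERT INTO ManagerList VALUES (?, ?)",
                    [("Alice", "Mary, Bill, Joe"),
                     ("Carol", "Billy, Ann")])

    # "Who manages Bill?" A naive LIKE also matches "Billy" under Carol.
    rows = cur.execute("SELECT Manager FROM ManagerList "
                       "WHERE Subordinates LIKE '%Bill%'").fetchall()
    print(rows)  # [('Alice',), ('Carol',)] -- includes a false positive

    # Adding or removing one subordinate means rewriting the whole string.
    cur.execute("UPDATE ManagerList SET Subordinates = 'Mary, Joe' "
                "WHERE Manager = 'Alice'")
    con.close()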
Here's a table that satisfies the first rule of 1NF:
Manager
Subordinate
In this case, each subordinate has a single entry, but managers may
have multiple entries.
Now, what about the second rule: identify each row with a unique
column or set of columns (the primary key)? You might take a look at the
table above and suggest the use of the subordinate column as a primary
key. In fact, the subordinate column is a good candidate for a primary
key because our business rules specified that each subordinate may have
only one manager. However, the names we've chosen to store in our table
make this a less than ideal solution. What happens if we hire a second
employee with the same name as an existing subordinate? How do we store
that person's manager-subordinate relationship in the database?
It's best to use a truly unique identifier (such as an employee ID) as a
primary key. Our final table would look like this:
Manager ID
Subordinate ID
Now, our table is in first normal form.
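One possible sketch of this 1NF design, again in Python with sqlite3,
follows; the ID values are invented for illustration.

    import sqlite3

    con = sqlite3.connect(":memory:")
    cur = con.cursor()
    cur.execute("""
        CREATE TABLE ManagerSubordinate (
            ManagerID     INTEGER,
            SubordinateID INTEGER PRIMARY KEY  -- one manager per subordinate
        )""")
    cur.executemany("INSERT INTO ManagerSubordinate VALUES (?, ?)",
                    [(100, 201), (100, 202), (101, 203)])

    # A manager with many subordinates is simply more rows (no schema change),
    # and duplicate employee names no longer collide because IDs are unique.
    print(cur.execute("SELECT SubordinateID FROM ManagerSubordinate "
                      "WHERE ManagerID = 100").fetchall())  # [(201,), (202,)]
    con.close()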
SECOND NORMAL FORM (2NF)
Second Normal Form (2NF) builds on these rules for an organized
database:
Meet all of the requirements of First Normal Form.
Remove subsets of data that apply to multiple rows of a table and
place them in separate tables.
Create relationships between these new tables and their
predecessors through the use of foreign keys.
These rules can be summarized in a simple statement: 2NF attempts to
reduce the amount of redundant data in a table by extracting it, placing
it in new table(s) and creating relationships between those tables.
Let's look at an example. Imagine an online store that maintains
customer information in a database. They might have a single table
called Customers with the following elements:
CustNum
FirstName
LastName
Address
City
State
ZIP
A brief look at typical data for this table reveals a small amount of
redundancy. Suppose, for example, that two customers live in Sea Cliff,
NY 11579 and two more in Miami, FL 33157: we'd be storing each
city/state/ZIP combination twice. Now, that might not seem like too much
added storage in our
simple example, but imagine the wasted space if we had thousands of
rows in our table. Additionally, if the ZIP code for Sea Cliff were to
change, we'd need to make that change in many places throughout the
database.
In a 2NF-compliant database structure, this redundant information is
extracted and stored in a separate table. Our new table (let's call it ZIPs)
might have the following fields:
ZIP
City
State
If we want to be super-efficient, we can even fill this table in advance --
the post office provides a directory of all valid ZIP codes and their
city/state relationships. Surely, you've encountered a situation where
this type of database was utilized. Someone taking an order might have
asked you for your ZIP code first and then knew the city and state you
were calling from. This type of arrangement reduces operator error and
increases efficiency.
Now that we've removed the duplicative data from the Customers table,
we've satisfied the first rule of second normal form. We still need to use a
foreign key to tie the two tables together. We'll use the ZIP code (the
primary key from the ZIPs table) to create that relationship. Here's our
new Customers table:
CustNum
FirstName
LastName
Address
ZIP
We've now minimized the amount of redundant information stored within
the database and our structure is in second normal form.
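The following Python/sqlite3 sketch shows one way to express this 2NF
structure; the customer names and street addresses are invented, while the
ZIP, city and state values come from the example above.

    import sqlite3

    con = sqlite3.connect(":memory:")
    cur = con.cursor()
    cur.execute("PRAGMA foreign_keys = ON")
    cur.execute("CREATE TABLE ZIPs (ZIP TEXT PRIMARY KEY, City TEXT, State TEXT)")
    cur.execute("""
        CREATE TABLE Customers (
            CustNum   INTEGER PRIMARY KEY,
            FirstName TEXT,
            LastName  TEXT,
            Address   TEXT,
            ZIP       TEXT REFERENCES ZIPs(ZIP)
        )""")
    cur.executemany("INSERT INTO ZIPs VALUES (?, ?, ?)",
                    [("11579", "Sea Cliff", "NY"), ("33157", "Miami", "FL")])
    cur.executemany("INSERT INTO Customers VALUES (?, ?, ?, ?, ?)",
                    [(1, "John", "Doe", "12 Main St", "11579"),
                     (2, "Jane", "Roe", "34 Oak Ave", "11579")])

    # City and state for 11579 are stored exactly once, so a correction is a
    # single-row update instead of one change per customer row.
    cur.execute("UPDATE ZIPs SET City = 'Sea Cliff Village' "
                "WHERE ZIP = '11579'")  # hypothetical correction
    print(cur.execute("""
        SELECT c.LastName, z.City, z.State
        FROM Customers c JOIN ZIPs z ON c.ZIP = z.ZIP""").fetchall())
    con.close()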
THIRD NORMAL FORM (3NF)
There are two basic requirements for a database to be in third normal
form:
Already meet the requirements of both 1NF and 2NF
Remove columns that are not fully dependent upon the primary
key (for example, columns whose values can be derived from other,
non-key columns).
Imagine that we have a table of widget orders that contains the following
attributes:
Order Number
Customer Number
Unit Price
Quantity
Total
Remember, our first requirement is that the table must satisfy the
requirements of 1NF and 2NF. Are there any duplicative columns? No.
Do we have a primary key? Yes, the order number. Therefore, we satisfy
the requirements of 1NF. Are there any subsets of data that apply to
multiple rows? No, so we also satisfy the requirements of 2NF.
Now, are all of the columns fully dependent upon the primary key? The
customer number varies with the order number and it doesn't appear to
depend upon any of the other fields. What about the unit price? This field
could be dependent upon the customer number in a situation where we
charged each customer a set price. However, in our example the same
customer may be charged different prices on different orders. Therefore,
the unit price is fully dependent upon the order number. The
quantity of items also varies from order to order, so we're OK there.
What about the total? It looks like we might be in trouble here. The total
can be derived by multiplying the unit price by the quantity; therefore, it's
not fully dependent upon the primary key. We must remove it from the
table to comply with the third normal form. Perhaps we use the following
attributes:
Order Number
Customer Number
Unit Price
Quantity
Now our table is in 3NF.
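A brief Python/sqlite3 sketch of the resulting 3NF table follows; the order
numbers, prices and quantities are invented. The total is computed in the
query instead of being stored.

    import sqlite3

    con = sqlite3.connect(":memory:")
    cur = con.cursor()
    cur.execute("""
        CREATE TABLE Orders (
            OrderNumber INTEGER PRIMARY KEY,
            CustNum     INTEGER,
            UnitPrice   REAL,
            Quantity    INTEGER
        )""")
    cur.executemany("INSERT INTO Orders VALUES (?, ?, ?, ?)",
                    [(1001, 1, 2.50, 4), (1002, 2, 3.00, 10)])

    # The total is derived at query time, so it can never disagree with the
    # unit price and quantity it comes from.
    print(cur.execute("SELECT OrderNumber, UnitPrice * Quantity AS Total "
                      "FROM Orders").fetchall())  # [(1001, 10.0), (1002, 30.0)]
    con.close()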