0% found this document useful (0 votes)

14 views7 pages

Normalization

Normalization in DBMS is a technique to organize data in database tables by reducing data redundancy and establishing proper relationships between tables. It involves a multi-step process that includes First Normal Form (1NF), Second Normal Form (2NF), and Third Normal Form (3NF), each addressing specific types of data dependencies. The main goals of normalization are to ensure data integrity, consistency, and efficient storage by breaking down large tables into smaller, logically related ones.

Uploaded by

jlurker77

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views7 pages

Normalization

Uploaded by

jlurker77

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Normalization in DBMS

Normalization in DBMS is a technique using which you can organize the data in the database tables so
that:

• There is less repetition of data,

• A large set of data is structured into a bunch of smaller tables,
• and the tables have a proper relationship between them.

DBMS Normalization is a systematic approach to decompose (break down) tables to eliminate data
redundancy(repetition) and undesirable characteristics like Insertion anomaly in DBMS, Update anomaly
in DBMS, and Delete anomaly in DBMS.

It is a multi-step process that puts data into tabular form, removes duplicate data, and set up the
relationship between tables.

Why we need Normalization in DBMS?

Normalization is required for,

• Eliminating redundant(useless) data, therefore handling data integrity, because if data is

repeated it increases the chances of inconsistent data.
• Normalization helps in keeping data consistent by storing the data in one table and referencing it
everywhere else.
• Storage optimization although that is not an issue these days because Database storage is cheap.
• Breaking down large tables into smaller tables with relationships, so it makes the database
structure more scalable and adaptable.
• Ensuring data dependencies make sense i.e., data is logically stored.

Primary Key and Non-key attributes

Types of DBMS Normal forms
First Normal Form (1NF)
• For a table to be in the First Normal Form, it should follow the following 4 rules:
o It should only have single (atomic) valued attributes/columns.
o Values stored in a column should be of the same domain.
o All the columns in a table should have unique names.
o And the order in which data is stored should not matter.

Let's see an example.

If we have an employee table in which we store the employee information along with the employee
skillset, the table will look like this:

The above table has 4 columns:

• All the columns have different names.

• All the columns hold values of the same type like emp_name has all the names, emp_mobile
has all the contact numbers, etc.
• The order in which we save data doesn't matter
• But the emp_skills column holds multiple comma-separated values, while as per the First
Normal form, each column should have a single value.

Hence the above table fails to pass the First Normal form.

So how do you fix the above table? There are two ways to do this:

1. Remove the emp_skills column from the Employee table and keep it in some other table.
2. Or add multiple rows for the employee and each row is linked with one skill.
Create Separate tables for Employee and Employee Skills

So, the Employee table will look like this,

And the new Employee_Skill table:

Add Multiple rows for Multiple skills

You can also simply add multiple rows to add multiple skills. This will lead to repetition of the data, but
that can be handled as you further Normalize your data using the Second Normal form and the Third
Normal form.

Second Normal Form (2NF)

For a table to be in the Second Normal Form,

1. It should be in the First Normal form.

2. And, it should not have Partial Dependency.

Let's take an example to understand Partial dependency and the Second Normal Form.

What is Partial Dependency?

When a table has a primary key that is made up of two or more columns, then all the columns (not
included in the primary key) in that table should depend on the entire primary key and not on a part of
it. If any column (which is not in the primary key) depends on a part of the primary key then we say we
have Partial dependency in the table.
Confused? Let's take an example.

If we have two tables’ Students and Subjects, to store student information and information related to
subjects.

Student table:

And we have another table Score to store the marks scored by students in any subject like this,

Now in the above table, the primary key is student_id + subject_id, because both this information is
required to select any row of data.

But in the Score table, we have a column teacher_name, which depends on the subject information or
just the subject_id, so we should not keep that information in the Score table.

The column teacher_name should be in the Subjects table. And then the entire system will be
Normalized as per the Second Normal Form.
Updated Subject table:

Updated Score table:

Third Normal Form (3NF)

A table is said to be in the Third Normal Form when,

1. It satisfies the First Normal Form and the Second Normal form.
2. And, it doesn't have Transitive Dependency.

What is Transitive Dependency?

In a table we have some column that acts as the primary key and other columns depends on this column.
But what if a column that is not the primary key depends on another column that is also not a primary
key or part of it? Then we have Transitive dependency in our table.

Let's take an example. We had the Score table in the Second Normal Form above. If we have to store
some extra information in it, like,

1. exam_type
2. total_marks
To store the type of exam and the total marks in the exam so that we can later calculate the percentage
of marks scored by each student.

The Score table will look like this,

In the table above, the column exam_type depends on both student_id and subject_id,
because,
1. a student can be in the CSE branch or the Mechanical branch,
2. and based on that they may have different exam types for different subjects.
3. The CSE students may have both Practical and Theory for Compiler Design,
4. whereas Mechanical branch students may only have Theory exams for Compiler Design.
But the column total_marks just depend on the exam_type column. And the exam_type
column is not a part of the primary key. Because the primary key is student_id + subject_id,
hence we have a Transitive dependency here.
How to Transitive Dependency?
You can create a separate table for ExamType and use it in the Score table.
New ExamType table,

We have created a new table ExamType and we have added more related information in it like
duration (duration of exam in mins.), and now we can use the exam_type_id in the Score table.

1NF, 2NF, 3NF New
No ratings yet
1NF, 2NF, 3NF New
20 pages
Database Management System - 2 - 1753699708974 Conv
No ratings yet
Database Management System - 2 - 1753699708974 Conv
13 pages
Normalization FNL
No ratings yet
Normalization FNL
14 pages
DBMS Unit-4 Notes
No ratings yet
DBMS Unit-4 Notes
18 pages
Normalization in DBMS
No ratings yet
Normalization in DBMS
14 pages
Normalization of Database
No ratings yet
Normalization of Database
10 pages
Normalization Lesson
No ratings yet
Normalization Lesson
13 pages
NORMALIZATION
No ratings yet
NORMALIZATION
11 pages
Normalization
No ratings yet
Normalization
36 pages
Understanding Normalization in DBMS
No ratings yet
Understanding Normalization in DBMS
10 pages
DBMS Unit3
No ratings yet
DBMS Unit3
57 pages
Normal Forms
No ratings yet
Normal Forms
30 pages
Week 2
No ratings yet
Week 2
34 pages
Database Normalization
No ratings yet
Database Normalization
44 pages
KKS Normalization
No ratings yet
KKS Normalization
16 pages
Database Normalization and Dependencies
No ratings yet
Database Normalization and Dependencies
65 pages
1NF, 2NF
No ratings yet
1NF, 2NF
9 pages
Normalization
No ratings yet
Normalization
15 pages
DB Week 10 Lec 1
No ratings yet
DB Week 10 Lec 1
32 pages
DB 2
No ratings yet
DB 2
15 pages
Database Normalization Explained
No ratings yet
Database Normalization Explained
33 pages
Unit 5: Data Normalization
No ratings yet
Unit 5: Data Normalization
27 pages
CSC2243-Databases-Part III
No ratings yet
CSC2243-Databases-Part III
60 pages
NORMALIZATION
No ratings yet
NORMALIZATION
6 pages
Normalization of Database-Ass-2
No ratings yet
Normalization of Database-Ass-2
31 pages
Understanding Second Normal Form (2NF)
100% (1)
Understanding Second Normal Form (2NF)
36 pages
CH 9
No ratings yet
CH 9
75 pages
Normalization in DBMS
No ratings yet
Normalization in DBMS
16 pages
Normalization in DBMS
No ratings yet
Normalization in DBMS
17 pages
12 Normalization
No ratings yet
12 Normalization
41 pages
12.1 Manupulating Data - Relational Data Base
No ratings yet
12.1 Manupulating Data - Relational Data Base
25 pages
Module3 PartB
No ratings yet
Module3 PartB
41 pages
Functional Dependency Notes
No ratings yet
Functional Dependency Notes
52 pages
DBMS, Unit-5
No ratings yet
DBMS, Unit-5
9 pages
Normalization
No ratings yet
Normalization
11 pages
Normalization in SQL
No ratings yet
Normalization in SQL
12 pages
Unit 3 (KCS501)
No ratings yet
Unit 3 (KCS501)
13 pages
Normalization
No ratings yet
Normalization
9 pages
3512916071
No ratings yet
3512916071
2 pages
Database Normalization Guide
No ratings yet
Database Normalization Guide
23 pages
NORMALIZATION Notes
No ratings yet
NORMALIZATION Notes
5 pages
DBMS Normalization
100% (1)
DBMS Normalization
53 pages
Normalization
No ratings yet
Normalization
17 pages
RDBMS Unit 4
No ratings yet
RDBMS Unit 4
15 pages
Third No
No ratings yet
Third No
6 pages
Database Normalization Guide
No ratings yet
Database Normalization Guide
18 pages
Database Normalization Basics
No ratings yet
Database Normalization Basics
61 pages
DBMS Unit-III
No ratings yet
DBMS Unit-III
42 pages
Database Normalization Explained
No ratings yet
Database Normalization Explained
17 pages
Understanding Database Normalization
No ratings yet
Understanding Database Normalization
14 pages
Normalization
No ratings yet
Normalization
13 pages
Int 306 Normalization
No ratings yet
Int 306 Normalization
66 pages
Normalization Notes
No ratings yet
Normalization Notes
7 pages
Normalization Dbms
No ratings yet
Normalization Dbms
20 pages
Normalization
No ratings yet
Normalization
19 pages
Unit III Dbms
No ratings yet
Unit III Dbms
8 pages
Normalization
No ratings yet
Normalization
21 pages
Chapter 5
No ratings yet
Chapter 5
29 pages
Enterprise Data Warehouse (EDW) Full Guide
No ratings yet
Enterprise Data Warehouse (EDW) Full Guide
20 pages
Ijwis 05 2013 0014
No ratings yet
Ijwis 05 2013 0014
17 pages
Database
No ratings yet
Database
5 pages
Kid Chef
No ratings yet
Kid Chef
3 pages
Oracle SQL
100% (3)
Oracle SQL
29 pages
Upload A Document To Access Your Download: How Food Works - The Facts Visually Explained (2017) (DK Publishing) PDF
No ratings yet
Upload A Document To Access Your Download: How Food Works - The Facts Visually Explained (2017) (DK Publishing) PDF
3 pages
Normalization Based K Means Clustering Algorithm
No ratings yet
Normalization Based K Means Clustering Algorithm
5 pages
Knowledge Management Systems Guide
No ratings yet
Knowledge Management Systems Guide
27 pages
Oyelade K Mean1002.2425
No ratings yet
Oyelade K Mean1002.2425
5 pages
Linux Privilege Escalation via Shared Libraries
No ratings yet
Linux Privilege Escalation via Shared Libraries
11 pages
Deepak Rawat
No ratings yet
Deepak Rawat
2 pages
The Journal of Academic Librarianship: Geoffrey Little
No ratings yet
The Journal of Academic Librarianship: Geoffrey Little
3 pages
AIML Lab Programs
No ratings yet
AIML Lab Programs
13 pages
RightFind XML Open Access
No ratings yet
RightFind XML Open Access
1 page
Data Anonymization Techniques Overview
No ratings yet
Data Anonymization Techniques Overview
19 pages
Overview of Information Systems Components
No ratings yet
Overview of Information Systems Components
3 pages
Big Data Tech Guide for Organizations
No ratings yet
Big Data Tech Guide for Organizations
8 pages
Data Warehousing Concepts Transparencies: © Pearson Education Limited 1995, 2005
No ratings yet
Data Warehousing Concepts Transparencies: © Pearson Education Limited 1995, 2005
58 pages
Cs8080 Irt Unit 1 PDF
No ratings yet
Cs8080 Irt Unit 1 PDF
28 pages
Database Fundamentals Overview
No ratings yet
Database Fundamentals Overview
13 pages
IoT Wiki Guide for IT Technicians
No ratings yet
IoT Wiki Guide for IT Technicians
8 pages
GL R12 Techincal Changes
No ratings yet
GL R12 Techincal Changes
15 pages
LP VI Bi Lab Manual
No ratings yet
LP VI Bi Lab Manual
28 pages
4.1 - Data Preprocessing
No ratings yet
4.1 - Data Preprocessing
28 pages
Data Cleaning: Definition
No ratings yet
Data Cleaning: Definition
2 pages
Types of User Interfaces Explained
No ratings yet
Types of User Interfaces Explained
25 pages
Dbms Lab2 Ce097
No ratings yet
Dbms Lab2 Ce097
13 pages
Interactive Application Development Guide
No ratings yet
Interactive Application Development Guide
61 pages
APS
No ratings yet
APS
52 pages
Sabre Profiles Knowledge Share 2025
No ratings yet
Sabre Profiles Knowledge Share 2025
15 pages

Normalization

Uploaded by

Normalization

Uploaded by

Normalization in DBMS

• There is less repetition of data,

Why we need Normalization in DBMS?

• Eliminating redundant(useless) data, therefore handling data integrity, because if data is

Primary Key and Non-key attributes

Let's see an example.

The above table has 4 columns:

• All the columns have different names.

So, the Employee table will look like this,

And the new Employee_Skill table:

Second Normal Form (2NF)

1. It should be in the First Normal form.

What is Partial Dependency?

Updated Score table:

Third Normal Form (3NF)

What is Transitive Dependency?

The Score table will look like this,

You might also like