0% found this document useful (0 votes)

29 views14 pages

Normalization FNL

Uploaded by

neupaneroshni7

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views14 pages

Normalization FNL

Uploaded by

neupaneroshni7

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

Normalization in DBMS - 1NF, 2NF, 3NF,

BCNF, 4NF

Normalization in DBMS is a technique using which you can organize the

data in the database tables so that:

There is less repetition of data,

A large set of data is structured into a bunch of smaller tables,

and the tables have a proper relationship between them.

DBMS Normalization is a systematic approach to decompose (break

down) tables to eliminate data redundancy(repetition) and undesirable
characteristics like Insertion anomaly in DBMS, Update anomaly in
DBMS, and Delete anomaly in DBMS.

It is a multi-step process that puts data into tabular form, removes

duplicate data, and set up the relationship between tables.

Why we need Normalization in DBMS?

Normalization is required for,

Eliminating redundant(useless) data, therefore handling data integrity,

because if data is repeated it increases the chances of inconsistent data.

Normalization helps in keeping data consistent by storing the data in

one table and referencing it everywhere else.

1 of 16 9/12/2024, 8:29 PM
..

Storage optimization although that is not an issue these days because

Database storage is cheap.

Breaking down large tables into smaller tables with relationships, so it

makes the database structure more scalable and adaptable.

Ensuring data dependencies make sense i.e. data is logically stored.

Problems without Normalization in DBMS

If a table is not properly normalized and has data redundancy(repetition)

then it will not only eat up extra memory space but will also make it
difficult for you to handle and update the data in the database, without
losing data.

Insertion, Updation, and Deletion Anomalies are very frequent if the

database is not normalized.

To understand these anomalies let us take an example of a Student

table.

rollno name branch hod office_tel

401 Akon CSE Mr. X 53337

402 Bkon CSE Mr. X 53337

403 Ckon CSE Mr. X 53337

404 Dkon CSE Mr. X 53337

In the table above, we have data for four Computer Sci. students.

As we can see, data for the fields branch, hod(Head of Department),

2 of 16 9/12/2024, 8:29 PM
and office_tel are repeated for the students who are in the same branch
in the college, this is Data Redundancy.

1. Insertion Anomaly in DBMS

Suppose for a new admission, until and unless a student opts for a
branch, data of the student cannot be inserted, or else we will have to set
the branch information as NULL.

Also, if we have to insert data for 100 students of the same branch, then
the branch information will be repeated for all those 100 students.

These scenarios are nothing but Insertion anomalies.

If you have to repeat the same data in every row of data, it's better to
keep the data separately and reference that data in each row.

So in the above table, we can keep the branch information separately,

and just use the branch_id in the student table, where branch_id can
be used to get the branch information.

2. Updation Anomaly in DBMS

What if Mr. X leaves the college? or Mr. X is no longer the HOD of the
computer science department? In that case, all the student records will
have to be updated, and if by mistake we miss any record, it will lead to
data inconsistency.

This is an Updation anomaly because you need to update all the records
in your table just because one piece of information got changed.

3. Deletion Anomaly in DBMS

In our Student table, two different pieces of information are kept

together, the Student information and the Branch information.

3 of 16 9/12/2024, 8:29 PM
So if only a single student is enrolled in a branch, and that student leaves
the college, or for some reason, the entry for the student is deleted, we
will lose the branch information too.

So never in DBMS, we should keep two different entities together, which

in the above example is Student and branch,

The solution for all the three anomalies described above is to keep the
student information and the branch information in two different tables.
And use the branch_id in the student table to reference the branch.

Primary Key and Non-key attributes

Before we move on to learn different Normal Forms in DBMS, let's first

understand what is a primary key and what are non-key attributes.

Primary key and non-key attribute

As you can see in the table above, the student_id column is a primary
key because using the student_id value we can uniquely identify each
row of data, hence the remaining columns then become the non-key
attributes.

Types of DBMS Normal forms

Normalization rules are divided into the following normal forms:

1. First Normal Form

2. Second Normal Form

3. Third Normal Form

4. BCNF

5. Fourth Normal Form

4 of 16 9/12/2024, 8:29 PM
A relation is in 1NF if it contains an atomic value;
1. First Normal Form (1NF) eliminate repeating groups

For a table to be in the First Normal Form, it should follow the following 4
rules:

1. It should only have single(atomic) valued attributes/columns.

2. Values stored in a column should be of the same domain.

3. All the columns in a table should have unique names.

4. And the order in which data is stored should not matter.

Let's see an example.

If we have an Employee table in which we store the employee

information along with the employee skillset, the table will look like this:

emp_id emp_name emp_mobile emp_skills

1 John Tick 9999957773 Python, JavaScript

2 Darth Trader 8888853337 HTML, CSS, JavaScript

3 Rony Shark 7777720008 Java, Linux, C++

The above table has 4 columns:

All the columns have different names.

All the columns hold values of the same type like emp_name has all the
names, emp_mobile has all the contact numbers, etc.

The order in which we save data doesn't matter

5 of 16 9/12/2024, 8:29 PM
But the emp_skills column holds multiple comma-separated values,
while as per the First Normal form, each column should have a single
value.

Hence the above table fails to pass the First Normal form.

So how do you fix the above table? There are two ways to do this:

1. Remove the emp_skills column from the Employee table and keep it in
some other table.

2. Or add multiple rows for the employee and each row is linked with one
skill.

1. Create Separate tables for Employee and Employee Skills

So the Employee table will look like this,

emp_id emp_name emp_mobile

1 John Tick 9999957773

2 Darth Trader 8888853337

3 Rony Shark 7777720008

And the new Employee_Skill table:

emp_id emp_skill

1 Python

1 JavaScript

2 HTML

2 CSS

2 JavaScript

6 of 16 9/12/2024, 8:29 PM
emp_id emp_skill

3 Java

3 Linux

3 C++

2. Add Multiple rows for Multiple skills

You can also simply add multiple rows to add multiple skills. This will lead
to repetition of the data, but that can be handled as you further Normalize
your data using the Second Normal form and the Third Normal form.

emp_id emp_name emp_mobile emp_skill

1 John Tick 9999957773 Python

1 John Tick 9999957773 JavaScript

2 Darth Trader 8888853337 HTML

2 Darth Trader 8888853337 CSS

2 Darth Trader 8888853337 JavaScript

3 Rony Shark 7777720008 Java

3 Rony Shark 7777720008 Linux

3 Rony Shark 7777720008 C++

A relation will be in 2NF if

2. Second Normal Form (2NF) it is in 1NF
and all non-key attributes are fully functional
dependent on the primary key.
For a table to be in the Second Normal Form,

7 of 16 9/12/2024, 8:29 PM
Normalization in DBMS - 1NF, 2NF, 3NF, BCNF, 4NF and 5NF | Study... about:reader?url=https%3A%2F%2Fwww.studytonight.com%2Fdbm...

1. It should be in the First Normal form.

2. And, it should not have Partial Dependency.

Let's take an example to understand Partial dependency and the

Second Normal Form.

What is Partial Dependency?

When a table has a primary key that is made up of two or more columns,
then all the columns(not included in the primary key) in that table should
depend on the entire primary key and not on a part of it. If any
column(which is not in the primary key) depends on a part of the primary
key then we say we have Partial dependency in the table.

Confused? Let's take an example.

If we have two tables Students and Subjects, to store student information

and information related to subjects.

Student table:

student_id student_name branch

1 Akon CSE

2 Bkon Mechanical

Subject Table:

subject_id subject_name

1 C Language

2 DSA

8 of 16 9/12/2024, 8:29 PM
subject_id subject_name

3 Operating System

And we have another table Score to store the marks scored by students
in any subject like this,

student_id subject_id marks teacher_name

1 1 70 Miss. C

1 2 82 Mr. D

2 1 65 Mr. Op

Now in the above table, the primary key is student_id + subject_id,

because both these information are required to select any row of data.

But in the Score table, we have a column teacher_name, which

depends on the subject information or just the subject_id, so we should
not keep that information in the Score table.

The column teacher_name should be in the Subjects table. And then

the entire system will be Normalized as per the Second Normal Form.

Updated Subject table:

subject_id subject_name teacher_name

1 C Language Miss. C

2 DSA Mr. D

3 Operating System Mr. Op

Updated Score table:

student_id subject_id marks

9 of 16 9/12/2024, 8:29 PM
student_id subject_id marks

1 1 70

1 2 82

2 1 65

3. Third Normal Form (3NF)

A table is said to be in the Third Normal Form when,

1. It satisfies the First Normal Form and the Second Normal form.

2. And, it doesn't have Transitive Dependency.

What is Transitive Dependency?

In a table we have some column that acts as the primary key and other
columns depends on this column. But what if a column that is not the
primary key depends on another column that is also not a primary key or
part of it? Then we have Transitive dependency in our table.

Let's take an example. We had the Score table in the Second Normal
Form above. If we have to store some extra information in it, like,

1. exam_type

2. total_marks

To store the type of exam and the total marks in the exam so that we can

10 of 16 9/12/2024, 8:29 PM
later calculate the percentage of marks scored by each student.

The Score table will look like this,

student_id subject_id marks exam_type total_marks

1 1 70 Theory 100

1 2 82 Theory 100

2 1 42 Practical 50

In the table above, the column exam_type depends on both student_id

and subject_id, because,

a student can be in the CSE branch or the Mechanical branch,

and based on that they may have different exam types for different
subjects.

The CSE students may have both Practical and Theory for Compiler
Design,

whereas Mechanical branch students may only have Theory exams for
Compiler Design.

But the column total_marks just depends on the exam_type column.

And the exam_type column is not a part of the primary key. Because the
primary key is student_id + subject_id, hence we have a Transitive
dependency here.

How to Transitive Dependency?

You can create a separate table for ExamType and use it in the Score
table.

New ExamType table,

11 of 16 9/12/2024, 8:29 PM
exam_type_id exam_type total_marks duration

1 Practical 50 45

2 Theory 100 180

3 Workshop 150 300

We have created a new table ExamType and we have added more

related information in it like duration(duration of exam in mins.), and now
we can use the exam_type_id in the Score table.

4. Boyce-Codd Normal Form (BCNF)

Boyce and Codd Normal Form is a higher version of the Third Normal
Form.

This form deals with a certain type of anomaly that is not handled by 3NF.

A 3NF table that does not have multiple overlapping candidate keys
is said to be in BCNF.

For a table to be in BCNF, the following conditions must be satisfied:

R must be in the 3rd Normal Form

and, for each functional dependency ( X → Y ), X should be a Super Key.

12 of 16 9/12/2024, 8:29 PM
5. Fourth Normal Form (4NF)

A table is said to be in the Fourth Normal Form when,

1. It is in the Boyce-Codd Normal Form.

2. And, it doesn't have Multi-Valued Dependency.

13 of 16 9/12/2024, 8:29 PM
Database Normalization helps you design and structure your table
properly so that you have proper relationships between tables. It helps
you with the following:

1. Data Integrity

2. Data consistency

3. Better relationship between tables

4. More scalable design for tables.

5. No large tables, small tables with a proper relationship.

6. Removing dependencies, like Partial Dependency, Transitive

Dependency, Join Dependency, etc.

Advantages of Data Normalization

The process of normalizing a database has numerous advantages. The following are a few key benefits:

Utilizing database or data redundancy through normalization

Duplication may be eliminated.
By normalizing, we may reduce null values.
Results in a smaller database (since there is less data duplication or zero).
Minimize/avoid issues with data modification.
It makes the queries easier.
The database structure is more comprehensible and straightforward.
Existing data can be added to the database without having an impact.
Because the table is compact and more rows can fit on the data page, finding, sorting, as well as
indexing may be quicker.

14 of 16 9/12/2024, 8:29 PM

DBMS Unit-4 Notes
No ratings yet
DBMS Unit-4 Notes
18 pages
NORMALIZATION
No ratings yet
NORMALIZATION
11 pages
Understanding Normalization in DBMS
No ratings yet
Understanding Normalization in DBMS
10 pages
Normalization in DBMS
No ratings yet
Normalization in DBMS
14 pages
Database Management System - 2 - 1753699708974 Conv
No ratings yet
Database Management System - 2 - 1753699708974 Conv
13 pages
Normalization
No ratings yet
Normalization
7 pages
Week 2
No ratings yet
Week 2
34 pages
1NF, 2NF
No ratings yet
1NF, 2NF
9 pages
Database Normalization Guide
No ratings yet
Database Normalization Guide
18 pages
Understanding Database Normalization
No ratings yet
Understanding Database Normalization
14 pages
Normalization Lesson
No ratings yet
Normalization Lesson
13 pages
DBMS, Unit-5
No ratings yet
DBMS, Unit-5
9 pages
DBMS Unit 3
No ratings yet
DBMS Unit 3
33 pages
RDBMS Normalization
No ratings yet
RDBMS Normalization
29 pages
DBMS Unit3
No ratings yet
DBMS Unit3
57 pages
Normalization in DBMS
No ratings yet
Normalization in DBMS
9 pages
Normalization of Database-Ass-2
No ratings yet
Normalization of Database-Ass-2
31 pages
Normalization in DBMS
No ratings yet
Normalization in DBMS
17 pages
Normalization in DBMS
No ratings yet
Normalization in DBMS
7 pages
2.chapter 5
No ratings yet
2.chapter 5
21 pages
Normalization in DBMS
No ratings yet
Normalization in DBMS
11 pages
DB 2
No ratings yet
DB 2
15 pages
DB NOMILIZATION AND CONSTRAINTS
No ratings yet
DB NOMILIZATION AND CONSTRAINTS
9 pages
12.1 Manupulating Data - Relational Data Base
No ratings yet
12.1 Manupulating Data - Relational Data Base
25 pages
KKS Normalization
No ratings yet
KKS Normalization
16 pages
Unit 5: Data Normalization
No ratings yet
Unit 5: Data Normalization
27 pages
Normalization Unit2
No ratings yet
Normalization Unit2
9 pages
Normalization in DBMS
No ratings yet
Normalization in DBMS
16 pages
Normalization
No ratings yet
Normalization
17 pages
Normal Forms
No ratings yet
Normal Forms
30 pages
Understanding Database Normalization Anomalies
No ratings yet
Understanding Database Normalization Anomalies
10 pages
Normalization
No ratings yet
Normalization
9 pages
12 Normalization
No ratings yet
12 Normalization
41 pages
DBMS Chap 3
No ratings yet
DBMS Chap 3
17 pages
Normalization
No ratings yet
Normalization
19 pages
1NF, 2NF, 3NF New
No ratings yet
1NF, 2NF, 3NF New
20 pages
22it305 - Database Management Systems: 1.1 Relation For Every Entity
No ratings yet
22it305 - Database Management Systems: 1.1 Relation For Every Entity
11 pages
DBMS Unit-III
No ratings yet
DBMS Unit-III
42 pages
Normalization in DBMS
No ratings yet
Normalization in DBMS
13 pages
Unit 3 (KCS501)
No ratings yet
Unit 3 (KCS501)
13 pages
DBMS Notes
No ratings yet
DBMS Notes
7 pages
Normalization
No ratings yet
Normalization
17 pages
DBMS Normalization: 1NF to BCNF Explained
No ratings yet
DBMS Normalization: 1NF to BCNF Explained
10 pages
Normalization of Database
No ratings yet
Normalization of Database
10 pages
Understanding Database Normalization Techniques
No ratings yet
Understanding Database Normalization Techniques
14 pages
RDBMS Unit 4
No ratings yet
RDBMS Unit 4
15 pages
NORMALIZATION
No ratings yet
NORMALIZATION
4 pages
Unit III Dbms
No ratings yet
Unit III Dbms
8 pages
Database Normalization and Dependencies
No ratings yet
Database Normalization and Dependencies
65 pages
Understanding Database Normalization
No ratings yet
Understanding Database Normalization
14 pages
Normalization
No ratings yet
Normalization
11 pages
Database Normalization Guide
No ratings yet
Database Normalization Guide
23 pages
Functional Dependency Notes
No ratings yet
Functional Dependency Notes
52 pages
Database Normalization Explained
No ratings yet
Database Normalization Explained
33 pages
Normlaization in DBMS
No ratings yet
Normlaization in DBMS
5 pages
Week 1
No ratings yet
Week 1
11 pages
DBMS 2: Anomalies + Normalization PDF
No ratings yet
DBMS 2: Anomalies + Normalization PDF
10 pages
Recently Asked Data Analyst Interview Questions
No ratings yet
Recently Asked Data Analyst Interview Questions
4 pages
IIPU UnitTest CS 70MarksJune2025-1
No ratings yet
IIPU UnitTest CS 70MarksJune2025-1
3 pages
Text Analysis for Beginners
No ratings yet
Text Analysis for Beginners
8 pages
Scientific Literature Search Guide
No ratings yet
Scientific Literature Search Guide
32 pages
3 Hours / 70 Marks: Seat No
No ratings yet
3 Hours / 70 Marks: Seat No
4 pages
BCA Assignments For Students
No ratings yet
BCA Assignments For Students
4 pages
Android Activity Basics
No ratings yet
Android Activity Basics
19 pages
ITPC206 DBMS Mid Term Exam 2023-24
No ratings yet
ITPC206 DBMS Mid Term Exam 2023-24
1 page
BDD Automation GUI Testing with Cucumber
No ratings yet
BDD Automation GUI Testing with Cucumber
6 pages
Veera Resume ETL TESTER 4 Years
No ratings yet
Veera Resume ETL TESTER 4 Years
4 pages
Coursera Ibm Data
No ratings yet
Coursera Ibm Data
1 page
Gujarat Technological University: Mca - Semester Ii-Examination - Winter-2024
No ratings yet
Gujarat Technological University: Mca - Semester Ii-Examination - Winter-2024
2 pages
Emasters in Data Science Data Analytics
No ratings yet
Emasters in Data Science Data Analytics
12 pages
MS-DOS Command Guide
No ratings yet
MS-DOS Command Guide
13 pages
1 Bit 20803 Chapter 1 Dbconcepts
No ratings yet
1 Bit 20803 Chapter 1 Dbconcepts
41 pages
Literature Review Museum
100% (3)
Literature Review Museum
4 pages
Computer Aided Software Engineering
No ratings yet
Computer Aided Software Engineering
9 pages
Govind Barbade Resume
No ratings yet
Govind Barbade Resume
3 pages
Database Lifecycle Guide
100% (2)
Database Lifecycle Guide
4 pages
Chapter 2 Data Models
No ratings yet
Chapter 2 Data Models
75 pages
Group Assignment
No ratings yet
Group Assignment
2 pages
S66020 Information Technology Part A 31761H U2 AddSAM
100% (1)
S66020 Information Technology Part A 31761H U2 AddSAM
11 pages
AI Innovations in Vascular Surgery
100% (1)
AI Innovations in Vascular Surgery
16 pages
Cad Seattle
No ratings yet
Cad Seattle
114 pages
Sas Master Data Management Factsheet
No ratings yet
Sas Master Data Management Factsheet
4 pages
SQL Database Interview Questions
No ratings yet
SQL Database Interview Questions
16 pages
Backend Engineering Intern Case Study
No ratings yet
Backend Engineering Intern Case Study
4 pages
Introduction To Clinical Informatics - 2020
No ratings yet
Introduction To Clinical Informatics - 2020
24 pages
CBIS
No ratings yet
CBIS
16 pages
CP4094 - MPC - As2
No ratings yet
CP4094 - MPC - As2
3 pages