0% found this document useful (0 votes)

15 views25 pages

6.data Normalization Process and The Normal Forms

The document discusses the process of data normalization in relational databases, outlining the importance of organizing attributes to avoid redundancies and anomalies such as insertion, deletion, and update anomalies. It details the three normal forms (1NF, 2NF, 3NF) and their definitions, emphasizing functional dependencies and the significance of candidate and primary keys. The document also highlights the need for normalization to ensure a well-structured database schema.

Uploaded by

itsme789yo

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views25 pages

6.data Normalization Process and The Normal Forms

Uploaded by

itsme789yo

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

• Data Normalization process and the normal forms

• Introduction to data normalization

• 1st Normal Form(1st NF)

• 2nd Normal Form (2nd NF)

• 3rd Normal Form (3rd NF)

Introduction
• Conceptual Modeling is a subjective process
• Therefore, the schema after the logical database design
phases may not be very good. (contain redundancies)
• However, there are formalisms to ensure that the
schema is good.

• This process is called Normalization.

Normalization
• Relational database schema = set of relations
• Relations = set of attributes
• However we group the attributes to relations is very
important.
• Too many attributes in a relation
❑ Waste space
❑ Anomalies
• Decomposing the relation into too smaller set of relations.
❑ Loss-less join property
❑ Dependency preserving property.
❑ Example Lecturer (id,name,address,salary,department,building)
Anomalies
• Insertion anomaly
Inserting a new lecturer to the LECTURER table. Department
information is reported (ensure that correct department
information is inserted)
Note: Inserting a department with no employees (impossible
– because null values for id is not allowed)

• Deletion anomaly
Deleting the last lecturer from the department will lose
information about the department.
Anomalies
• Update Anomaly
Updating the department’s building needs to be done for all
lecturers working for that department.

• When redundancies exits , we should decompose the

relations to smaller relations.
Anomalies
• Decomposing the relation into too smaller relations
• Less-less join property : we might lose information if we
decompose relations…
• Dependency-preserving property: The set of dependencies in
S can be verified by a set of dependencies in R1 and R2.
• Example
S R1 R2
S P D S P P D
S1 P1 D1 S1 P1 P1 D1
S2 P2 D2 S2 P2 P2 D2
S3 P3 D3 S3 S3 P1 D3
Anomalies
• Joining them together, we get spurious
tuples…
S P D
S1 P1 D1
S1 P1 D3
S2 P2 D2
S3 P1 P1
S3 P1 D3
Normalization
• To avoid the above mentioned issues in the relational schema,
we can apply a formal process called Normalizations.

• Normalization is based on functional dependence.

Functional Dependency
• Functional dependencies (FDs) are used to specify formal
measures of the "goodness" of relational designs.
• FDs and keys are used to define normal forms for
relations.
• FDs are constraints that are derived from the meaning
and interrelationships of the data attributes.
• A set of attributes X functionally determines a set of
attributes Y if the value of X determines a unique value
for Y.
Functional Dependency
• X -> Y holds if whenever two tuples have the same value
for X, they must have the same value for Y.
• For any two tuples t1 and t2 in any relation instance r(R):
If t1[X]=t2[X], then t1[Y]=t2[Y]
• X -> Y in R specifies a constraint on all relation instances
r(R) Written as X -> Y;
• can be displayed graphically on a relation schema as in
Figures
• FDs are derived from the real-world constraints on the
attributes
Functional Dependency
• social security number determines employee name
SSN -> ENAME

• project number determines project name and location

PNUMBER -> {PNAME, PLOCATION}

• employee ssn and project number determines the hours per

week that the employee works on the project
{SSN, PNUMBER} -> HOURS
Functional Dependency
• An FD is a property of the attributes in the schema R
• The constraint must hold on every relation instance r(R)
• If K is a key of R, then K functionally determines all attributes
in R (since we never have two distinct tuples with t1[K]=t2[K])
Keys
• A superkey of a relation schema R = {A1, A2, ...., An} is a set of
attributes S subset-of R with the property that no two tuples
t1 and t2 in any legal relation state r of R will have t1[S] =t2[S]

• A key K is a superkey with the additional property that

removal of any attribute from K will cause K not to be a
superkey any more.
Keys
• If a relation schema has more than one key, each is called a
candidate key. One of the candidate keys is arbitrarily
designated to be the primary key, and the others are called
secondary keys.
• A Prime attribute must be a member of some candidate key
• A Nonprime attribute is not a prime attribute—that is, it is
not a member of any [Link] key
1st Normal Form
• Disallows composite attributes, multivalued attributes, and
nested relations; attributes whose values for an individual
tuple are non-atomic.
• A relation R is in first normal form (1NF) if domains of all
attirbutes in the relations are atomic.
• Considered to be part of the definition of relation
1st Normal Form
2nd Normal Form
• Uses the concepts of FDs, primary key
• Prime attribute - attribute that is member of the primary
key K
• Full functional dependency - a FD Y -> Z where removal
of any attribute from Y means the FD does not hold any
more
{SSN, PNUMBER} -> HOURS is a full FD since neither SSN
-> HOURS nor PNUMBER -> HOURS hold
- {SSN, PNUMBER} -> ENAME is not a full FD (it is called a
partial dependency ) since SSN -> ENAME also holds
2nd Normal Form
• A relation schema R is in second normal form (2NF) if every
non-prime attribute A in R is fully functionally dependent on
the primary key

• R can be decomposed into 2NF relations via the process of

2NF normalization
2nd Normal Form
3rd Normal Form
• Transitive functional dependency
• a FD X -> Z that can be derived from two FDs X -> Y
and Y -> Z

- SSN -> DMGRSSN is a transitive FD since

SSN -> DNUMBER and DNUMBER -> DMGRSSN hold
-SSN -> ENAME is non-transitive since there is no set of
attributes X where SSN -> X and X -> ENAME
3rd Normal Form
• A relation schema R is in third normal form (3NF) if it is in 2NF
and no non-prime attribute A in R is transitively dependent
on the primary key
• R can be decomposed into 3NF relations via the process of
3NF normalization
NOTE:
In X -> Y and Y -> Z, with X as the primary key, we consider this
a problem only if Y is not a candidate key. When Y is a
candidate key, there is no problem with the transitive
dependency .
E.g., Consider EMP (SSN, Emp#, Salary ).
Here, SSN -> Emp# -> Salary and Emp# is a candidate key
• Normalization
• Redancadancy
• Anomalies
• Insert, delete and update anomalies
• Candidate key, primary key, prime attribute , non prime
attribute.
• 1st normal, 2nd normal form, 3 rd normal form.
• There are many Normal Forms proposed to reduce
redundancies.
• Some of the well-known ones are:
• 1st Normal Form
• 2nd Normal Form
• 3rd Normal Form
• Each key of a relation is called candidate key.
• A candidate key is chosen to be the primary key.
• An attribute key which is a member of a candidate key is
prime attribute.
• Discuss the attribute semantics as an informal measure of
goodness for a relation schema.
• What is insertion, deletion and modification anomalies.
• Why are they considered bad? Illustrate with example.
• What is functional dependency?
• Define the first, second and third normal forms with a
example.

Functional Dependencies and Normalization For Relational Databases
No ratings yet
Functional Dependencies and Normalization For Relational Databases
36 pages
Normalization
No ratings yet
Normalization
35 pages
212 Lecture 11 Chapter8-Normalization
No ratings yet
212 Lecture 11 Chapter8-Normalization
52 pages
370 - Lec 6
No ratings yet
370 - Lec 6
24 pages
Semantics of The Relation Attributes: Each Tuple in A Relation Should Represent One Entity or Relationship Instance
No ratings yet
Semantics of The Relation Attributes: Each Tuple in A Relation Should Represent One Entity or Relationship Instance
36 pages
Functional Dependencies and Normalization
No ratings yet
Functional Dependencies and Normalization
25 pages
Normalization
No ratings yet
Normalization
27 pages
Chapter 19 Normalization NEW
No ratings yet
Chapter 19 Normalization NEW
49 pages
Normalization Unit 4
No ratings yet
Normalization Unit 4
34 pages
Lec02 - Normalization
No ratings yet
Lec02 - Normalization
35 pages
Normalization
No ratings yet
Normalization
30 pages
Database Normalization Guide
No ratings yet
Database Normalization Guide
35 pages
Normalization GFGC
No ratings yet
Normalization GFGC
44 pages
CH-4 DBMS Normalisation
No ratings yet
CH-4 DBMS Normalisation
38 pages
Unit 3 Dbms
No ratings yet
Unit 3 Dbms
30 pages
Functional Dependencies and Normalization
No ratings yet
Functional Dependencies and Normalization
49 pages
Database Normalization Guide
No ratings yet
Database Normalization Guide
21 pages
Lect 5 - Functional Dependencies and Normalization
No ratings yet
Lect 5 - Functional Dependencies and Normalization
36 pages
Database Normalization Guide
No ratings yet
Database Normalization Guide
47 pages
Informal Guidelines for Relational Schemas
No ratings yet
Informal Guidelines for Relational Schemas
14 pages
DBMS Module4
No ratings yet
DBMS Module4
16 pages
Functional Dependency & Normalization
No ratings yet
Functional Dependency & Normalization
10 pages
Module 3 Part 1
No ratings yet
Module 3 Part 1
14 pages
Bcs403 Dbms m3 Notes
No ratings yet
Bcs403 Dbms m3 Notes
12 pages
4 Normalization
No ratings yet
4 Normalization
46 pages
Week 6
No ratings yet
Week 6
36 pages
Normalization
100% (1)
Normalization
51 pages
Chapter Normalization Part 1and Part 2
No ratings yet
Chapter Normalization Part 1and Part 2
35 pages
UNIT-3DBMS (Normalization and Functional Dependency)
No ratings yet
UNIT-3DBMS (Normalization and Functional Dependency)
34 pages
DBMS Lesson 5.1
No ratings yet
DBMS Lesson 5.1
17 pages
Database Normalization Explained
No ratings yet
Database Normalization Explained
16 pages
Database Normalization Guide
No ratings yet
Database Normalization Guide
26 pages
Functional Dependencies and Normalization For Relational Databases
No ratings yet
Functional Dependencies and Normalization For Relational Databases
31 pages
Normalizekiit PDF
No ratings yet
Normalizekiit PDF
68 pages
DBMS Module 3 Study Notes
No ratings yet
DBMS Module 3 Study Notes
10 pages
Normalisation
No ratings yet
Normalisation
29 pages
Functional Dependencies
No ratings yet
Functional Dependencies
61 pages
Lecture 5 Normalization
No ratings yet
Lecture 5 Normalization
12 pages
CS 380 Introduction To Database Systems: King Saud University
No ratings yet
CS 380 Introduction To Database Systems: King Saud University
45 pages
Introduction To SQL Programming Techniques: Course Name: Course Code
No ratings yet
Introduction To SQL Programming Techniques: Course Name: Course Code
151 pages
L5 Normalization
No ratings yet
L5 Normalization
42 pages
5.relational DB Design
No ratings yet
5.relational DB Design
33 pages
Normalization
No ratings yet
Normalization
31 pages
Understanding Database Management Systems
No ratings yet
Understanding Database Management Systems
55 pages
Functional Dependencies & Normalization
No ratings yet
Functional Dependencies & Normalization
65 pages
Unit 6 RDB Design
No ratings yet
Unit 6 RDB Design
103 pages
Functional Dependencies and Normilization
No ratings yet
Functional Dependencies and Normilization
60 pages
IS2511 Module 9
No ratings yet
IS2511 Module 9
32 pages
Ch10-Functional Dependencies and Normalization For Relational Databases
No ratings yet
Ch10-Functional Dependencies and Normalization For Relational Databases
31 pages
Normalization 1
No ratings yet
Normalization 1
10 pages
18CS53 - 2022 - 23 - Module4 - DBMS
No ratings yet
18CS53 - 2022 - 23 - Module4 - DBMS
53 pages
Chapter 14 Database Mangement Systems Book
No ratings yet
Chapter 14 Database Mangement Systems Book
48 pages
Chapter 4
No ratings yet
Chapter 4
48 pages
14 - FDs and Normalization Part-1
No ratings yet
14 - FDs and Normalization Part-1
26 pages
System Analysis & Design in Healthcare
No ratings yet
System Analysis & Design in Healthcare
10 pages
Python Scripts for TV Show Data Analysis
No ratings yet
Python Scripts for TV Show Data Analysis
11 pages
HDI Hardware Components v1-0
No ratings yet
HDI Hardware Components v1-0
39 pages
SAP ERP New General Ledger Migration Guide
No ratings yet
SAP ERP New General Ledger Migration Guide
21 pages
SAP Training for Business Users
100% (1)
SAP Training for Business Users
179 pages
DDDD Excel
No ratings yet
DDDD Excel
8 pages
Overview of SQL Statement Types
No ratings yet
Overview of SQL Statement Types
3 pages
Penetration Testing with Cobalt Strike
No ratings yet
Penetration Testing with Cobalt Strike
14 pages
Data Mining with Apriori Algorithm
No ratings yet
Data Mining with Apriori Algorithm
14 pages
List of HTTP Status Codes
No ratings yet
List of HTTP Status Codes
6 pages
Secure Desktop Data Grid Protocols
No ratings yet
Secure Desktop Data Grid Protocols
6 pages
Oe5091 - Business Data Analytics
No ratings yet
Oe5091 - Business Data Analytics
83 pages
Oracle PL SQL Interview Questions For 3 - Years Experience
50% (2)
Oracle PL SQL Interview Questions For 3 - Years Experience
89 pages
Put The License
No ratings yet
Put The License
4 pages
Learning Apache Cassandra - Sample Chapter
No ratings yet
Learning Apache Cassandra - Sample Chapter
20 pages
Revit 2013 API Developer Guide
No ratings yet
Revit 2013 API Developer Guide
330 pages
Blockchain & AI in Education Systems
No ratings yet
Blockchain & AI in Education Systems
3 pages
Cs8592 - Object Oriented Analysis and Design
No ratings yet
Cs8592 - Object Oriented Analysis and Design
48 pages
Resume Arshad Final
No ratings yet
Resume Arshad Final
1 page
Command Injection
No ratings yet
Command Injection
9 pages
Mary Sds
No ratings yet
Mary Sds
28 pages
Unit 5 EH
No ratings yet
Unit 5 EH
13 pages
MySQL Q&A for Beginners
No ratings yet
MySQL Q&A for Beginners
7 pages
SSL Tls
No ratings yet
SSL Tls
17 pages
B SRV Admin Ref Windows PDF
No ratings yet
B SRV Admin Ref Windows PDF
1,686 pages
Abhilash - Data Analyst - Resume
No ratings yet
Abhilash - Data Analyst - Resume
2 pages
Python Jinja2 Template Tutorial
No ratings yet
Python Jinja2 Template Tutorial
10 pages
Magic Quadrant For Observability Platforms, 2024
No ratings yet
Magic Quadrant For Observability Platforms, 2024
17 pages
Personnel Cost Planning Guide
No ratings yet
Personnel Cost Planning Guide
52 pages
KOE 093 Data Warehousing Exam Paper
No ratings yet
KOE 093 Data Warehousing Exam Paper
1 page

6.data Normalization Process and The Normal Forms

Uploaded by

6.data Normalization Process and The Normal Forms

Uploaded by

• Data Normalization process and the normal forms

• Introduction to data normalization

• 1st Normal Form(1st NF)

• 2nd Normal Form (2nd NF)

• 3rd Normal Form (3rd NF)

• This process is called Normalization.

• When redundancies exits , we should decompose the

• Normalization is based on functional dependence.

• project number determines project name and location

• employee ssn and project number determines the hours per

• A key K is a superkey with the additional property that

• R can be decomposed into 2NF relations via the process of

- SSN -> DMGRSSN is a transitive FD since

You might also like