0% found this document useful (0 votes)

16 views83 pages

Database Design with ER Models

Uploaded by

Anna Andersson

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views83 pages

Database Design with ER Models

Uploaded by

Anna Andersson

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 83

TDA357/DIT622 – Databases

Lecture 4 & 5 – Design using ER

Jonas Duregård
Week 2: Design using ER
• We now know how to implement a design in SQL and query it for data
• We have used constraints to prevent certain operations
• Primary keys and unique constraints to prevent duplicate values
• Reference constraints (FOREIGN KEY) to check that values exist in
other tables
• This week we will take a step back and learn how to design a database
from a domain description
Recall: Relational schemas
• A database schema is a collection of relations like this one:
Grades(student, course, grade)
student -> Students.idNumber
course -> Courses.code
grade {0,3,4,5}
• Relation schemas can be translated into SQL tables
CREATE TABLE Grades (
student TEXT,
course CHAR(6),
grade INT NOT NULL,
PRIMARY KEY (student, course),
FOREIGN KEY (student) REFERENCES Students(idNumber),
FOREIGN KEY (course) REFERENCES Courses(code),
CHECK (grade IN (0,3,4,5))
);
Spotting bad schemas
• Here is a database for keeping track of room bookings. Why is it bad?
Table: Bookings
courseCode name day timeslot room seats
TDA357 Databases Tuesday 0 GD 236
TDA357 Databases Tuesday 1 GD 236
ERE033 Reglerteknik Tuesday 0 HB4 224
ERE033 Reglerteknik Friday 0 GD 236
• Redundancy
• Mentions 3 times that GD has 236 seats and twice that TDA357 is Databases
• Risk of:
• Update anomaly: I change one of the seat values but not the other two
• Deletion anomaly: Remove all bookings in GD and loose knowledge of its size
Consider:
Decomposing the table
Table: Bookings
What do we get courseCode name day timeslot room seats
if we join the TDA357 Databases Tuesday 0 GD 236

decomposed TDA357 Databases Tuesday 1 GD 236

ERE033 Reglerteknik Tuesday 0 HB4 224
tables together?
ERE033 Reglerteknik Friday 0 GD 236

Table: Bookings
Table: Courses courseCode day timeslot room
Table: Rooms
courseCode name TDA357 Tuesday 0 GD
room seats
TDA357 Databases TDA357 Tuesday 1 GD
GD 236
ERE033 Reglerteknik ERE033 Tuesday 0 HB4
HB4 224
ERE033 Friday 0 GD

Fairly simple process for this tiny example

Not so much for large databases with hundreds of columns
How do we design database schemas?
Domain descriptions
• The domain of a database is an informal description of everything a database
should contain
• Typically given by a client (an expert in the domain, but not in databases)
• Natural language, no reliable automatic way to translate to SQL code
• Sometimes ambiguous on important details
• Sometimes hard to understand if you are not a domain expert
• In real life, there is continuous communication between the client who wants
the database and the developers of the database
• In this course, you get domain descriptions that are more precise (and
much shorter) than typical domain descriptions
Modelling domains
• The database we create is based on a model of the domain
• A model contains formal definitons
(in contrast to the informal domain description)
• Look at a sentence from the domain like "Rooms can be booked for
courses at particular timeslots.", when designing the database we
have to decide how to model that sentence in the database
The problem of design
• Going from an informal domain description directly to SQL or even to
a schema is difficult and error prone
• It's also hard to use the tables directly to describe the database to
other developers
• We want an intermediate high-level design that is easy for humans to
understand, and still easy to translate into tables
Domain Relational
description High-level Relational
DBMS
(informal text) design schema
(SQL code)

This is our focus this week

The Entity-Relationship model
• The ER model is a high-level design model that expresses every aspect of the
database as entities (entity sets to be more precise) and relationships (not the
same as relations!)
• A third category that is easy to forget (not in the name) are attributes
• Entities are things from the domain: courses, employees, car models, cars …
• Relationships are things that connect entities, like "a car is of a car model", or
"an employee has another employee as its boss", "courses have students" …
• Attributes are simple properties of entities: names of courses, salaries of
employees, …
Entities
• What is an entity?
• Entities are objects with attributes (e.g. courses with name, credits..)
• Rule of thumb: Entities can exist independently from other entities, for
instance courses and teachers, or players and teams etc.
• Car models are entities (can exist even if there are no cars of that model)
• Objects that are inherently dependent on other entities are not entities
• Course names are not entities (what is a course name without a course?)
• Course results cannot exist without courses, so results are not entities (but
also not attributes like course names!)
• This distinction is not clear-cut and there are some tricky cases
• As a guideline: most basic objects you want in your database tend to be entities
Entities, attributes and redundancy
• One way to express the problem with the redundancy we saw earlier:
• We treated rooms as attributes of bookings, when they should be entities
• Rooms are logically separate entities that can exist independently of
bookings, same thing goes for courses
Table: Bookings
courseCode name day timeslot room seats
TDA357 Databases Tuesday 0 GD 236
TDA357 Databases Tuesday 1 GD 236
ERE033 Reglerteknik Tuesday 0 HB4 224
ERE033 Reglerteknik Friday 0 GD 236
Attributes
• Attributes are "simple" properties of entities
• Can be numbers, text strings, etc.
• Values of every attribute should fit in a table cell
• Something that every instance of that entity has, like each course
having a name or each car having a license number
• What are NOT attributes:
• Collections of values (like a list of students)
• Optional values (if cars sometimes have an owner, owner is not an
attribute of the car entity)
• Things that refer to attributes of other entities (see relationships)
Drawing ER-diagrams
• ER-diagrams are visual representations of ER-models
• ER-diagrams are popular because they are:
• Graphical, human-friendly
• Mechanically translatable into SQL (thus machine-friendly)
• Quite expressive (a lot of design concepts can be expressed using ER)
• Our first ER-diagram, an entity (Course) with two attributes (name, credits):
credits
Attributes The primary key is underlined
(here courses are identified by name)
name Course
Entities are boxes
Translating to a relational schema
• Entities become relations (become tables in SQL)
• Attributes become attributes (become columns in SQL)
• Primary keys are translated in the obvious way

credits CREATE TABLE Courses(

name TEXT,
Courses(name, credits) credits INT NOT NULL,
name Course PRIMARY KEY (name)
);

Mechanical translation into schema Semi-mechanical translation to SQL

Domain Relational
description High level Relational
DBMS
(informal text) design (ER) schema
(SQL code)

"Each course has its own name, and a number of credits."

credits

Courses(name, credits)
name Course

CREATE TABLE Courses(

name TEXT,
credits INT NOT NULL,
PRIMARY KEY (name)
);
Naming in ER vs relational schemas
• Entities are by convention named in singular (Course)
• Relations are named in plural (Courses)
• We also follow the convention we are using for relations/tables,
meaning capitalized entity names and common attribute names

credits

Courses(name, credits)
name Course
Multiple entities
• Separate entities are (for now) translated independently
name name

credits Course Program

Courses(name, credits)
Programs(name)
The rest of this and tomorrows lecture
• I will present lots of features of ER-diagrams
• For each feature I will state something from a domain description (an
informal description of the database) and …
• … show how to model it in ER-diagrams
• … show how it is translated into a relational schema
• … say something about how to identify the feature in domains
• If there is time left, I will spend it demonstrating some more examples
and common mistakes
Courses(name, credits)
A simple relationship Programs(name)

• I want to extend my database so every program has a set of courses

• First question: How would I modify my schema?
• Do I add an attribute "listOfCourses" to Programs?
• No! Lists are not attributes. And if they where, how do we state they
need to be names of courses?
• Do I add an attribute "listOfPrograms" to Courses?
• No! Same reasons! Courses(name, credits)
• Solution: Add a new relation  Programs(name)
• Essentially a list of (course,program)-pairs CourseIn(course,program)
course -> Courses.name
• In ER, we express CourseIn as a relationship program -> Program.name
Relationships in ER-diagrams
• Relationships are drawn using diamond-shapes
• Names typically describe the relationship (OwnedBy, RegisteredTo, …), or
sometimes just adding the related entities names (like ProgramCourse)
• Should relay the intent of the relationship, and work as a table name

name name

credits Course CourseIn Program

Constraints from relationships
• The references ensure only existing courses and programs can be in CourseIn
• The primary key of CourseIn is the PKs of both referenced relations:
• A course can be in several programs (same course, different program)
• A program can have multiple courses (same program, different course)
• A program cannot have the same course multiple times (both equal)

Courses(name, credits)
name name
Programs(name)
CourseIn(course,program)
credits Course CourseIn Program
course -> Courses.name
program -> Programs.name

Note that the credits attribute is not here, only the key
Naming and relationships
• During translation you need to decide a few names in the schema
• You can use the same name for attributes everywhere, but this is
sometimes confusing (like having an column called name in courses,
that is actually the name of a teacher) and sometimes does not work
• I like using the (lowercase) names of the connected entities:
name name

credits Course CourseIn Program

CourseIn(course,program)
course -> Courses.name
program -> Programs.name
Compound keys and relationships
• Remember: Always include the whole key of both the related relations!

place idnr

time Meeting Attends Person name

Meetings(time, place) Compound keys (2 or more attributes)

Persons(idnr, name)
Attends(person,time,place) Compound reference
person -> Persons.idnr
(time,place) -> Meetings.(time,place)
Attributes on relationships
• What if we wanted to add an attribute stating which year in a
program a certain course is?
• The year is not an attribute of course (because different programs
may have the course in different years)
• The year is not an attribute of program (because different courses
may be in different years of the program)
• The year is an attribute of the relationship between them!
year
name name

credits Course CourseIn Program

Translating relationships with attributes
• Just add the extra attributes to the created relation
• Note that relationships can never have key attributes!
• Always identified by the related entities
Table: CourseIn
year course program year
name name

Databases TIDAL 2
credits Course CourseIn Program Databases TKIEK 3
Webapp TIDAL 2

Courses(name, credits)
Programs(name) When I add a
CourseIn(course, program, year) (course,program)-pair
course -> Courses.name to CourseIn, I also
program -> Programs.name
have to specify a year
Identify attributes on relationships in domains
• Things like "X may have a z in/for/at a Y" where X and Y are entities
and z is an attribute typically signify that z is an attribute of a relation
• Example: "Teachers can be assigned roles in courses"
(role is an attribute of a relationship between teachers and courses,
unless roles should be an entity…)
Multiplicity
• The relationships we have seen so far are called many-to-many
• E.g. A course can belong to many programs and programs have many courses
• What if I wanted to model something like "Each course has a single teacher" or
"Some of the teachers have an office"?
• These examples describe relationships, but they are not many-to-many
Many-to-exactly-one
• The Teaches relationship models "each course has a single teacher"
• If the arrow was in the other direction it would model "each teacher
has a single course"
credits tname

name Course Teaches Teacher

No arrow indicates "many" Curved arrow indicates "exactly one"

• An exception to the guideline that entities can exist independently
(since courses cannot exist without teachers)
Translating many-to-exactly-one
• If you had a table of courses, where each row is a separate course. How
would you express that each course has a teacher?
• By adding a column for teacher of course!
credits tname

Not every relationship

name Course Teaches Teacher becomes a relation/table!

Teachers(tname)
Courses(name, credits, teacher)
teacher -> Teachers.tname
Reference constraint
Identify many-to-exactly-one in domains
• Anything like "Every X has a Y" (where Y is another entity) are typically
many-to-exactly-ones
• It's common that the language is ambiguous, e.g. "Xs have Ys" can
mean that each X can have multiple Ys, or that they each have one.
• If you want to add an attribute, but that attribute is more accurately
modelled as an entity, you should use many-to-one relationships
Many-to-at-most-one
• This diagram expresses "Some teachers have an office", or "A teacher
may have an office" or equivalent
• Also called many-to-one-or-zero

name number

Teacher TeacherOffice Office

Pointy arrow means "at most one" (= one or zero)

Translating many-to-at-most-one: ER-approach
Teachers(name)
name No arrow number
Offices(number)
TeacherOffice(teacher,office)
Teacher TeacherOffice Office
teacher -> Teachers.name
office -> Offices.number

• How can we modify the translation of a many-to-many relationship to

make it many-to-at-most-one?

name
Teachers(name) Change the key!
number
Offices(number)
TeacherOffice(teacher,office)
Teacher TeacherOffice Office
teacher -> Teachers.name
office -> Offices.number
Translating many-to-at-most-one: Null-approach
name Exactly one number Offices(number)
Teachers(name, office)
Teacher TeacherOffice Office office -> Offices.number

• How can we modify a many-to-one to make it many-to-at-most-one?

Allow the foreign key to be null
name number

Offices(number)
Teacher TeacherOffice Office Teachers(name, office (or null))
office -> Offices.number
When can the Null approach be used?
The null approach can only be used under the following conditions:
• The "at-most-one"-side does not have a compound key
• The relationship does not have any attributes
• You are morally OK with having null values in your database

The ER-approach always works 

• Side-note: Is the Null-approach more efficient?

• Depends on usage, but often it's less efficient
(slightly simplified: SELECTs are slower and INSERTS are faster)
Identify many-to-at-most-one in domains
• Anything like "X may have a Y" or "Some Xs have a Y"
(where Y is an entity)
• Often text is ambiguous about many/exactly one (e.g. "courses have
teachers" could sometimes mean that many courses do, but not all)
A side-note on naming attributes
• In several of these examples I have had the same name for attributes of
different entities (e.g. name for both courses and programs)
• This is allowed, but there may be advantages to having globally unique
attribute names in diagrams (less confusion, fewer qualifed names later
on in SQL etc.)
Multiway relationships
• A relationship can connect to more than two entities, but this is fairly rare
tname

code cname Teacher rname

Note that I have chosen
globally unique names here
Course Lecture Room

Courses(code, cname)
Teachers(tname)
Rooms(rname)
Lectures(course,teacher,room)
course -> Courses.code
teacher -> Teachers.tname
room -> Rooms.rname
A better example of a multiway relationship
• "Teachers can be assigned any of a list of roles (e.g. examiner, course
responsible, assistant) for any course."
• Here, roles should be an entity (the concept of examiner exists even if
there are currently no courses)
• Ambiguity: Can a teacher have roles on multiple courses?
• Reasonable assumption: yes (from our knowledge of the domain)
• Ambiguity: Can a teacher have multiple roles in the same course? (does
"any of" mean "any one of" or "any subset of"?)
• Let's assume a teacher can only have one role (and there is a special
role for being both examiner and course responsible for instance)
Key ensures we can associate any
Multiway relationship number of teachers with any number
of courses, and for each association
we have to select a valid role
rname

Courses(code, cname)
Role
tname
Teachers(idnr,tname)
code
Roles(rname)
CourseRole(course,teacher,role)
Course CourseRole Teacher
course -> Courses.code
teacher -> Teachers.idnr
cname idnr
role -> Roles.rname

Very similar to a role attribute on the relationship, but with a reference

Weird cases: What are we saying here?
• Probably "Some courses have a teacher with a role (from a list of roles)"?
• Switching the arrows expresses the same thing?
rname

Key ensures: At most one (role,teacher)-pair per course

Then what does this mean?
• Same as the one on the last slide?
• If we interpret the course-teacher connection as the main one it may
mean "courses may have a teacher and if they do they may have a role",
… but then it would look exactly the same as "courses may have a role
and if they do that role may have a teacher", which is different…
rname

Role
code tname

Course CourseRole Teacher

cname idnr
Confusion conclusion
• Multiway relationships that are many-to-many-to-many and
many-to-many-to-one seem to make sense, other combinations are more
difficult to interpret
• There are even weirder examples like all connections having pointed arrow
• Some that are obviously the same as having two separate relationships, like
many-to-exactly-one-to-exactly-one
Identifying multiway relationships in domains
• Something like "Xs have Ys in Zs" where X, Y and Z are all entities
• Ambiguity: "Xs have Ys and Zs" usually mean two separate
relationships (X-Y and X-Z) but sometimes a multiway relationship
• Attributes on relationships should be replaced by multiway
relationships (and an entity) when the values are chosen from a list
(like roles in the earlier example)
• "Teachers have roles in courses" (assuming role is an entity not an
attribute), "A person must have a contract for each project they are
involved in" (assuming persons, contracts and projects are entities)
• Identifying multiplicity of the various connections is often difficult
New problem!
• Model: "Most employees have another employee as their boss."
• Being boss of someone is a relationship
• So this is a relationship between employees and employees
idnr ename

Employee
Arrow indicates at most one boss
Any resemblance between diagram per employee
and Gary the snail is coincidental
BossOf
Translating self relationships
… is done exactly like other relationships!
• Here using the null-approach (add the key from the at-most-one side
as an extra attribute on the many side, and make it nullable):
idnr ename Employees(idnr,ename, boss (or null))
boss -> Employees.idnr

Employee

Self reference!

BossOf CREATE Table Employees(

idnr INT PRIMARY KEY,
ename TEXT NOT NULL,
Nullable boss INT REFERENCES Employees
);
A naming problem
• Suppose we want to model "Users can block other users"
• How do we translate this?
Problem: Who is blocking who here?
uname
No arrows, Users(uname)
many users User Blocked(uname1,uname2)
can block uname1 -> Users.uname
many users uname2 -> Users.uname
Blocked
Solution: Annotate connections with names
Remember: The keys of both sides
become the key for many-to-many
uname

User
Users(uname)
blocking blocked Blocked(blocking,blocked)
blocking -> Users.uname
Blocked blocked -> Users.uname
Limitations of self-relationships
• Some things that cannot be expressed in ER-diagrams:
• Can a value be related to itself? (I'm my own boss? I block myself?)
• Can a there be cycles (I'm the boss of my bosses boss? I'm blocked by a
person I block?)
• Is the relationship symmetric, e.g. for a Sibling-relationship:
If a is a sibling of b, then b must also be a sibling of a
• These can be expressed in side-notes/comments, but may be difficult to
implement in SQL
Identifying self relationships in domain
• Anything on the form "X has … to [an]other X"
• If you want to model any kind of tree-structure (many-to-at-most-one) or
graph structures (many-to-many) on values of an entity
Start of lecture 5 – The story so far
Domain description ER-diagram schema SQL code
ER-diagrams:
attribute
Modelling Translation
Entity

uname

N to N
Relationships
User
(many-to-many,
N to 1
many-to-one, blocking blocked
many-to-at-most-one)
Blocked
N to 0/1

Self-relationships
Weak entities
• Model this: "Each company has divisions, and each division has a name.
Two divisions in the same company cannot have the same name."
• Problem: What is the primary key of Divisions?
Too strong key
dname cname
Companies(cname)
Divisions(dname,company)
Division Of Company company -> Companies.cname

???
dname cname
Companies(cname)
Divisions(dname,company, company2)
company Division Of Company company2 -> Companies.cname
Missing reference for company
Weak entities
• To deal with this situation, we add a feature called Weak entities
• A weak entity cannot be identified only by its own attributes
• It requires support from at least one other entity
• The diagram below expresses that a division is identified by its name
along with the identity of the company it belongs to:
dname cname

Dashed underlining
Division Of Company
(partial primary key)

Weak entity Identifying relationship (always many-to-one)

Translating Weak entities
dname cname
Companies(cname)
Divisions(dname,company)
Division Of Company company -> Companies.cname

The key is the only difference!

• Compare to a regular many-to-one:

dname cname
Companies(cname)
Divisions(dname,company)
Division Of Company
company -> Companies.cname
Identify weak entities in domain descriptions
• Things like "Player numbers are unique within teams" or "players can
have the same number assuming they are on different teams"
• If you notice that the attributes you have determined for an entity are not
sufficient to identify members, perhaps it should be a weak entity
• If you want a key attribute on a relationship, you can turn the relationship
into a weak entity supported by both the related entities (a bit of a hack)
Courses(cname)
rname time cname
Rooms(rname)
Room In Lecture Of Course
Lectures(course,room,time)
course -> Courses.cname
room -> Rooms.rname
Inheritance and subentities
• Next up: What if I have something like "Some companies are
corporations. Each corporations has a year of incorporation."
• Adding a year attribute to Company does not work, that implies all
companies have a year of incorporation, and it's impossible to tell
which companies are corporations.

name

Company corpyear
Why is this bad?
• In this design some companies have corporation years.
• Problem: Look at the CorpYear entity in isolation. What does it
contain? All years? All years where at least one corporation has been
formed (and no other years can be added)?

Company name

Incoporation years cannot exist

independently, and should not
incorp
be modelled as entities!
Many-to-at-
most-one
CorpYear year
Inheritance in ER: ISA-relationships (ISA = "is a")
• This ER diagram expresses "Corporations are a special kind of
companies, they have a year in addition to all properties of other
companies."
• We call Corporation a subentity, and Company its superentity

address Company name

Points at superentity
No relationship name
ISA
(always labeled ISA)
Corporation year

No key attributes
Translating ISA-relationships: ER-approach
• Make a new relation that only has the primary key of the superentity and the
extra attributes added by the subentity, and a reference to the superentity

Note: No address here

address Company name
Companies(name, address)
Corporations(name, year)
ISA name -> Companies.name

Corporation year
The reference means each
corporation also has an address
etc. - it really IS A company
Translating ISA-relationships: Null approach
• Just add the extra attribute to the superentity, but nullable
address Company name

ISA

Corporation year

Companies(name, address, year (or null))

null approach is (sometimes) bad m'kay
"Some courses are PhD Courses"
Courses(code)
code
PhdCourses(code)
code -> Courses.code
Course

ISA

PhdCourse Courses(code)

Sanity check: Is every design choice I made

in my diagram reflected in my schema?
Persons(idnr)
Companies(name, address)
Another null-problem Corporations(name, year)
name -> Companies.name
BoardMembers(person,corp)
address Company person -> Persons.idnr
corp -> Corporations.name
name

ISA
Reference to Corporations 
year Corporation
Persons(idnr)
Companies(name, address, year (or null))
BoardMember NULL BoardMembers(person,corp)
person -> Persons.idnr
corp -> ???
idnr Person
Putting Companies.name here would be bad
(means all companies can have board members)
Problems with the NULL-approach
The NULL approach can only be used if:
• The subentity has a single attribute
• The subentity has no relationships to other entities
• Includes not having its own subentitites

• Once again, the ER-approach always works...

Object Oriented translation of ISA
• A third type of translation is possible, where the subentity gets all the
attributes of the superentity, and every value is a member of just one of the
relations (in the ER-approach it is a member of the superentity or both)
• This translation is even more problematic than the Null-approach, so I will
not explain it in more detail
Multiple subentities
• In ER (unlike most Object Oriented Programming), a value can be a
member of several subentities (if it is also a member of the superentity)
• No easy way to limit membership to a single subentity
Students(idnr,name)
idnr
PhdStudents(idnr)
idnr -> Students.idnr
name Student MasterStudents(idnr)
idnr -> Students.idnr

Four possibilites for a student X:

ISA ISA
X is only in Student
PhdStudent MasterStudent
X is in Student and PhdStudent
X is in Student and MasterStudent
X is in all three (!)
Quiz
• Can employees be employed by corporations in this design?
• Answer: Yes!
"Companies can have employees"

Employed Company name

"Every corporation is a company"

ISA
Employee

Corporation year
idnr
Translation of example in last slide
Employed Company name

ISA
Employee

Corporation year
idnr
All corporation names
Employees(idnr) are also company names
Companies(name)
Corporations(name, year)
name -> Companies.name
Employed(employee, company)
Any company name
employee -> Employees.idnr
company -> Companies.name can be used here
ISA relationships and primary keys
• Subentities can never have any key attributes of their own!
• Defies the concept of inheritance, if entity X is not identified by the same
attributes as entity Y, we can never claim "X is a Y", it is something else
• If you need additional identifying attributes, use weak entities
• In a way, weak entities are "subentities with extra identifying attributes"
Identifying ISA-relationships in domains
• When it makes sense to say "X is a Y", like "Corporation is a Company" or
"Employee is a Person" or "Car is a Vehicle"
• In domain texts, it may also be stated as variations of "some X have y",
where y is an attribute and not another entity (optional attributes)
• Sometimes when you want to model a subset of some entity even if they
don't have extra attributes
Mission Accomplished!

You now know (or have at least heard of) all features of ER required for task 2!
Practicalities of making ER-diagrams
• We recommend using Dia to make diagrams (http://dia-installer.de/)
• Make sure you set it to ER-mode
• Pro-tip: Double-click an entity/relation to
set its properties (e.g. weak etc) ER-mode
• Pro-tip: Double-click an attribute to set
primary key etc.
• Pro-tip: Double-click a line to change arrows, the curved arrow is
available under "more arrows"
• Pro-tip: There are no ISA-relationships in the ER mode, but you can use
merge/extract from flowchart (and just write ISA on them)
• Pro-tip: Always connect lines to the points of diamonds, not their center
I made a tutorial video explaining all of this and more!
Order of translation
• Always start with an entity whose translation does not depend on knowing the
keys of other entities
• Example: If two entities have a many-to-exactly-one relationship, always
start with the entity on the exactly-one side
• Example2: Translate superentities before you translate subentities
• Basically: Do translations in reverse order of how the arrows are pointing
• Do many-to-many relationships last
• If you find yourself writing a reference to a relation you have not yet
translated, you are doing things in the wrong order
A small domain example
"Make a database of companies in a corporate group. The companies
have employees, that are divided into divisions within each company.
Divisions in the same company cannot have the same names. Most
employees have a boss.
Some employees are given a managerial position with a special title.
Also make an inventory of the company cars, who has access to which
cars (only managers can have access to company cars) and for how
long they have had access."
An ER-diagram with almost everything
BossOf cname
dname
ssn boss

ename Employee Of Division In Company

salary
ISA
sinceyear model

title Manager CarAccess CompanyCar plates

Companies(cname) Entities (including many-to-exactly-one relationships)
Divisions(dname,company)
company -> Companies.cname
Employees(ssn,division,company,ename,salary)
(division,company) -> Divisions.(dname,company)
Managers(ssn,title)
ssn -> Employees.ssn
CompanyCars(plates,model)

CarAccess(ssn,car,sinceyear) Remaining relationships

ssn -> Managers.ssn
car -> CompanyCars.plates
BossOf(employee, boss)
employee -> Employees.ssn Used ER-approach here
boss -> Employees.ssn
Assumptions made in this model
• Several assumptions were made that where not clearly stated in the
domain description:
• Each employee has a unique social security number (ssn)
• It's enough to know which year a car access was issued
• The model of each company car is known
• Titles for managers do not exist if no one has them (there is no
Title entity)
• Making different assumptions would give a different diagram/schema
• It's always good to keep track of the assumptions you make!
Things that are not in the model (but should be?)
• Cars belonging to specific companies or divisions
• Managers can only have access to cars from their own company?
• Bosses and their employees belong to the same company?
• An employee cannot be the boss of themselves
• More general: No cycles in Boss-relation (two employees being each others
bosses etc)

• Some of these would be impossible to model in ER without loosing other

constraints we have, and some would be very difficult to implement in SQL
Clever ways to patch up the schema
• Make sure that bosses and their employees work in the same company.
BossOf(employee, boss) References express:
employee -> Employees.ssn
boss -> Employees.ssn
"employee works at company"
and "boss works at company"
BossOf(employee, boss, company)
(employee,company) -> Employees.(ssn,company)
(boss,company) -> Employees.(ssn,company)

• Difficult or impossible to express in the ER-Diagram 

• In SQL we must add (ssn,company) UNIQUE to Employees for the
reference to work (you can only have references to values that are
explicitly stated to be keys)
Some things that cannot be expressed in ER
• Secondary keys/uniqueness constraints
• Can be identified using functional dependencies (next weeks topic)
• Value constraints (like "grade is one of 'U','3','4','5' ")
• Can be added as side-notes/comments
• "Relationships between relationships"
• Example: If we have one relationship for access to a company car, and one for
currently using it, we cannot express that the latter is a subset of the former
• Can sometimes be implemented by cleverly modifying the schema (e.g.
adding/modifying references) after translation from ER
Can we extend ER to express more things?
• Yes, but some of the limitations are due to limitations in the underlying
relational model, or in the end limitations in SQL
• Means we may get diagrams that cannot easily be translated into
executable code
Common mistakes – check before you submit!
• Connecting entities directly without a relationship
• Connecting relationships directly to each other
• Pointing the ISA the wrong way or pointing it at empty space
• Ignoring multiplicity and translating everything as many-to-many
• Sanity check: Is every choice you made in the diagram reflected in your
schema?
• Not using inheritance for optional attributes
• If an entity has an attribute, it means every instance of it has a value
• "I just set the limit attribute to 1000 000 if there is no limit" 
• Sanity check: If your diagram looks exactly as it would without a design
choice, that design choice does not exist
Common mistakes: Inheritance
• Overusing Inheritance, e.g. introducing a "WinningTeam" or "CompletedCourse"
entity when wins/grades are already stored in relationships
• Introduces redundancy!
• The inheritance should be an inherent property of the entity (ask yourself:
"can I look at a course and determine if it's a completed course or not?")
• "Branch ISA Program" or similar things that don't make sense to say out loud
Common mistakes: Playing with arrows
• If you ever get the feeling you should have something like this:
a2 b1

a1 A R B b2

… more likely A and B should be a single entity?

• It's hard to understand the exact intention here, are the values in A
and B in 1 to 1 correspondence? Or can several values in A map to the
same value in B? (Then it should be two relationships!)

Lecture 5
No ratings yet
Lecture 5
41 pages
ER Modeling
No ratings yet
ER Modeling
74 pages
ER Model Basics for Database Design
No ratings yet
ER Model Basics for Database Design
40 pages
Entity Relationship (ER) Diagrams
No ratings yet
Entity Relationship (ER) Diagrams
36 pages
3 ERMtoRM
No ratings yet
3 ERMtoRM
31 pages
Fasih PPT Part
No ratings yet
Fasih PPT Part
10 pages
Chap 2
No ratings yet
Chap 2
60 pages
Chapter 3 - ER
No ratings yet
Chapter 3 - ER
73 pages
Entity-Relationship Model Overview
No ratings yet
Entity-Relationship Model Overview
73 pages
E R Diagram
No ratings yet
E R Diagram
51 pages
Module 1
No ratings yet
Module 1
27 pages
ER Model and Diagram Tutorial
No ratings yet
ER Model and Diagram Tutorial
37 pages
DBMS Unit 2
No ratings yet
DBMS Unit 2
20 pages
ER Model and Data Design Overview
No ratings yet
ER Model and Data Design Overview
74 pages
Lecture 3
No ratings yet
Lecture 3
41 pages
DBS Lecture 3
No ratings yet
DBS Lecture 3
26 pages
Unit 3 - PPT
No ratings yet
Unit 3 - PPT
27 pages
Updated-Data Base Management System
No ratings yet
Updated-Data Base Management System
77 pages
ER Model for CS Students
No ratings yet
ER Model for CS Students
36 pages
Database Management System Lecture 1
No ratings yet
Database Management System Lecture 1
30 pages
Lecture 8 ER Model
No ratings yet
Lecture 8 ER Model
6 pages
Database Application SAK 3408
No ratings yet
Database Application SAK 3408
51 pages
Understanding ER Diagrams in DBMS
No ratings yet
Understanding ER Diagrams in DBMS
35 pages
Chapter 4 - Database Using Er Model
No ratings yet
Chapter 4 - Database Using Er Model
72 pages
DBMS Unit 2
No ratings yet
DBMS Unit 2
14 pages
Database Design Concepts and Models
100% (1)
Database Design Concepts and Models
48 pages
Normalizing Person Relation to 1NF
No ratings yet
Normalizing Person Relation to 1NF
478 pages
Unit 2 - Relational Data Model and ERModel
No ratings yet
Unit 2 - Relational Data Model and ERModel
63 pages
DBMS Chapter 3
No ratings yet
DBMS Chapter 3
28 pages
ER - Model 2024
No ratings yet
ER - Model 2024
69 pages
Entity-Relationship Model Guide
No ratings yet
Entity-Relationship Model Guide
67 pages
DB Ch07
No ratings yet
DB Ch07
63 pages
Entity Relationship Er Modeling
No ratings yet
Entity Relationship Er Modeling
67 pages
E R Diagram 1753266931389
No ratings yet
E R Diagram 1753266931389
20 pages
Database Design I: The Entity-Relationship Model
No ratings yet
Database Design I: The Entity-Relationship Model
33 pages
Chapter 3 - 2
No ratings yet
Chapter 3 - 2
84 pages
Database Design Essentials
No ratings yet
Database Design Essentials
52 pages
Unit 2
No ratings yet
Unit 2
39 pages
DBMS - Unit-2
No ratings yet
DBMS - Unit-2
71 pages
CS8492 - DBMS - Unit 2
No ratings yet
CS8492 - DBMS - Unit 2
18 pages
Entity-Relationship Data Modeling Guide
No ratings yet
Entity-Relationship Data Modeling Guide
37 pages
Relational Databases - 3rd Semester
No ratings yet
Relational Databases - 3rd Semester
23 pages
ch3 Slide
No ratings yet
ch3 Slide
43 pages
Unit-2 DBMS
No ratings yet
Unit-2 DBMS
43 pages
Summary ERD
No ratings yet
Summary ERD
34 pages
Data Modeling (Week 2)
No ratings yet
Data Modeling (Week 2)
31 pages
Ch-04 (DB) - Entity - Relationship Model
No ratings yet
Ch-04 (DB) - Entity - Relationship Model
91 pages
Chapter 6
No ratings yet
Chapter 6
35 pages
Chapter 4
No ratings yet
Chapter 4
38 pages
Module 4 Entity-Relationship Model-1
No ratings yet
Module 4 Entity-Relationship Model-1
47 pages
ERD
No ratings yet
ERD
34 pages
A724912995 - 23614 - 12 - 2018 - Lecture 4
No ratings yet
A724912995 - 23614 - 12 - 2018 - Lecture 4
48 pages
Data Relationships in E-R Modeling
No ratings yet
Data Relationships in E-R Modeling
14 pages
Dbms Notes Unit II
No ratings yet
Dbms Notes Unit II
29 pages
Lecture1 - DBMS and ERD
No ratings yet
Lecture1 - DBMS and ERD
46 pages
Chapter 3 IM NOTES
No ratings yet
Chapter 3 IM NOTES
8 pages
Lecture 5 - Entity Relationship Model
No ratings yet
Lecture 5 - Entity Relationship Model
26 pages
Bcs Sem II Dbms Unit III
No ratings yet
Bcs Sem II Dbms Unit III
11 pages
Targeted Search Exercise
No ratings yet
Targeted Search Exercise
2 pages
Scanning The Documents
No ratings yet
Scanning The Documents
2 pages
sph-10bt - FW Update Instructions Ver 818 PDF
No ratings yet
sph-10bt - FW Update Instructions Ver 818 PDF
4 pages
DraftSight 2017 SP3 System Requirements v1
No ratings yet
DraftSight 2017 SP3 System Requirements v1
1 page
Duetto Scoreboard
No ratings yet
Duetto Scoreboard
3 pages
Title: Employee Attendance Management System
No ratings yet
Title: Employee Attendance Management System
16 pages
Calibrating Freeway Simulation Models in Vissim: Tony Woody
No ratings yet
Calibrating Freeway Simulation Models in Vissim: Tony Woody
24 pages
CH 02
No ratings yet
CH 02
8 pages
OpenText Archiving and Document Access For SAP Solutions CE 21.4 - User Guide English (ER210400-UGD-En-02)
No ratings yet
OpenText Archiving and Document Access For SAP Solutions CE 21.4 - User Guide English (ER210400-UGD-En-02)
174 pages
Question Bank SE uNIT 5
No ratings yet
Question Bank SE uNIT 5
15 pages
Matlab Vit
0% (1)
Matlab Vit
73 pages
OceanStor Dorado 6.x & OceanStor 6.x DM-Multipath Configuration Guide For Citrix XenServer
No ratings yet
OceanStor Dorado 6.x & OceanStor 6.x DM-Multipath Configuration Guide For Citrix XenServer
12 pages
English User Manual
No ratings yet
English User Manual
18 pages
CMPT 120, Fall 2008, SFU Burnaby Instructor: Diana Cukierman
No ratings yet
CMPT 120, Fall 2008, SFU Burnaby Instructor: Diana Cukierman
3 pages
PGP Key Generation in Oracle ERP Cloud
No ratings yet
PGP Key Generation in Oracle ERP Cloud
2 pages
Module 4 Sheet 2.3 Software Packages and Use of Application Programs
No ratings yet
Module 4 Sheet 2.3 Software Packages and Use of Application Programs
22 pages
RFID Smart Card for Students
No ratings yet
RFID Smart Card for Students
5 pages
Experiment No 6
No ratings yet
Experiment No 6
11 pages
Istqb Ctel Test Manager Sample Exam Paper v2011 Release Version
No ratings yet
Istqb Ctel Test Manager Sample Exam Paper v2011 Release Version
29 pages
Filamentalist V3 Manual V1.1.1
No ratings yet
Filamentalist V3 Manual V1.1.1
30 pages
Q1 - WK2 - DLL - Mapeh 6 Updated
No ratings yet
Q1 - WK2 - DLL - Mapeh 6 Updated
8 pages
VOD M3U Vipvodle Top
No ratings yet
VOD M3U Vipvodle Top
4 pages
HCI Proposal
No ratings yet
HCI Proposal
5 pages
Excel Lesson 1 - Candy Is Dandy
No ratings yet
Excel Lesson 1 - Candy Is Dandy
12 pages
PDF For Hinglish
No ratings yet
PDF For Hinglish
39 pages
A Comparative Analysis of DXPRUM and DSDM: March 2017
No ratings yet
A Comparative Analysis of DXPRUM and DSDM: March 2017
7 pages
Module IV Microsoft Powerpoint
No ratings yet
Module IV Microsoft Powerpoint
6 pages
Java Programming Language Report: January 2021
No ratings yet
Java Programming Language Report: January 2021
15 pages
Python Scatter Plot with Matplotlib
No ratings yet
Python Scatter Plot with Matplotlib
49 pages
Faizan Resume
No ratings yet
Faizan Resume
2 pages

Database Design with ER Models

Uploaded by

Database Design with ER Models

Uploaded by

TDA357/DIT622 – Databases

Lecture 4 & 5 – Design using ER

decomposed TDA357 Databases Tuesday 1 GD 236

Fairly simple process for this tiny example

This is our focus this week

credits CREATE TABLE Courses(

Mechanical translation into schema Semi-mechanical translation to SQL

"Each course has its own name, and a number of credits."

CREATE TABLE Courses(

credits Course Program

• I want to extend my database so every program has a set of courses

credits Course CourseIn Program

credits Course CourseIn Program

time Meeting Attends Person name

Meetings(time, place) Compound keys (2 or more attributes)

credits Course CourseIn Program

name Course Teaches Teacher

No arrow indicates "many" Curved arrow indicates "exactly one"

Not every relationship

Teacher TeacherOffice Office

Pointy arrow means "at most one" (= one or zero)

• How can we modify the translation of a many-to-many relationship to

• How can we modify a many-to-one to make it many-to-at-most-one?

The ER-approach always works 

• Side-note: Is the Null-approach more efficient?

code cname Teacher rname

Very similar to a role attribute on the relationship, but with a reference

Key ensures: At most one (role,teacher)-pair per course

Course CourseRole Teacher

BossOf CREATE Table Employees(

Weak entity Identifying relationship (always many-to-one)

The key is the only difference!

Incoporation years cannot exist

address Company name

Note: No address here

Companies(name, address, year (or null))

Sanity check: Is every design choice I made

• Once again, the ER-approach always works...

Four possibilites for a student X:

Employed Company name

"Every corporation is a company"

ename Employee Of Division In Company

title Manager CarAccess CompanyCar plates

CarAccess(ssn,car,sinceyear) Remaining relationships

• Some of these would be impossible to model in ER without loosing other

• Difficult or impossible to express in the ER-Diagram 

… more likely A and B should be a single entity?

You might also like