0% found this document useful (0 votes)

73 views27 pages

Database Design Pitfalls Explained

This document discusses common database design best practices and when they could potentially go wrong or be inappropriate. It covers topics like stored procedures, clustered indexes, identity columns, indexing, fragmentation, naming conventions, partitioning, and object-relational mappers. The key message is that while general guidelines are useful, the optimal design depends on the specific situation and workload. Flexibility is important to avoid practices that may work against performance or scalability needs.

Uploaded by

Clouddrops 360

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

73 views27 pages

Database Design Pitfalls Explained

Uploaded by

Clouddrops 360

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

March 2015

When Good Design Goes Bad

Bob Duffy
Database Architect
Prodata SQL Centre of Excellence
Bob Duffy
• 20 years in database sector, 250+ projects
• Senior Consultant with Microsoft 2005-2008
• One of about 25 MCA for SQL Server globally (aka SQL Ranger)
• SQL MCM on SQL 2005 and 2008
• SQL Server MVP 2009+
• SSAS Maestro
• Database Architect at Prodata SQL Centre of Excellence

• http://blogs.prodata.ie/author/bob.aspx
• [email protected]

@bob_duffy
What We Will Cover
Stored Procedures
Clustered Tables
Identity and Primary Keys
Indexes
Fragmentation
Naming Conventions
Partitioning
ORM
1. Stored Procedures
Why use Stored Procedures ?
The Dreaded Search Screen

http://www.sommarskog.se/dyn-search.html
The Chunky v Chatty Debate
Two Types of “Chunkiness”
Data Transferred per Call
Number of Calls
Network latency Important here
Stored Procedure Weigh In

Dynamic Design Patterns

Performance Slow Interpreted TSQL
Security Application Agility
Plan Cache Developer Agility
Maintainability The ORM Debate
Chunky Chatty
2. Clustered vs Heap
Best Practise: Cluster ALL Tables ?
Ever Increasing, Narrow, Unique, Static
Always use an Identity Column
When do Clustered Tables go Bad ?
Harder to Scale, especially for some key choices
Large Table Scan Workloads
Non sequential Clustering Keys Cause Fragmentation
Why is Fragmentation the Achilles Heel of table Scans
More Pages => More IO
Kills Read Ahead and disk performance
Heavy NCI Requirement
Best Practise “Clustered Index are Better for Seeks”
Well it depends on if the seek is on a NCI or not!
Clustered Index v Heap

Most Logging Tables

Ascending Keys Insert and Scan heavy Tables
Range Scans OLTP Transaction Tables (banking)
Lots of Deletes Bulk Loading
Tables with Heavy Primary
Seek
3 Always use Identity for Primary/CX Key
Best Practise
Always Use an Identity Column as Primary Key
Extension: Always add a new Surrogate Key

This may shoot you in the foot on large Fact Tables

The Distributed Database
Choice of Identity will cause a lot of pain!

How common is this Issue?

Very Frequent with replication and new MPP Architectures
The Distributed Write Cache ?
Identity creates a bottleneck on the DB
Serializes new records
What if database offline ?
The Over Zealous Dimensional Modeller
This may go bad if you are not a “single hop” data source
Customer DimCustomer

PK CustomerID PK CustomerKey
DimCustomer
CustomerName CustomerID

CustomerAddress PK CustomerKeyKeyKeyKey
CustomerName

CustomerAddress

CustomerKeyKeyKey

CustomerKeyKey

CustomerKey

CustomerID

CustomerName

CustomerAddress
4. We Don’t Need no Indexes?
Best Practise. Add Indexes..
To Reduce IO on important Queries
Seek rather than scan. SELECT * WHERE CustomerID=2
Narrower Scan. SELECT SUM(Qty)

DateKey Customer RegionKey Sales € Qty Cost

Jan 1 1 1000 100 80

Jan 2 1 1000 100 80

Jan 3 1 1000 100 80

Jan 4 1 1000 100 80

Jan 1 1 1000 100 80

Jan 2 1 1000 100 80

Feb 3 1 1000 100 80

Mar 1 1 1000 100 80

April 1 1 1000 100 80

May 1 1 1000 100 80

June 1 1 1000 100 80

July 1 1 1000 100 80

Aug 1 1 1000 100 80

When Indexes go bad
OLTP
Small Tables
Larger Results – See “The Tipping Point” by Kimberly Tripp
When upsert is more important then select
When every column Indexed
High Throughput Queueing Design Patterns

DWH
Bad “Tipping Points”
Staging Tables
Tables that we scan
When avoiding bad statistics is very hard
Data Analytics
Where we need guaranteed query performance for varied workloads.
Guaranteed Performance !!?!
We have a 1TB Table. Query SLA is 5 mins… Add indexes?
5 Stop Worrying about Fragmentation
Best Practise – Defragment the hell out of your database

Why could this be bad ?

Takes a long time and may interfere with query performance
Why Could this be not worth the bother ?
More Memory will reduce reliance on contiguous disk blocks
Most SANs only do random IO anyway
Its mainly important if our primary concerns are Scans
6 Naming Conventions
Best Practice – use one!
Goes Bad When prefix is meta data (object type, data type, size)
Naming – Common Sense
Project with following prefix standards on SSIS
DATA Source
Transform Type (LOAD, TRANSFORM, EXTRACT)
Package
Control Flow
Shape
7 Partitioning
Best Practise – Partition when table it too big or too slow

Ordered Queries
Maintenance Operations Serial Queries
Parallel Queries Dynamic Parallel Queries
8 ORMs
Best Practise ? Hotly debated

Good For
Developer Agility
Code First, Database Second
Integrated Debugging
Domain Business Model
Cache Management
Key Management
Portable
Query Plan Nightmares

Source: http://www.scarydba.com/2014/12/19/pretty-plans-vs-performance/
When ORMs go Bad
Can write truly horrible TSQL and Plans
Naïve context
Parameterisation
The Disaster Scenario

Lazy/Eager Loading
Can be used as an excuse of lack of database expertise
Hard to Index for (lots of Select *)
Everything has good and bad Aspects

It Depends ;-)

SQL Server Health Checklist
100% (1)
SQL Server Health Checklist
26 pages
Performance SQL Server PDF
100% (1)
Performance SQL Server PDF
81 pages
Optimize Data Warehouse Query Performance
No ratings yet
Optimize Data Warehouse Query Performance
41 pages
SQLCAT's Guide To Relational Engine
No ratings yet
SQLCAT's Guide To Relational Engine
238 pages
Best Practices - BW Data Loading & Performance
0% (1)
Best Practices - BW Data Loading & Performance
37 pages
Index Management and Database Tuning Guide
No ratings yet
Index Management and Database Tuning Guide
2 pages
Essential Guide to Database Indexing
No ratings yet
Essential Guide to Database Indexing
6 pages
Essential SQL Server DBA Best Practices
100% (1)
Essential SQL Server DBA Best Practices
27 pages
SSIS Best Practices
100% (2)
SSIS Best Practices
47 pages
11g New Features
No ratings yet
11g New Features
27 pages
Selecting An Index Strategy
No ratings yet
Selecting An Index Strategy
13 pages
Dba Notes
No ratings yet
Dba Notes
202 pages
Perf Monitoring and Troubleshooting - PASS Saturday Oregon
No ratings yet
Perf Monitoring and Troubleshooting - PASS Saturday Oregon
49 pages
Db2 SQL Tuning
No ratings yet
Db2 SQL Tuning
26 pages
SQL Interviw Quetions 2025
No ratings yet
SQL Interviw Quetions 2025
49 pages
IderaWP 7IndexingTipsToImproveSQLServerPerformance PDF
No ratings yet
IderaWP 7IndexingTipsToImproveSQLServerPerformance PDF
8 pages
SQL Server Indexing Tips for Performance
No ratings yet
SQL Server Indexing Tips for Performance
5 pages
SQL Server Administration Best Practices
No ratings yet
SQL Server Administration Best Practices
20 pages
Oracle SQL High Performance Tuning: Guy Harrison Director, R&D Melbourne
100% (1)
Oracle SQL High Performance Tuning: Guy Harrison Director, R&D Melbourne
56 pages
Performance Tuning in SQL Server 2000
No ratings yet
Performance Tuning in SQL Server 2000
5 pages
Top 30 Database Administrator Interview Questions For 2024 - Datacamp
No ratings yet
Top 30 Database Administrator Interview Questions For 2024 - Datacamp
28 pages
Oracle DBA Checklist
No ratings yet
Oracle DBA Checklist
17 pages
SSAS-Analysis Services Query Performance Top 10 Best Practices
No ratings yet
SSAS-Analysis Services Query Performance Top 10 Best Practices
5 pages
SQL SERVER 2005/2008 Performance Tuning For The Developer: Michelle Gutzait
No ratings yet
SQL SERVER 2005/2008 Performance Tuning For The Developer: Michelle Gutzait
112 pages
SQL Server Index Design Best Practices
No ratings yet
SQL Server Index Design Best Practices
27 pages
T-SQL Coding Standards and Best Practices
No ratings yet
T-SQL Coding Standards and Best Practices
32 pages
Oracle Database Tuning Tips
No ratings yet
Oracle Database Tuning Tips
8 pages
SQL Server DBA Daily Checklist
No ratings yet
SQL Server DBA Daily Checklist
3 pages
SQL Server Performance Management Guide
No ratings yet
SQL Server Performance Management Guide
19 pages
Exadata Insights for Oracle Users
No ratings yet
Exadata Insights for Oracle Users
38 pages
SQL Database Development Course 20762C
No ratings yet
SQL Database Development Course 20762C
7 pages
Tuning SQL Queries For Performance
No ratings yet
Tuning SQL Queries For Performance
5 pages
SQL Tuning and Optimization Techniques
No ratings yet
SQL Tuning and Optimization Techniques
20 pages
DBMS Indexes Enhanced
No ratings yet
DBMS Indexes Enhanced
16 pages
Oracle BI Design Best Practices Guide
No ratings yet
Oracle BI Design Best Practices Guide
109 pages
SQL and Database Performance Tuning Guide
No ratings yet
SQL and Database Performance Tuning Guide
5 pages
SQL Server Tuning Interview Q&A
100% (1)
SQL Server Tuning Interview Q&A
12 pages
SQL Statement Tunning
No ratings yet
SQL Statement Tunning
19 pages
The Return of The DB2 Top Ten Lists
No ratings yet
The Return of The DB2 Top Ten Lists
30 pages
Trends in Software Industry Lecture
No ratings yet
Trends in Software Industry Lecture
73 pages
Simple Settings To Help Your AX Solution To Run Faster
No ratings yet
Simple Settings To Help Your AX Solution To Run Faster
42 pages
SQL Query Optimization: 6 Essential Tips
No ratings yet
SQL Query Optimization: 6 Essential Tips
4 pages
Oracle SQL Profile Baselines Bad OATUG 2025
No ratings yet
Oracle SQL Profile Baselines Bad OATUG 2025
68 pages
A Course in In-Memory Data Management: Prof. Hasso Plattner
No ratings yet
A Course in In-Memory Data Management: Prof. Hasso Plattner
8 pages
Oracle Database Performance Tuning Guide
No ratings yet
Oracle Database Performance Tuning Guide
16 pages
Impact of SQL Server On Business Performance
No ratings yet
Impact of SQL Server On Business Performance
10 pages
T-SQL Index Tuning Strategies
No ratings yet
T-SQL Index Tuning Strategies
16 pages
SQL Index Types and Management Guide
No ratings yet
SQL Index Types and Management Guide
7 pages
Exadata Mistakes for Oracle Experts
No ratings yet
Exadata Mistakes for Oracle Experts
49 pages
SQL DBA Question
No ratings yet
SQL DBA Question
4 pages
Understanding Database Indexes and Security
No ratings yet
Understanding Database Indexes and Security
18 pages
Database Maintenance Best Practices
No ratings yet
Database Maintenance Best Practices
26 pages
IEC 61508-Functional Safety Overview
100% (1)
IEC 61508-Functional Safety Overview
33 pages
The Circuit Designer S Companion Third Edition Peter Wilson Digital Download
No ratings yet
The Circuit Designer S Companion Third Edition Peter Wilson Digital Download
502 pages
Firelock Alarm Check Valve: Hang These Instructions On The Installed Valve For Easy Future Reference
No ratings yet
Firelock Alarm Check Valve: Hang These Instructions On The Installed Valve For Easy Future Reference
16 pages
Master Thesis Supervisor Cbs
100% (3)
Master Thesis Supervisor Cbs
4 pages
NXT Head Maintenance
No ratings yet
NXT Head Maintenance
121 pages
EnodeB Moshell Important Commands
No ratings yet
EnodeB Moshell Important Commands
51 pages
Reading Promotion Week
No ratings yet
Reading Promotion Week
5 pages
Analysis and Design of Algorithm Lab Manual
No ratings yet
Analysis and Design of Algorithm Lab Manual
49 pages
Extreme Tourism Lesson 1 4
No ratings yet
Extreme Tourism Lesson 1 4
4 pages
Dsai 130
100% (1)
Dsai 130
228 pages
2.1 Terms of Sizes: Limit and Fits
No ratings yet
2.1 Terms of Sizes: Limit and Fits
21 pages
MAAE 4102 Strength & Fracture Solutions
No ratings yet
MAAE 4102 Strength & Fracture Solutions
31 pages
JavaFX Login with JDBC Example
No ratings yet
JavaFX Login with JDBC Example
3 pages
List of Important Links
No ratings yet
List of Important Links
12 pages
Structural Design Criteria for Hotel Building
No ratings yet
Structural Design Criteria for Hotel Building
13 pages
1910.107 - Spray Finishing Using Flammable and Combustible Materials. - Occupational Safety and Health Administration
No ratings yet
1910.107 - Spray Finishing Using Flammable and Combustible Materials. - Occupational Safety and Health Administration
14 pages
Scooter Price List-2024 (Updated On 3 - Oct)
No ratings yet
Scooter Price List-2024 (Updated On 3 - Oct)
1 page
Metal Detector Final Report Subm
No ratings yet
Metal Detector Final Report Subm
7 pages
Horizontal Pump System Cable Data Sheet
No ratings yet
Horizontal Pump System Cable Data Sheet
14 pages
Iso 5801 2017
No ratings yet
Iso 5801 2017
15 pages
İstanbul Teknik Üniversitesi Fen Bilimleri Enstitüsü
No ratings yet
İstanbul Teknik Üniversitesi Fen Bilimleri Enstitüsü
139 pages
CS405 Assignment 2 Solution Spring 2024
No ratings yet
CS405 Assignment 2 Solution Spring 2024
6 pages
2.abstract Data Type and C++ Classes PDF
No ratings yet
2.abstract Data Type and C++ Classes PDF
40 pages
Hypermesh Checklist
No ratings yet
Hypermesh Checklist
2 pages
For A Good Start & Practice In: Prepared by T. Ibrahim Khalil
100% (1)
For A Good Start & Practice In: Prepared by T. Ibrahim Khalil
42 pages
C2 Hirsch - 2002
No ratings yet
C2 Hirsch - 2002
19 pages
It Syllabus 2015 Regulations PDF
No ratings yet
It Syllabus 2015 Regulations PDF
360 pages
1000 Most Common Words in Portuguese Translated Into Spanish and English
No ratings yet
1000 Most Common Words in Portuguese Translated Into Spanish and English
25 pages
Data Structures: Course Code: 13CT1106 L TPC 4 0 0 3
No ratings yet
Data Structures: Course Code: 13CT1106 L TPC 4 0 0 3
3 pages

Database Design Pitfalls Explained

Uploaded by

Database Design Pitfalls Explained

Uploaded by

March 2015

When Good Design Goes Bad

Dynamic Design Patterns

Most Logging Tables

This may shoot you in the foot on large Fact Tables

How common is this Issue?

DateKey Customer RegionKey Sales € Qty Cost

Jan 1 1 1000 100 80

Jan 2 1 1000 100 80

Jan 3 1 1000 100 80

Jan 4 1 1000 100 80

Jan 1 1 1000 100 80

Jan 2 1 1000 100 80

Feb 3 1 1000 100 80

Mar 1 1 1000 100 80

April 1 1 1000 100 80

April 1 1 1000 100 80

May 1 1 1000 100 80

June 1 1 1000 100 80

July 1 1 1000 100 80

Aug 1 1 1000 100 80

Why could this be bad ?

You might also like