Advanced Database Systems
Spring 2025
Lecture #04:
HW & Disk Space Management
R&G: Chapters 1, 9.1, 9.3
DBMS: BIG PICTURE
SQL clients interact with a DBMS
You know how to write a SQL query
How is a SQL query executed?
[Diagram: SQL Client → Database Management System → Database]
DBMS: QUERY PLANNING
Parse, check, and verify the SQL query
SELECT [Link]
FROM Student S JOIN Enrolled E
ON [Link] = [Link]
WHERE [Link] = ‘INF-11199’
Translate into an efficient relational
query plan that can be executed
DBMS: OPERATOR EXECUTION
Execute a dataflow by operating on
records and files
[Query plan:]
π [Link]                   (sorting)
⋈ [Link] = [Link]          (sort-merge join)
σ [Link] = ‘INF-11199’     (B+ tree)
Student    Enrolled        (scan)
DBMS: FILES & INDEX MANAGEMENT
Organise tables and records as
groups of pages in a logical file

sid    name   dept     age
12344  Jones  CS       18
12355  Smith  Physics  23
12366  Gold   CS       21

[Diagram: database file on disk with directory page, header pages, Page 1 and Page 2]
DBMS: BUFFER MANAGEMENT
Transfer data between disk and memory
[Diagram: buffer pool in memory caching pages (directory, header, Page 2) from the database file on disk]
DBMS: DISK SPACE MANAGEMENT
Translate page requests into reading &
writing physical bytes on devices
[Diagram: pages mapped to raw bytes on the disk]
ARCHITECTURE OF A DBMS
Organised in layers
Each layer abstracts the layer below
Manage complexity
Performance assumptions
Example of good systems design
[Layer stack: SQL Client, Query Planning, Operator Execution, Files & Index Management, Buffer Management, Disk Space Management, Database]
DBMS: CONCURRENCY & RECOVERY
Two cross-cutting modules related to
storage and memory management
[Diagram: Concurrency Control and Recovery cut across the storage and memory layers]
OUTLINE
Storage Media
Disk Space Management
Buffer Management
File Layout
Page Layout
Record Layout
DISK-ORIENTED ARCHITECTURE
Most database systems are designed for non-volatile disk storage*
The primary location of the database is on disks (HDD and/or SSD)
Data processing happens in volatile main memory
The DBMS is responsible for moving data between disk and main memory
Major implications
Data stored on disk is not byte addressable. Instead, an API:
READ: transfer “page” of data from disk to RAM
WRITE: transfer “page” of data from RAM to disk
Disk reads & writes are very, very slow! ⇒ Must plan carefully!
* Volatile storage only maintains its data while the device is powered
WHY NOT STORE ALL IN MAIN MEMORY?
Costs too much
Cost of 1 TB storage (2020): $50 for HDD, $200 for SSD, $6,000 for RAM
High-end databases today in the petabyte range!
Roughly 60% of the cost of a production system is in the disks
Main memory is volatile
Obviously important if DB stops/crashes. We want data to be saved!
Some specialised systems do store entire databases in main memory
Faster than disk-oriented but with much higher cost/GB
Suitable for small databases
STORAGE HIERARCHY
Faster, smaller, volatile, byte-addressable (top):
  CPU Registers
  CPU Caches
  DRAM
Slower, larger, non-volatile, block-addressable (bottom):
  SSD
  HDD
  Network Storage
STORAGE HIERARCHY

                     Latency    Capacity
  CPU Registers      < 1 ns     B
  CPU Caches         < 10 ns    KB/MB
  DRAM               100 ns     GB      ← memory for active data (primary storage)
  SSD                0.1 ms     GB/TB
  HDD                10 ms      TB      ← disk for main database (secondary storage)
  Network Storage    30 ms      PB
ANATOMY OF A DISK
Platters rotate (say 15000 rpm)
Disk arm moves in or out to position
disk heads on a desired track
Tracks under heads make a “cylinder”
Only one head reads/writes at any one time
Block size is a multiple of (fixed) sector size
Sector = minimum storage unit (512 B or 4 KB)
ACCESSING A DISK PAGE
Data is stored and retrieved in units called disk blocks
Block size is determined by the filesystem (usually 4 KB, sometimes up to 64 KB)
Unlike RAM, time to retrieve a block depends on its location
Time to access (read/write) a disk block:
Seek time: moving disk arm to position disk heads on track
Rotational delay: waiting for target block to rotate under a head
Transfer time: actually moving data to/from disk surface
Seagate Cheetah 15K.7
4 disks, 8 heads, avg. 512 KB/track, 600GB capacity
rotational speed: 15 000 rpm
average seek time: 3.4 ms
transfer rate ≈ 163 MB/s
Access time to read one block of size 8 KB:
  Average seek time                             3.40 ms
  Average rotational delay (½ · 1/15000 min)    2.00 ms
  Transfer time (8 KB / 163 MB/s)               0.05 ms
  Total access time                             5.45 ms
Seek time and rotational delay dominate!
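The arithmetic above can be reproduced directly; a short sketch using the drive parameters from the Seagate slide (8 KB block assumed):

```python
# Average time to read one 8 KB block on a 15,000 rpm disk
# (seek time and transfer rate from the Seagate Cheetah 15K.7 slide).
RPM = 15_000
SEEK_MS = 3.4            # average seek time
RATE_MB_S = 163          # sustained transfer rate

rotational_ms = 0.5 * (60_000 / RPM)          # wait half a revolution on average
transfer_ms = (8 / 1024) / RATE_MB_S * 1000   # move 8 KB at 163 MB/s

total_ms = SEEK_MS + rotational_ms + transfer_ms
print(f"seek {SEEK_MS} ms + rotation {rotational_ms:.2f} ms "
      f"+ transfer {transfer_ms:.2f} ms ≈ {total_ms:.2f} ms")
```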
SEQUENTIAL VS. RANDOM ACCESS
What about accessing 1000 blocks of size 8 KB?
Random: 1000 · 5.45 ms = 5.45 s
Sequential: 3.4 ms + 2 ms + 1000 · 0.05 ms ≈ 55 ms
tracks store only 512KB ⟹ some additional (< 5 ms) track-to-track seek time
Sequential I/O is orders of magnitude faster than random I/O
avoid random I/O at all costs
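Plugging the per-block numbers from the previous slide into both access patterns makes the gap concrete (a sketch; the small track-to-track seeks are ignored):

```python
# Random vs. sequential read of 1000 blocks of 8 KB each,
# using the per-block costs from the Cheetah example above.
SEEK_MS, ROT_MS, XFER_MS = 3.4, 2.0, 0.05
N = 1000

random_ms = N * (SEEK_MS + ROT_MS + XFER_MS)     # reposition for every block
sequential_ms = SEEK_MS + ROT_MS + N * XFER_MS   # reposition once, then stream

print(f"random:     {random_ms / 1000:.2f} s")   # ≈ 5.45 s
print(f"sequential: {sequential_ms:.1f} ms")     # ≈ 55 ms
```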
ARRANGING BLOCKS ON DISK
‘Next’ block concept:
sequential blocks on same track, followed by
blocks on same cylinder, followed by
blocks on adjacent cylinder
Arrange file pages sequentially by ‘next’ on disk
Minimize seek and rotational delay
For a sequential scan, pre-fetch several blocks at a time!
Reading large consecutive blocks
“Amortises” seek time and rotational delay
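The amortisation is easy to quantify: a consecutive run pays one seek plus one rotational delay, then streams at the transfer rate. A sketch, reusing the Cheetah numbers from earlier:

```python
# Effective bandwidth when reading a consecutive run of blocks:
# one seek + one rotational delay, then pure transfer.
SEEK_S, ROT_S, RATE_MB_S = 0.0034, 0.002, 163

def effective_mb_s(run_mb):
    return run_mb / (SEEK_S + ROT_S + run_mb / RATE_MB_S)

for run_mb in (0.008, 0.064, 1.0, 8.0, 64.0):
    print(f"{run_mb:>6} MB run -> {effective_mb_s(run_mb):6.1f} MB/s")
```

Small runs are dominated by positioning cost; large runs approach the drive's raw transfer rate.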
SOLID STATE DRIVES
Alternative to conventional hard disks
Data accessed in pages, internally pages are organised into blocks
Fine-grain reads (4-8 KB pages), coarse-grain writes (1-2 MB blocks)
Issues in current generation (NAND)
Write amplification: Writing data in small pages causes erasing big blocks
Limited endurance: Only 2K-3K erasures before cell failure
Wear levelling: SSD controller needs to keep moving hot write units around
Price: SSD is 2-5x more expensive than HDD
SOLID STATE DRIVES
Read is fast and predictable
Single read access time: 30 µs
4KB random reads: ~500 MB/sec
Sequential reads: ~525 MB/sec
But writes are not! Random writes are much slower
Single write access time: 30 µs
4KB random writes: ~120 MB/sec
Sequential writes: ~480 MB/sec
Random access still slower than sequential access
SSD VS. HDD
SSD can achieve 1-10x the bandwidth (bytes/sec) of ideal HDD
Note: Ideal HDD spec numbers are hard to achieve
Expect 10-100x bandwidth for non-sequential reads
Locality matters for both
Reading/writing to “far away” blocks on HDD requires slow seek/rotation delay
Writing 2 “far away” blocks on SSD can require writing multiple much larger units
High-end flash drives are getting much better at this
And don’t forget
SSD is 2-5x more expensive than HDD
BOTTOM LINE
Very large DBs: relatively traditional
Disk still offers the best cost/GB by a lot
SSDs improve performance and performance variance
Smaller DB story is changing quickly
SSDs win at the low end (modest DB sizes)
Many interesting databases fit in RAM
Lots of change brewing on the HW storage tech side
Non-volatile memory likely to affect the design of future systems
We will focus on traditional RAM and disk
DATABASE STORAGE
Most DBMSs store data as one or more files on disk
Files consist of pages (loaded in memory), pages contain records
Data on disk is read & written in large chunks of sequential bytes
Block = Unit of transfer for disk read/write
Page = A common synonym for “block”
In some textbooks, “page” = a block-sized chunk of RAM
We will treat “block” and “page” as synonyms
I/O operation = read/write disk operation
Sequential pages: reading “next” page is fastest
SYSTEM DESIGN GOALS
Goal: allow the DBMS to manage databases > available main memory
Disk reads/writes are expensive ⟹ must be managed carefully
Minimise disk I/O, maximise usage of data per I/O
Spatial control
Where to write pages on disk
Goal: keep pages often used together as physically close as possible on disk
Temporal control
When to read pages into memory and when to write them to disk
Goal: minimise the number of CPU stalls from having to read data from disk
DISK SPACE MANAGEMENT
Lowest layer of DBMS, manages space on disk
Map pages to locations on disk
Load pages from disk to memory
Save pages back to disk
Introduces the concept of a page
Typical page size: 4–64 KB (a multiple of 4 KB)
Each page has a unique identifier: page ID
Higher levels call upon this layer to:
Allocate/de-allocate a page
Read/write a page
DISK SPACE MANAGEMENT: PAGE REQUESTS
Disk space manager can get requests for a sequence of pages
E.g., when higher levels execute a scan operator on a relation
Such requests are best satisfied by pages stored sequentially on disk
Physical details hidden from higher levels of system
Higher levels may “safely” assume Next Page is fast, so they will
simply expect sequential runs of pages to be quick to scan
Disk space manager aims to intelligently lay out data on disk
to meet the performance expectation of higher levels as best as possible
DISK SPACE MANAGEMENT: IMPLEMENTATION
Using local filesystem (FS)
Allocate one large “contiguous” file on an empty disk
Rely on OS and FS that sequential pages in this file are physically contiguous on disk
A logical database “file” may span multiple FS files on multiple disks/machines
Disk space manager maintains a mapping from page IDs to physical locations
physical location = filename + offset within that file
The OS and other apps know nothing about the contents of these files
Only the DBMS knows how to decipher their contents
Early DBMSs in the 1980s used custom ‘filesystems’ on raw storage
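The page-ID-to-offset mapping described above can be sketched as a toy single-file manager. This is a minimal illustration, not any real system's API: the class name, method names, and the 8 KB page size are all assumptions.

```python
import os, tempfile

PAGE_SIZE = 8192  # assumed 8 KB pages; real systems use 4-64 KB

class DiskSpaceManager:
    """Toy disk space manager: one big file, page ID -> byte offset."""

    def __init__(self, path):
        self.f = open(path, "w+b")
        self.next_pid = 0
        self.free = []                 # de-allocated page IDs, reused first

    def allocate(self):
        if self.free:
            return self.free.pop()
        pid = self.next_pid
        self.next_pid += 1
        self.f.truncate((pid + 1) * PAGE_SIZE)   # grow file to cover the page
        return pid

    def deallocate(self, pid):
        self.free.append(pid)          # real systems track this persistently

    def write_page(self, pid, data):
        assert len(data) == PAGE_SIZE
        self.f.seek(pid * PAGE_SIZE)   # physical location = offset in the file
        self.f.write(data)
        self.f.flush()

    def read_page(self, pid):
        self.f.seek(pid * PAGE_SIZE)
        return self.f.read(PAGE_SIZE)

mgr = DiskSpaceManager(os.path.join(tempfile.mkdtemp(), "demo.db"))
pid = mgr.allocate()
mgr.write_page(pid, b"x" * PAGE_SIZE)
assert mgr.read_page(pid) == b"x" * PAGE_SIZE
```

Because consecutive page IDs map to consecutive offsets in one file, a scan over a run of page IDs becomes sequential I/O whenever the filesystem keeps the file physically contiguous.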
SUMMARY
Magnetic disk and flash storage
Random access vs. sequential access (10x)
Physical data placement is important
Disk space management
Exposes data as a collection of pages
Pages: block-level organisation of bytes on disk
API to read/write pages to disk
Provides “next” locality
Abstracts device and file system details