0% found this document useful (0 votes)

25 views10 pages

PostgreSQL Database Architecture

Uploaded by

Kaushal Sutar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views10 pages

PostgreSQL Database Architecture

Uploaded by

Kaushal Sutar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Understanding PostgreSQL database architecture is key to mastering how it stores,

manages, and processes data. Here's a breakdown of the core components and their roles:

🧠 High-Level PostgreSQL Architecture

PostgreSQL follows a client-server architecture. It has the following main components:

Client <---> PostgreSQL Server (Postmaster) <--> Data Files (Storage)

|
--> Background Processes
--> Shared Memory

Image taken from the internet.

1. 🧍‍♂️ Client

● The frontend—applications, tools, or users that send SQL commands to the server.

● Communicate with the server via libpq (PostgreSQL's C API) or other drivers (JDBC,
psycopg, etc.).
2. 🧠 PostgreSQL Server (Postmaster)

This is the main daemon process. It listens for client connections and manages:

● Authentication

● Connection pooling

● Process spawning (forks child processes)

3. ⚙️ Backend Processes

Each client connection gets its own backend process. This process:

● Parses and plans queries

● Accesses data from disk or shared memory

● Returns results to the client

There are also background processes:

Process Description

checkpointer Writes dirty pages to disk at checkpoints

writer Flushes modified buffers to disk in the background

wal writer Writes WAL (Write-Ahead Log) records to disk

autovacuum Reclaims storage by removing dead tuples

bgworker Custom background workers (extensions, etc.)

stats Gathers DB usage statistics
collector

4. 🧊 Shared Memory

Shared between all PostgreSQL processes. Key parts:

● Buffer Pool (Shared Buffers): Caches table and index pages.

● WAL Buffers: Temporarily stores WAL data before writing.

● Work Memory: For sorts, hashes, etc. (per backend).

● Locks & LWLocks: To coordinate access.

5. 🗃️ Storage Layer

Data is stored in files on disk in the data directory ($PGDATA). Important subfolders:

● base/ → actual table data

● pg_wal/ → WAL (Write-Ahead Logs)

● pg_clog/ → transaction commit status

● pg_stat/ → runtime statistics

Each table/index corresponds to one or more files.

6. ✍️ Write-Ahead Logging (WAL)

To ensure durability and crash recovery:

● Changes are written to WAL first (sequential writes)

● Then applied to data files later (checkpointing)

This allows crash recovery by replaying WAL.

7. 📊 System Catalogs

PostgreSQL stores metadata in its own internal tables (in pg_catalog):

● pg_class → tables and indexes

● pg_attribute → columns

● pg_stat_activity → current queries and connections

🧩 Putting It Together

When a query is executed:

1. Client sends SQL to server

2. Server process parses, plans, and executes it

3. Results sent back to client

4. Modified data written to WAL first

5. Eventually flushed to data files by background processes

Let’s dive deeper into the PostgreSQL database architecture, focusing on the critical
internal components, their interaction, and how queries are processed.
1. Query Lifecycle: From Client to Data

Step-by-Step Flow of a Query (e.g., SELECT * FROM users WHERE id =

5;)
Stage Description

1. Client Sends the SQL query to the PostgreSQL server.

2. Postmaster Accepts the connection, forks a backend process to handle the

session.

3. Parser Converts SQL into a parse tree (SELECT → what table, what column,

what condition).

4. Planner/Optimizer Chooses the best execution plan using statistics, indexes, cost
estimates.

5. Executor Executes the plan, accesses data from shared buffers or disk,
returns result.

🔍 Example
SELECT name FROM users WHERE id = 5;

1. The planner checks if there's an index on id.

2. If found, uses Index Scan, else Sequential Scan.

3. Fetches name from the matching row.

4. Sends the result to the client.

2. Backend Processes (Detailed)

Each PostgreSQL session runs as a separate OS process (forked from postmaster). Key
background processes:

Process Role

Checkpointer Periodically writes dirty pages (modified buffers) to disk for

durability.

WAL Writer Flushes WAL buffers to disk to ensure WAL is persistent.

Background Writer Frees memory by writing less-used data from shared buffers
to disk.

Autovacuum Removes dead tuples and prevents table bloat (important for
MVCC).

Stats Collector Tracks query usage, index usage, table activity, etc.

Logical Replication Handle replication of data between PostgreSQL instances.

Workers

3. Shared Memory Areas (Memory Architecture)

🔹 Shared Buffers

● Stores most recently used table/index pages.

● Acts as a cache to reduce disk I/O.

Think of it like PostgreSQL’s internal "RAM-based cache".

🔹 WAL Buffers

● Temporary space for uncommitted changes before they're flushed to WAL logs.

🔹 Work Memory (per backend)

● Temporary memory used for query operations like sorting, hashing, joins.

🔹 Maintenance Work Memory

● Used for VACUUM, CREATE INDEX, ANALYZE.

4. Write-Ahead Logging (WAL)

WAL ensures durability and crash recovery.

💡 How WAL Works:

1. Before modifying a data file, PostgreSQL writes a log record to WAL.

2. WAL is flushed to disk (fsync) before confirming a transaction commit.

3. If the system crashes, WAL is replayed to restore consistency.

📂 WAL Files

● Stored in $PGDATA/pg_wal/

● Each file is 16MB by default.

● Used in replication, point-in-time recovery (PITR).

5. MVCC – Multi-Version Concurrency Control

MVCC allows concurrent reads and writes without locks.

🔸 What it means:
● Every transaction sees a snapshot of the database.

● Updates don't overwrite data—they create new versions (tuples).

● Old tuples are marked as "dead" and cleaned up by autovacuum.

🔹 Tuple Metadata:

Each row has hidden fields:

● xmin: ID of the transaction that inserted it.

● xmax: ID of the transaction that deleted/updated it.

6. Storage Engine & Filesystem

Tables and indexes are stored as binary heap files.

📂 Data Directory ($PGDATA)

● base/: contains subdirectories for each database.

● pg_wal/: write-ahead logs.

● global/: system-wide metadata.

● pg_tblspc/: tablespaces (alternative locations for data).

🧱 File Structure:

● Each table is stored in one or more files named after the OID of the table.

● If a table grows beyond 1GB, PostgreSQL splits it into multiple segments (e.g., 12345,
12345.1, 12345.2, etc.)
7. System Catalogs
PostgreSQL stores its metadata in regular tables (called system catalogs).

Catalog Table Description

pg_class Info about tables and indexes.

pg_attribute Info about columns.

pg_namespace Info about schemas.

pg_stat_activ Active queries and

ity connections.

pg_locks Lock monitoring.

8. Query Planning & Optimization (More Depth)

PostgreSQL generates multiple plans and chooses the cheapest.

🔹 Key Plan Types:

● Seq Scan: Full table scan.

● Index Scan: Uses B-tree or GIN index.

● Bitmap Scan: Efficient for multiple matches.

● Nested Loop / Hash Join / Merge Join: For joining tables.

🔍 View Execution Plan

EXPLAIN ANALYZE SELECT * FROM users WHERE email = '[email protected]';

You’ll see the chosen scan type, costs, row estimates, and actual execution time.
Summary Diagram (Mental Model)
Client
↓
Postmaster (accepts connection)
↓
Backend Process (per user)
├─ Parser → Planner → Executor
├─ Shared Buffers (data cache)
├─ WAL Buffer
↓
Data Directory (on disk)
├─ Base (table data)
├─ pg_wal (logs)
├─ global (metadata)

Postgre SQL
No ratings yet
Postgre SQL
55 pages
Foundations PostgreSQL Administration 13
100% (1)
Foundations PostgreSQL Administration 13
307 pages
Postgresql Course Material
No ratings yet
Postgresql Course Material
205 pages
PostgreSQL Architecture Presentation
No ratings yet
PostgreSQL Architecture Presentation
6 pages
PostgreSQL Architecture Overview
No ratings yet
PostgreSQL Architecture Overview
35 pages
Introduction Postgre SQLAdministration V11
No ratings yet
Introduction Postgre SQLAdministration V11
274 pages
The Internals of PostgreSQL - Chapter 2 Process and Memory Architecture
No ratings yet
The Internals of PostgreSQL - Chapter 2 Process and Memory Architecture
3 pages
PostgreSQL Interview QA
No ratings yet
PostgreSQL Interview QA
9 pages
PostgreSQL Performance Tuning Guide
No ratings yet
PostgreSQL Performance Tuning Guide
8 pages
PostgreSQL Performance & Storage Guide
100% (1)
PostgreSQL Performance & Storage Guide
18 pages
PostgreSQL Essentials v16 Student
100% (1)
PostgreSQL Essentials v16 Student
400 pages
PostgreSQL Architecture Document by Subham Dash 1710404181
100% (1)
PostgreSQL Architecture Document by Subham Dash 1710404181
11 pages
PostgreSQL Guide for Python Developers
No ratings yet
PostgreSQL Guide for Python Developers
215 pages
PostgreSQL Write Processes Explained
No ratings yet
PostgreSQL Write Processes Explained
5 pages
PG Install
No ratings yet
PG Install
3 pages
0292 Introduction Postgresql
No ratings yet
0292 Introduction Postgresql
91 pages
PostgreSQL Beginner's Guide
No ratings yet
PostgreSQL Beginner's Guide
175 pages
Postgres The First Experience
No ratings yet
Postgres The First Experience
173 pages
Postgres Amdocs Day1
No ratings yet
Postgres Amdocs Day1
199 pages
POSTGRES SQL ARCHITECTURE With Timestamp
No ratings yet
POSTGRES SQL ARCHITECTURE With Timestamp
2 pages
12 Algorithms For System Design Interviews
No ratings yet
12 Algorithms For System Design Interviews
8 pages
Postgres Arch
No ratings yet
Postgres Arch
13 pages
PostgreSQL For Beginners
100% (6)
PostgreSQL For Beginners
142 pages
Distributed PostgreSQL Overview
No ratings yet
Distributed PostgreSQL Overview
118 pages
PostgreSQL Performance Optimization Guide
No ratings yet
PostgreSQL Performance Optimization Guide
30 pages
PostgreSQL Interview Questions Overview
100% (1)
PostgreSQL Interview Questions Overview
5 pages
PostgreSQL Overview for CENG301
100% (1)
PostgreSQL Overview for CENG301
13 pages
PostgreSQL Architecture and Use Cases
No ratings yet
PostgreSQL Architecture and Use Cases
21 pages
Postgres MVCC
No ratings yet
Postgres MVCC
2 pages
Conceptual Architecture of Postgresql
100% (1)
Conceptual Architecture of Postgresql
23 pages
Introbook v4 en
No ratings yet
Introbook v4 en
145 pages
Understanding The PostgreSQL Architecture - Severalnines
No ratings yet
Understanding The PostgreSQL Architecture - Severalnines
13 pages
Lecture 1
No ratings yet
Lecture 1
35 pages
DBA Roadmap - Learn To Become A Database Administrator With Postg
No ratings yet
DBA Roadmap - Learn To Become A Database Administrator With Postg
8 pages
CENG301 DBMS - Session-7
No ratings yet
CENG301 DBMS - Session-7
31 pages
CSE-6001 PostgreSQL Tutorial
No ratings yet
CSE-6001 PostgreSQL Tutorial
34 pages
Accidentaldbalinuxcon 130102190320 Phpapp02
No ratings yet
Accidentaldbalinuxcon 130102190320 Phpapp02
61 pages
PostgreSQL Interview Q&A Guide
No ratings yet
PostgreSQL Interview Q&A Guide
6 pages
Postgres DB Installation SOW Guide
No ratings yet
Postgres DB Installation SOW Guide
7 pages
PostGresSQL Study Stuff For Will
No ratings yet
PostGresSQL Study Stuff For Will
3 pages
01 Become A PostgreSQL DBA Understanding The Architecture
No ratings yet
01 Become A PostgreSQL DBA Understanding The Architecture
10 pages
PostgeeSQL7 2
No ratings yet
PostgeeSQL7 2
36 pages
PostgreSQL Database Administration Vol 1
100% (3)
PostgreSQL Database Administration Vol 1
124 pages
Postgresql: Complete
No ratings yet
Postgresql: Complete
56 pages
PostgreSQL Architecture Overview
No ratings yet
PostgreSQL Architecture Overview
5 pages
PostgreSQL Architecture Deep-Dive - Brijesh Mehra
No ratings yet
PostgreSQL Architecture Deep-Dive - Brijesh Mehra
75 pages
Post GRE
No ratings yet
Post GRE
59 pages
PostgreSQL Overview & Windows Installation
No ratings yet
PostgreSQL Overview & Windows Installation
21 pages
PostgreSQL Database Administration Guide
No ratings yet
PostgreSQL Database Administration Guide
32 pages
PostgreSQL Database Application Guide
No ratings yet
PostgreSQL Database Application Guide
23 pages
PostgreSQL DBA Responsibilities & Features
No ratings yet
PostgreSQL DBA Responsibilities & Features
14 pages
ADC Theory
No ratings yet
ADC Theory
7 pages
CS232L-Lab Manual
No ratings yet
CS232L-Lab Manual
198 pages
Postgresql 7.2 Tutorial: The Postgresql Global Development Group
No ratings yet
Postgresql 7.2 Tutorial: The Postgresql Global Development Group
35 pages
52492-rc071 Postgresql PDF
No ratings yet
52492-rc071 Postgresql PDF
11 pages
Admin Workshop
No ratings yet
Admin Workshop
117 pages
Database Recovery & Logging Case Studies
No ratings yet
Database Recovery & Logging Case Studies
64 pages
Dgraph: Distributed Graph Database Overview
No ratings yet
Dgraph: Distributed Graph Database Overview
11 pages
Autovaccum in PostgreSQL
No ratings yet
Autovaccum in PostgreSQL
9 pages
DBMS Unit 5 Arti Kak
No ratings yet
DBMS Unit 5 Arti Kak
15 pages
PostgreSQL IQ
100% (1)
PostgreSQL IQ
27 pages
DBMS Project Report 2023-2024
No ratings yet
DBMS Project Report 2023-2024
42 pages
Best Practices and Examples For Developing Roles in SAP HANA - Example Project
No ratings yet
Best Practices and Examples For Developing Roles in SAP HANA - Example Project
48 pages
20 Hard Dbms Questions
No ratings yet
20 Hard Dbms Questions
3 pages
Swati Saxena Essential PostgreSQL Your Guide To Database Design Query Optimization and Administra
No ratings yet
Swati Saxena Essential PostgreSQL Your Guide To Database Design Query Optimization and Administra
429 pages
A Brief Overview On Apache CouchDB1122
No ratings yet
A Brief Overview On Apache CouchDB1122
25 pages
Sap Hana Mini Check
No ratings yet
Sap Hana Mini Check
232 pages
Fast Serializable Multi-Version Concurrency Control For Main-Memory Database Systems
No ratings yet
Fast Serializable Multi-Version Concurrency Control For Main-Memory Database Systems
13 pages
CockroachDB - The Resilient Geo-Distributed SQL Database PDF
No ratings yet
CockroachDB - The Resilient Geo-Distributed SQL Database PDF
17 pages
Unit 3 DBMS 2
No ratings yet
Unit 3 DBMS 2
9 pages
IOT Training Report MSME
No ratings yet
IOT Training Report MSME
36 pages
Case Study On Google Spanner
No ratings yet
Case Study On Google Spanner
12 pages
Concurrency Control in Database Systems
No ratings yet
Concurrency Control in Database Systems
22 pages
Concurrency Control Techniques in DBMS
No ratings yet
Concurrency Control Techniques in DBMS
57 pages
PostgreSQL When It's Not Your Job
100% (1)
PostgreSQL When It's Not Your Job
183 pages
Locks Distributed Systems LOCKS
No ratings yet
Locks Distributed Systems LOCKS
40 pages
Concurrency Control in Databases
No ratings yet
Concurrency Control in Databases
8 pages
InfiniDB Concepts Guide
No ratings yet
InfiniDB Concepts Guide
57 pages
Unit-5.1 Transaction and Concurrency Control
No ratings yet
Unit-5.1 Transaction and Concurrency Control
45 pages
eXtremeDB User Guide
No ratings yet
eXtremeDB User Guide
266 pages
Domo Architecture Overview
No ratings yet
Domo Architecture Overview
7 pages
Solved Question Paper Questions
No ratings yet
Solved Question Paper Questions
55 pages
Oreilly Report What Is Distributed SQL
No ratings yet
Oreilly Report What Is Distributed SQL
37 pages
16 Multi Version Concurrency Control
No ratings yet
16 Multi Version Concurrency Control
66 pages
Non-Functional Requirements in P2P Middleware
No ratings yet
Non-Functional Requirements in P2P Middleware
30 pages