
SQL Part 2

The document outlines a data pipeline architecture known as the Medallion Architecture, which consists of three layers: Bronze, Silver, and Gold. Each layer serves a specific purpose, starting from raw data collection to processed and aggregated data for analytics. It also describes the differences between OLTP and OLAP systems, highlighting their respective use cases in managing transactional data and performing complex data analysis.


Data Pipeline

A typical pipeline (for example, one fed by the BigQuery Data Transfer Service) moves data through three layers: Bronze holds raw, integrated data; Silver holds cleaned and transformed data; Gold holds aggregated data and measures.

Source
The source is the starting point in the data pipeline where raw data is collected.
Data can come from various sources such as:
* Relational Databases (MySQL, PostgreSQL)
* File Storage (CSV, Excel, JSON)
* APIs (REST, SOAP)
* Others

Destination
The destination is the end point where processed data is stored or used.
Destinations can include:
* Data Warehouses (BigQuery, Snowflake, Redshift)
* Data Lakes (Amazon S3, Google Cloud Storage)
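To make the source-to-destination hop concrete, here is a minimal sketch of loading a CSV "source" into a warehouse-style "destination" table. The table name and columns are illustrative, and sqlite3 stands in for a real warehouse like BigQuery or Snowflake.

```python
import csv
import io
import sqlite3

# A CSV "source" (in-memory here; in practice a file or object-store blob).
csv_source = io.StringIO("id,order_number,total\n1,ORD-001,99.90\n2,ORD-002,15.50\n")

# A warehouse-style "destination" (sqlite3 stands in for BigQuery et al.).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE staging_orders (id INT PRIMARY KEY, order_number TEXT, total REAL)")

# Parse the source rows and append them to the destination table.
rows = [(int(r["id"]), r["order_number"], float(r["total"])) for r in csv.DictReader(csv_source)]
conn.executemany("INSERT INTO staging_orders VALUES (?, ?, ?)", rows)
conn.commit()

print(conn.execute("SELECT COUNT(*) FROM staging_orders").fetchone()[0])  # 2
```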

OLTP vs OLAP

OLTP (Online Transaction Processing)

OLTP systems are designed to manage transactional data. They support day-to-day operations and are optimized for many short online transactions.
Key Features:
* Transaction-Oriented: OLTP systems manage transactions that involve insert, update, and delete operations.
* Real-Time Processing: Data is processed in real time, ensuring that the system reflects the current state of business operations.
* High Throughput: Optimized for handling a large number of small transactions per second.
* Use Case: Ideal for applications like order processing, inventory management, and customer relationship management (CRM).
Example
If your project involves managing real-time sales transactions, customer
orders, or inventory updates, OLTP would be used. For example, a MySQL
database could handle incoming sales transactions, ensuring that each order is
processed quickly and accurately.
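The order-processing pattern above can be sketched as one short transaction that records an order and decrements stock together, committing both changes or neither. This is a hedged illustration with made-up table names, using sqlite3 rather than MySQL for a self-contained example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE inventory (product_id INT PRIMARY KEY, stock INT)")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, product_id INT, quantity INT)")
conn.execute("INSERT INTO inventory VALUES (1, 10)")

# One short OLTP transaction: record the order and update stock atomically.
# The context manager commits on success and rolls back on error.
with conn:
    conn.execute("INSERT INTO orders (product_id, quantity) VALUES (1, 3)")
    conn.execute("UPDATE inventory SET stock = stock - 3 WHERE product_id = 1")

print(conn.execute("SELECT stock FROM inventory WHERE product_id = 1").fetchone()[0])  # 7
```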

OLAP (Online Analytical Processing)

OLAP systems are designed for complex queries and data analysis. They are used to support business decision-making and provide insights into the data through various analytical operations.
Key Features:
* Complex Queries: OLAP systems handle complex queries that involve aggregations, joins, and multi-dimensional analysis.
* Data Warehousing: OLAP is often associated with data warehousing, where large volumes of historical data are stored for analysis.
* Performance: Optimized for read-heavy operations; can handle large volumes of data efficiently.
* Use Case: Ideal for data warehousing and business intelligence.

Example
If your project involves analyzing sales data to identify trends, forecast
future sales, or generate detailed reports, OLAP would be used. For instance,
using a data warehouse like BigQuery, you could run complex SQL queries to
aggregate sales data across different regions and time periods to gain
insights.
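An OLAP-style aggregation like the one described can be sketched as a GROUP BY over sales facts. The schema and data here are illustrative, and sqlite3 stands in for a warehouse like BigQuery.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, month TEXT, total REAL)")
conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", [
    ("EMEA", "2024-01", 100.0),
    ("EMEA", "2024-02", 150.0),
    ("APAC", "2024-01", 80.0),
])

# Aggregate sales across regions -- the classic read-heavy OLAP shape.
rows = conn.execute(
    "SELECT region, SUM(total) FROM sales GROUP BY region ORDER BY region"
).fetchall()
print(rows)  # [('APAC', 80.0), ('EMEA', 250.0)]
```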
Medallion Architecture:
The Three-Layer Design for Data Management

The Medallion Architecture is a data architecture design pattern used in data warehousing. It organizes data processing and storage into multiple layers, each serving a specific purpose. A medallion architecture consists of three layers: Bronze, Silver, and Gold. Data flows from one layer to the next, gradually moving from raw, unstructured data to high-quality, curated data.


Bronze
Bronze data is the initial stage of a data pipeline. Think of Bronze data as raw-format data coming from many different sources, streams, or batch jobs. In many cases, this data is ingested in various forms, including JSON, and is untransformed and unchanged from whatever sources produced it. Often, services need access to this raw data and do not need any additional transformations. However, to increase its value further, this data needs cleansing and filtering to transform it into something more consumable.
The Bronze layer contains unvalidated data. Data ingested in the Bronze layer typically:
* Maintains the raw state of the data source.
* Is appended incrementally and grows over time.

Example query:

CREATE TABLE staging_orders (
  id INT PRIMARY KEY,
  user_id INT,
  order_number VARCHAR(50),
  total DECIMAL(10, 2),
  payment_method VARCHAR(50),
  created_at TIMESTAMP,
  updated_at TIMESTAMP
);
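The append-only, grows-over-time behavior described above can be sketched as follows. The table and payloads are illustrative; the point is that Bronze ingestion only ever inserts raw records, batch after batch, never updating or deleting.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE bronze_events (ingested_batch INT, raw_payload TEXT)")

batch_1 = ['{"event":"click"}', '{"event":"view"}']
batch_2 = ['{"event":"click"}']  # duplicates are acceptable at this layer

# Each load appends raw payloads as-is; the table only grows over time.
for batch_no, batch in enumerate([batch_1, batch_2], start=1):
    conn.executemany("INSERT INTO bronze_events VALUES (?, ?)",
                     [(batch_no, payload) for payload in batch])

print(conn.execute("SELECT COUNT(*) FROM bronze_events").fetchone()[0])  # 3
```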

Silver
Silver data is considered cleansed and processed to make it more accessible. Data can be normalized at this stage to make it easier to query. Silver data is more sanitized, cleaner, and filtered to give a more refined view of the data. Values are better understood, tables are joined, and constraints are added to create better data integrity, adding additional value. This results in staged, accurate datasets and useful structures that can be queried by analytical services and serve a wider purpose for an enterprise.
Recall that while the Bronze layer contains the entire data history in a nearly raw state, the Silver layer represents a validated, enriched version of the data that can be trusted for downstream analytics.
* Data is cleansed and transformed to ensure consistency and quality.
* This layer serves as a preparation stage for final analysis and reporting.

Example query:

CREATE TABLE intermediate_orders AS
SELECT
  id,
  user_id,
  order_number,
  total,
  payment_method,
  created_at,
  updated_at
FROM staging_orders;
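The CTAS above copies rows as-is; in practice the Silver step usually adds the cleansing and filtering the text describes. Here is a hedged sketch with illustrative columns: deduplicating ingested rows and dropping invalid ones, runnable via sqlite3.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE staging_orders (id INT, order_number TEXT, total REAL)")
conn.executemany("INSERT INTO staging_orders VALUES (?, ?, ?)", [
    (1, "ORD-001", 99.9),
    (1, "ORD-001", 99.9),   # same row ingested twice at the Bronze layer
    (2, "ORD-002", -5.0),   # invalid negative total
    (3, "ORD-003", 15.5),
])

# Silver: deduplicate and filter out rows that fail a validity check.
conn.execute("""
    CREATE TABLE intermediate_orders AS
    SELECT DISTINCT id, order_number, total
    FROM staging_orders
    WHERE total >= 0
""")

print(conn.execute("SELECT COUNT(*) FROM intermediate_orders").fetchone()[0])  # 2
```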

Gold
Gold data summarizes data and adds business-level aggregations. It is the most useful layer for analytics, as it is presented in a well-constructed way and is ready to be visualized through Business Intelligence and Analytics dashboards or used to train Machine Learning models for predictive analytics solutions.

Example query:

CREATE TABLE mart_inventory_summary AS
SELECT
  i.id AS inventory_id,
  i.product_id,
  p.product_name,
  p.brand_name,
  p.department,
  i.stock,
  i.product_cost,
  d.name AS distribution_center_name,
  d.city AS distribution_center_city,
  d.state AS distribution_center_state,
  i.created_at AS inventory_created_at,
  i.updated_at AS inventory_updated_at
FROM intermediate_inventory_items i
INNER JOIN intermediate_products p ON i.product_id = p.id
INNER JOIN intermediate_distribution_centers d ON i.distribution_center_id = d.id;
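The mart query above denormalizes via joins; Gold marts frequently add true business-level aggregations on top as well. A hedged, runnable sketch (schemas trimmed to only the columns needed, with made-up data) summing stock per distribution center:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE intermediate_inventory_items "
             "(id INT, distribution_center_id INT, stock INT)")
conn.execute("CREATE TABLE intermediate_distribution_centers (id INT, name TEXT)")
conn.executemany("INSERT INTO intermediate_inventory_items VALUES (?, ?, ?)",
                 [(1, 1, 5), (2, 1, 7), (3, 2, 4)])
conn.executemany("INSERT INTO intermediate_distribution_centers VALUES (?, ?)",
                 [(1, "Memphis"), (2, "Chicago")])

# Gold: join Silver tables, then aggregate to a business-level measure.
rows = conn.execute("""
    SELECT d.name, SUM(i.stock) AS total_stock
    FROM intermediate_inventory_items i
    JOIN intermediate_distribution_centers d ON i.distribution_center_id = d.id
    GROUP BY d.name
    ORDER BY d.name
""").fetchall()
print(rows)  # [('Chicago', 4), ('Memphis', 12)]
```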
Create Table Staging (BRONZE)

Staging Table Distribution Centers:
CREATE TABLE staging_distribution_centers (
  id INT PRIMARY KEY,
  name VARCHAR(255),
  latitude DECIMAL(9, 6),
  longitude DECIMAL(9, 6)
);

Staging Table Events:
CREATE TABLE staging_events (
  id INT PRIMARY KEY,
  user_id INT,
  sequence_number INT,
  session_id VARCHAR(255),
  created_at TIMESTAMP,
  ip_address VARCHAR(255),
  city VARCHAR(255),
  state VARCHAR(255),
  postal_code VARCHAR(10),
  browser VARCHAR(50),
  traffic_source VARCHAR(50),
  uri VARCHAR(255),
  event_type VARCHAR(50)
);

Staging Table Inventory Items:
CREATE TABLE staging_inventory_items (
  id INT PRIMARY KEY,
  product_id INT,
  created_at TIMESTAMP,
  updated_at TIMESTAMP,
  stock INT,
  product_cost DECIMAL(10, 2),
  product_code VARCHAR(50),
  product_name VARCHAR(255),
  brand_name VARCHAR(255),
  department VARCHAR(50),
  sku VARCHAR(50),
  distribution_center_id INT
);

Staging Table Order Items:
CREATE TABLE staging_order_items (
  id INT PRIMARY KEY,
  order_id INT,
  product_id INT,
  quantity INT,
  subtotal DECIMAL(10, 2)
);

Staging Table Orders:
CREATE TABLE staging_orders (
  id INT PRIMARY KEY,
  user_id INT,
  order_number VARCHAR(50),
  total DECIMAL(10, 2),
  payment_method VARCHAR(50),
  created_at TIMESTAMP,
  updated_at TIMESTAMP
);

Staging Table Users:
CREATE TABLE staging_users (
  id INT PRIMARY KEY,
  first_name VARCHAR(50),
  last_name VARCHAR(50),
  email VARCHAR(255),
  age INT,
  gender VARCHAR(1),
  state VARCHAR(50),
  street_address VARCHAR(255),
  postal_code VARCHAR(10),
  city VARCHAR(255),
  country VARCHAR(50),
  latitude DECIMAL(9, 6),
  longitude DECIMAL(9, 6),
  traffic_source VARCHAR(50),
  created_at TIMESTAMP
);

Staging Table Products:
CREATE TABLE staging_products (
  id INT PRIMARY KEY,
  product_code VARCHAR(50),
  product_name VARCHAR(255),
  brand_name VARCHAR(255),
  department VARCHAR(50),
  price DECIMAL(10, 2),
  sku VARCHAR(50),
  distribution_center_id INT
);
