Difference Between Data Warehousing and Data Mining: Data Warehouse Architecture Three-Tier Data Warehouse Architecture

Three-tier architecture is generally used for data warehouses, with three tiers: bottom tier is the data warehouse database server using tools for ETL; middle tier is the OLAP server implementing ROLAP or MOLAP; top tier is the front-end client layer with query and analysis tools. Detailed transactional data is stored separately from aggregated data and may be archived offline for reduced storage needs.

Uploaded by

priya

Data Warehouse Architecture

Three-Tier Data Warehouse Architecture

A data warehouse generally adopts a three-tier architecture. The three tiers are as follows.
• Bottom Tier − The bottom tier of the architecture is the data warehouse database server. It is a relational database system. Back-end tools and utilities feed data into the bottom tier; they perform the extract, clean, load, and refresh functions.
• Middle Tier − The middle tier is the OLAP server, which can be implemented in either of the following ways.
o Relational OLAP (ROLAP), an extended relational database management system that maps operations on multidimensional data to standard relational operations.
o Multidimensional OLAP (MOLAP), which directly implements multidimensional data and operations.
• Top Tier − This tier is the front-end client layer. It holds the query and reporting tools, analysis tools, and data mining tools.
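
As an illustration of how ROLAP maps a multidimensional operation onto relational ones, the sketch below expresses a roll-up (from product level to category level) as a JOIN with GROUP BY. It uses Python's built-in sqlite3 module; the table names (dim_product, fact_sales) and the sample rows are assumptions made for this example and do not come from any particular OLAP product.

```python
import sqlite3

# Hypothetical star-schema fragment: one fact table plus a product dimension.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, category TEXT);
    CREATE TABLE fact_sales  (product_id INTEGER, year INTEGER, amount REAL);
    INSERT INTO dim_product VALUES (1, 'Books'), (2, 'Books'), (3, 'Toys');
    INSERT INTO fact_sales VALUES
        (1, 2023, 10.0), (2, 2023, 5.0), (3, 2023, 7.0), (1, 2024, 4.0);
""")


def roll_up_by_category_and_year(connection):
    """Roll sales up from product level to category level, per year.

    The multidimensional roll-up is written as a relational
    JOIN + GROUP BY, the kind of mapping a ROLAP server performs.
    """
    return connection.execute("""
        SELECT p.category, f.year, SUM(f.amount)
        FROM fact_sales AS f
        JOIN dim_product AS p ON p.product_id = f.product_id
        GROUP BY p.category, f.year
        ORDER BY p.category, f.year
    """).fetchall()


cube_slice = roll_up_by_category_and_year(conn)
for category, year, total in cube_slice:
    print(category, year, total)
```

A MOLAP server would instead keep these totals in a multidimensional array structure and answer the same question without generating SQL.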

Difference Between Data Warehousing and Data Mining
A data warehouse is an environment where essential data from multiple sources is stored under a single schema and then used for reporting and analysis. It is a relational database designed for query and analysis rather than for transaction processing, and it usually contains historical data derived from transaction data.

A data warehouse is built to support management functions, whereas data mining is used to extract useful information and patterns from data. Data mining can be carried out on any traditional database, but because a data warehouse contains quality data, it is a good foundation for data mining. Data mining supports knowledge discovery by finding hidden patterns and associations, constructing analytical models, and performing classification and prediction.

Let us look at the difference between data warehousing and data mining in detail.

Key Features:

1. Data Warehouse:

The key features of a Data Warehouse are discussed below:

1. Subject-oriented: A data warehouse is subject-oriented because it provides knowledge around a subject rather than around the organization’s ongoing operations. These subjects can be products, customers, suppliers, sales, revenue, etc. A data warehouse focuses on modeling and analysis of data for decision making.

2. Integrated: A data warehouse is constructed by combining data from heterogeneous sources such as relational databases, flat files, etc.

3. Time-variant: The data in the data warehouse provides information with respect to a particular time period.

4. Non-volatile: Once data has been entered into the warehouse, it should not change.

Benefits of Data Warehouse:

1. Consistent and quality data

2. Cost reduction

3. More timely data access

4. Improved performance and productivity

2. Data Mining:

The key features of Data mining are discussed below:

1. Automatic discovery of patterns

2. Prediction of likely outcomes

3. Creation of actionable information

4. Focus on large data sets and databases

Benefits of data mining:

1. Direct marketing: The ability to predict who is most likely to be interested in which products.

2. Trend analysis: Understanding trends in the marketplace is a strategic advantage because it helps reduce costs and time to market.

3. Fraud detection: Data mining techniques can help discover which insurance claims, cellular phone calls, or credit card purchases are likely to be fraudulent.

4. Forecasting in financial markets: Data mining techniques are extensively used to help model financial markets.

[Figure: the three-tier architecture of a data warehouse]

Data Warehouse Models

From the perspective of data warehouse architecture, there are the following data warehouse models −
• Virtual Warehouse
• Data Mart
• Enterprise Warehouse

Virtual Warehouse
A view over an operational database is known as a virtual warehouse. A virtual warehouse is easy to build, but it requires excess capacity on the operational database servers.
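
A minimal sketch of the idea, assuming a single operational table: the "warehouse" is only a SQL view, so no data is copied, and every query against it runs on the operational server. The table orders and the view v_customer_revenue are hypothetical names chosen for this example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical operational table owned by an order-entry system.
    CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer TEXT,
                         status TEXT, total REAL);
    INSERT INTO orders VALUES
        (1, 'acme',   'shipped',   120.0),
        (2, 'acme',   'cancelled',  80.0),
        (3, 'zenith', 'shipped',    50.0);

    -- The virtual warehouse: a summary view over the operational data.
    CREATE VIEW v_customer_revenue AS
        SELECT customer, SUM(total) AS revenue
        FROM orders
        WHERE status = 'shipped'
        GROUP BY customer;
""")


def customer_revenue(connection):
    """Read the view; the aggregation is recomputed on every call."""
    return dict(connection.execute(
        "SELECT customer, revenue FROM v_customer_revenue"))


print(customer_revenue(conn))
```

Because the view is evaluated at query time, analytical queries compete with transactions for the same server, which is why spare capacity on the operational servers is needed.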

Data Mart
A data mart contains a subset of organization-wide data. This subset is valuable to specific groups within an organization.
In other words, data marts contain data specific to a particular group. For example, the marketing data mart may contain data related to items, customers, and sales. Data marts are confined to subjects.
Points to remember about data marts −
• Windows-based or Unix/Linux-based servers are used to implement data marts. They are implemented on low-cost servers.
• The implementation cycle of a data mart is measured in short periods of time, i.e., in weeks rather than months or years.
• The life cycle of a data mart may become complex in the long run if its planning and design are not organization-wide.
• Data marts are small in size.
• Data marts are customized by department.
• The source of a data mart is a departmentally structured data warehouse.
• Data marts are flexible.

Enterprise Warehouse
• An enterprise warehouse collects all the information and subjects spanning an entire organization.
• It provides enterprise-wide data integration.
• The data is integrated from operational systems and external information providers.
• The volume of this information can vary from a few gigabytes to hundreds of gigabytes, terabytes, or beyond.

Load Manager
This component performs the operations required for the extract and load process.
The size and complexity of the load manager vary between specific solutions, from one data warehouse to another.

Load Manager Architecture

The load manager performs the following functions −
• Extract the data from the source system.
• Fast-load the extracted data into a temporary data store.
• Perform simple transformations into a structure similar to the one in the data warehouse.

Extract Data from Source
The data is extracted from the operational databases or from external information providers. Gateways are the application programs used to extract data. A gateway is supported by the underlying DBMS and allows a client program to generate SQL to be executed at a server. Open Database Connectivity (ODBC) and Java Database Connectivity (JDBC) are examples of gateways.

Fast Load
• To minimize the total load window, the data needs to be loaded into the warehouse in the fastest possible time.
• Transformations affect the speed of data processing.
• It is more effective to load the data into a relational database before applying transformations and checks.
• Gateway technology is not suitable here, since gateways tend not to perform well when large data volumes are involved.
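
The load-first, transform-later ordering can be sketched as follows. This is only an illustration using Python's built-in sqlite3 module: the staging table stg_sales, the target table sales, and the sample rows are assumptions for the example, and a production load manager would normally rely on the database vendor's bulk loader rather than row inserts.

```python
import sqlite3


def fast_load(connection, raw_rows):
    """Bulk-insert untransformed rows into a staging table, committing once."""
    connection.execute(
        "CREATE TABLE IF NOT EXISTS stg_sales "
        "(sold_on TEXT, store TEXT, amount TEXT)")
    with connection:  # one commit at the end, rollback on error
        connection.executemany(
            "INSERT INTO stg_sales VALUES (?, ?, ?)", raw_rows)


def transform_in_database(connection):
    """After the load, convert types and drop rows with no amount, in SQL."""
    connection.execute(
        "CREATE TABLE IF NOT EXISTS sales "
        "(sold_on TEXT, store TEXT, amount REAL)")
    with connection:
        connection.execute("""
            INSERT INTO sales
            SELECT sold_on, store, CAST(amount AS REAL)
            FROM stg_sales
            WHERE amount <> ''
        """)


conn = sqlite3.connect(":memory:")
fast_load(conn, [("2024-01-01", "A", "10.50"),
                 ("2024-01-01", "B", ""),
                 ("2024-01-02", "A", "3")])
transform_in_database(conn)
loaded = conn.execute("SELECT * FROM sales ORDER BY sold_on").fetchall()
print(loaded)
```

Keeping the load step free of per-row logic is what keeps the load window short; the checks then run as set-based SQL inside the database.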

Simple Transformations
While loading, it may be necessary to perform simple transformations. Once these are complete, we are in a position to do the complex checks. Suppose we are loading EPOS sales transactions; we need to perform the following checks:
• Strip out all the columns that are not required within the warehouse.
• Convert all the values to the required data types.
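
The two checks above can be sketched in a few lines of Python. The column names and the sample EPOS record are hypothetical; the point is only that one mapping can both select the columns the warehouse needs and say how each value is converted.

```python
from datetime import date

# Hypothetical warehouse layout: column name -> converter to the target type.
WAREHOUSE_COLUMNS = {
    "sold_on": date.fromisoformat,
    "store_id": int,
    "amount": float,
}


def simple_transform(record):
    """Keep only the warehouse columns and convert each to its target type."""
    return {name: convert(record[name])
            for name, convert in WAREHOUSE_COLUMNS.items()}


# A raw EPOS record with two columns the warehouse does not need.
raw = {"sold_on": "2024-03-01", "store_id": "17", "amount": "9.99",
       "till_operator": "jsmith", "receipt_footer": "Thank you!"}
clean = simple_transform(raw)
print(clean)
```

A value that cannot be converted raises ValueError, and a missing column raises KeyError, so bad records surface at this step rather than later in the warehouse.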

Warehouse Manager
A warehouse manager is responsible for the warehouse management process. It consists of third-party system software, C programs, and shell scripts.
The size and complexity of a warehouse manager vary between specific solutions.

Warehouse Manager Architecture

A warehouse manager includes the following −
• The controlling process
• Stored procedures or C with SQL
• Backup/recovery tool
• SQL scripts

Operations Performed by Warehouse Manager

• Analyzes the data to perform consistency and referential integrity checks.
• Creates indexes, business views, and partition views against the base data.
• Generates new aggregations and updates existing aggregations. Generates normalizations.
• Transforms and merges the source data into the published data warehouse.
• Backs up the data in the data warehouse.
• Archives the data that has reached the end of its captured life.
Note − A warehouse manager also analyzes query profiles to determine whether the indexes and aggregations are appropriate.
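
Two of these operations, the referential integrity check and the generation of an aggregation with its index, might look like the sketch below. It uses Python's built-in sqlite3 module; the schema (dim_store, fact_sales, agg_sales_by_region) and the data are assumptions for illustration, not part of any specific warehouse manager.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE dim_store  (store_id INTEGER PRIMARY KEY, region TEXT);
    CREATE TABLE fact_sales (store_id INTEGER, amount REAL);
    INSERT INTO dim_store  VALUES (1, 'North'), (2, 'South');
    INSERT INTO fact_sales VALUES (1, 10.0), (1, 5.0), (2, 8.0), (9, 1.0);
""")


def orphan_fact_rows(connection):
    """Referential integrity check: fact rows whose store is not in dim_store."""
    return connection.execute("""
        SELECT f.store_id, f.amount
        FROM fact_sales AS f
        LEFT JOIN dim_store AS d ON d.store_id = f.store_id
        WHERE d.store_id IS NULL
    """).fetchall()


def refresh_region_aggregate(connection):
    """Rebuild the per-region summary table and index it."""
    connection.execute("DROP TABLE IF EXISTS agg_sales_by_region")
    connection.execute("""
        CREATE TABLE agg_sales_by_region AS
        SELECT d.region, SUM(f.amount) AS total
        FROM fact_sales AS f
        JOIN dim_store AS d ON d.store_id = f.store_id
        GROUP BY d.region
    """)
    connection.execute(
        "CREATE INDEX idx_agg_region ON agg_sales_by_region (region)")
    connection.commit()


orphans = orphan_fact_rows(conn)
refresh_region_aggregate(conn)
totals = conn.execute(
    "SELECT region, total FROM agg_sales_by_region ORDER BY region").fetchall()
print(orphans, totals)
```

Here the row for store 9 is reported as an orphan and is left out of the aggregate because the inner join drops it; what to do with such rows (reject, quarantine, or repair) is a policy decision for the warehouse.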

Query Manager
• The query manager is responsible for directing queries to the suitable tables.
• By directing queries to the appropriate tables, the speed of querying and response generation can be increased.
• The query manager is also responsible for scheduling the execution of the queries posed by users.
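
One simple way to picture query redirection is a catalogue that lists which columns each table can answer for, ordered from most aggregated to most detailed. The sketch below is an assumption-laden toy: the table names, the column sets, and the route_query function are all hypothetical, and real query managers also take statistics and query profiles into account.

```python
# Hypothetical catalogue, ordered from most aggregated (cheapest to scan)
# to most detailed.
TABLES = [
    ("agg_sales_by_region", {"region", "total"}),
    ("agg_sales_by_store_day", {"region", "store_id", "sold_on", "total"}),
    ("fact_sales", {"region", "store_id", "sold_on", "product_id", "total"}),
]


def route_query(required_columns):
    """Return the first (most aggregated) table that covers every column."""
    needed = set(required_columns)
    for table_name, columns in TABLES:
        if needed <= columns:
            return table_name
    raise LookupError(f"no table provides columns {sorted(needed)}")


print(route_query(["region", "total"]))
print(route_query(["store_id", "total"]))
print(route_query(["product_id", "total"]))
```

A query that needs only region totals is sent to the small summary table, while a query that needs product-level detail falls through to the fact table.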

Query Manager Architecture

The architecture of a query manager includes the following:
• Query redirection via C tool or RDBMS
• Stored procedures
• Query management tool
• Query scheduling via C tool or RDBMS
• Query scheduling via third-party software

Detailed Information
Detailed information is not kept online; instead, it is aggregated to the next level of detail and then archived to tape. The detailed information part of the data warehouse keeps the detailed information in the starflake schema. Detailed information is loaded into the data warehouse to supplement the aggregated data.
The following diagram shows a pictorial impression of where detailed information is
stored and how it is used.

Note − If detailed information is held offline to minimize disk storage, we should make sure that the data has been extracted, cleaned up, and transformed into the starflake schema before it is archived.
