0% found this document useful (0 votes)
34 views53 pages

PSZI BSC 04 - Eng - 2025.03.06

The document discusses the importance of Business Intelligence (BI) systems in conjunction with Enterprise Resource Planning (ERP) systems, highlighting their roles in data integration and analysis. It covers various aspects of data warehousing, including OLAP vs. OLTP, multidimensional data modeling, and different types of data warehouses. Additionally, it addresses challenges in BI implementation and the evolution of reporting tools from traditional methods to modern solutions like Power BI.

Uploaded by

timi kovács
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
34 views53 pages

PSZI BSC 04 - Eng - 2025.03.06

The document discusses the importance of Business Intelligence (BI) systems in conjunction with Enterprise Resource Planning (ERP) systems, highlighting their roles in data integration and analysis. It covers various aspects of data warehousing, including OLAP vs. OLTP, multidimensional data modeling, and different types of data warehouses. Additionally, it addresses challenges in BI implementation and the evolution of reporting tools from traditional methods to modern solutions like Power BI.

Uploaded by

timi kovács
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

FINANCE AND ACCOUNTING

INFORMATICS
Finance and Accounting
Informatics

Multidimensional data modelling and


2025

Business Intelligence (BI)

Video Lecture
Presented by: Gábor Dragonya
[email protected]

Created by: László SZÍVÓS Phd FCCA


[email protected]
ERP
(Enterprise Resource Planning)
systems
Enterprise information systems of the 1980s

3
Advantages and disadvantages of implementing a new ERP

+ -
DATA INTEGRATION COMPLEX AND TIME-CONSUMING TASK

„ONE VERSION OF TRUTH” WITHIN THE


FEAR AND RESISTANCE AGAINST CHANGE
ORGANIZATION

REAL-TIME DATA ACCESS TECHNOLOGICAL COMPLEXITY

UNIFIED PROCESS AND DATA MODEL PROCESS MANGEMENT CAN BE HARMFUL

INDUSTRY-STANDARD BENCHMARK PROCESSES


4
Multidimensional data modelling
and
Business Intelligence (BI)
Why do we need a BI system if we have already
implemented an ERP?
MAIN GOAL OF AN ERP
ERPs help us to integrate all the different organizational functions of a company and to bring the
silo effect to an end by establishing a unified, centralized data architecture.

ENTERPRISE RESOURCE PLANNING BUSINESS INTELLIGENCE


• ERPs collect and process all data arising withing an • The main responsibility of a BI system is to analyze
organization. data and present information.
• ERPs help us to have more organized and effective • BI systems help us to interpret and understand the
business processes. performance of the company.
• ERPs mostly focus on what happened in the past. • BI systems place a high emphasis on the future and
on what is going to happen.

TRANSACTION-ORIENTED BUSINESS INSIGHT-ORIENTED 6


Business Intelligence

In 1958, IBM researcher Hans


Peter Luhn defined BI as “the
ability to apprehend the
interrelationships of presented
facts in such a way as to guide
action towards a desired goal”.

Peter Luhn, father of Business Intelligence


7
Business Intelligence

In 1989, Howard Dresner (later a Gartner


analyst) proposed business intelligence as an
umbrella term to describe "concepts and
methods to improve business decision
making by using fact-based support
systems." It was not until the late 1990s that
this usage was widespread.

Howard Dresner, Gartner analyst 8


Business Intelligence

Business intelligence (BI) is a set of methodologies, processes,


architectures, and technologies that transform raw data into
meaningful and useful information. It allows business users to make
informed business decisions with real-time data that can put.

Technologies used: data integration, data quality assurance, data


warehouses, masterdata management, text- data- and content mining

9
Data warehouse
Data warehouse is a subject-orientated, integrated, time variant,
non-volatile collection of data in support of management's decision-
making process.

Main charachteristics:
• Subject oriented
• Integrated
• Permament
• Time-dependency

William H. (Bill) Immon


father of data warehousing 10
OLAP vs OLTP

OLTP (Online Transactional Processing) is a type of data processing that


executes transaction-focused tasks. It involves inserting, deleting, or
updating small quantities of database data.

OLAP (Online Analytical Processing) performs multidimensional


analysis of business data and provides the capability for complex
calculations, trend analysis, and sophisticated data modeling.

11
Data warehousing
Data warehousing is the process of constructing and using a data
warehouse. A data warehouse is constructed by integrating data
from multiple heterogeneous sources that support analytical
reporting, structured and/or ad hoc queries, and decision
making. Data warehousing involves data cleansing, data
integration, and data consolidations.

1) Extract data from transactional databases


2) Transform data for data analysis
3) Analysis of data and presentation information for decision-
makers
12
Data warehouse - Architecture

OLTP OLAP
13
OLAP verus OLTP
CHARACHTERISTIC OLTP OLAP
Orientation Transaction processing Data analysis
Users Operational staff, administrators Decision-makers and analysts, financial
controllers
Tasks Record, extract and modify daily Business decision support
transactions
Database Relational database Subject-oriented, snowflake or start schema
Entity relationship diagram
Data Current state of data Past and current data
Level of aggregation Transactional data, not aggregated Aggregated data

View of data Relational Multidimensional


Users’ access Record and read Extract only, no modification in dataset

Scope Record and modification Process information


Number of records to Low Several million
be processed
Number of users High Low

Priority Reliability Flexibility, self-service


14
Types of data warehouse
1) Virtual 2) Centralized

15
Types of data warehouse
3) Two-tier 4) Hybrid

A data mart is limited to a single focus


for one line of business; a data
warehouse is typically enterprise-wide
and ranges across multiple areas.
16
Multidimensional data model
A multidimensional model views data in the form of a data-cube. A data
cube enables data to be modeled and viewed in multiple dimensions. It is
defined by dimensions and facts.
The dimensions are the perspectives or entities concerning which an
organization keeps records. For example, a shop may create a sales data
warehouse to keep records of the store's sales for the dimension time, item,
and location.
A multidimensional data model is organized around a central theme, for
example, sales. This theme is represented by a fact table. Facts are numerical
measures. The fact table contains the names of the facts or measures of the
related dimensional tables.
17
Multidimensional data model - data cube

Fact: SALES
Components of a data cube:
• Fact
• Dimension
• Dimension measure
• Dimension hierarchy
• Attribute
• Cell

18
Multidimensional data model - data cube
SALES

INVENTORY RECEIPT

SALES ORDER

Time Product Shop Campaign Warehouse Employee Customer

19
Multidimensional data model – snowflake
scheme

CAMPAIGN_DIM PRODUCT_DIM

Multidimensional
model: presenting
ORDER_FACT
business transactions in
SALESPERSON_DIM CUSTOMER_DIM
way that enables analysis

DAY_DIM

20
Multidimensional data model – star schema (1)
Star Schema in data warehouse,
in which the center of the star can
have one fact table and a number
of associated unnormalized
dimension tables.
It is known as star schema as its
structure resembles a star. The
Star Schema data model is the
simplest type of data warehouse
schema.
In all dimension tables there is a
simple (not composit) primary
key that appears as a foreign key
in the fact table. 21
Multidimensional data model – star schema (2)
Fact table holds the data to be
analyzed. The fact table describes
historical transactions that remain
constant.

The fact table usually contains huge


data volume.

Fact data is always read-only data that


cannot be modify in the data-cube.

There shall be at least one


summarisable value in each fact table.

22
Multidimensional data model – star schema (3)
Dimension table stores data about the
ways in which the data in the fact table
can be analyzed.

The dimension tables’ fields are used


as conditions in the data warehouse
queries.

In the star schema we improve query


performance by putting the query
criterion in denormalized dimension
tables.

23
Multidimensional data model – star schema (4)

Dimension tables have a surrogate key


column that is the primary key of that
dimension. A fact table may use these
dimension surrogate IDs as foreign
keys to the dimension table.

24
Multidimensional data model – snowflake
schema

Normalized

Snowflake Schema in data warehouse


is a logical arrangement of tables in a
Denormalized multidimensional database such that
the ER diagram resembles a snowflake
shape. A Snowflake Schema is an
extension of a Star Schema, and it adds
additional dimensions. The dimension
tables are normalized which splits data
into additional tables. 25
Multidimensional data model

SalesLoction (SLID, SLname, city, county, street,……)


Product (ProdID, ProdName, UnitPrice,…)
Sales (ProdID, SLID, date, quantity, category,…..)

Example: we would like to present the sales of the last


two month for a specific county.

26
Data cube operations
1. Slice and dice
2. Drill down
3. Drill up
4. Drill cross
5. Unfold
6. Fold
7. Pivot
27
Multidimensional data model - data cube

Components of a data cube:


• Fact
• Dimension
• Dimension measure
• Attribute
• Cell
• Dimension hierarchy

28
1. Slice and dice
The slice operation creates
a sub-cube by selecting a
single dimension from the
main OLAP cube.
The dice operation isolates
a sub-cube by selecting
several dimensions within
the main OLAP cube.

29
2. Drill down
The drill-down operation
converts less-detailed data
into more-detailed data
through one of two
methods—moving down in
the concept hierarchy or
adding a new dimension to
the cube.

30
3. Drill up
Drill up is the opposite of the
drill-down function—it
aggregates data on an OLAP
cube by moving up in the
concept hierarchy or by
reducing the number of
dimensions.

31
4. Drill across
Sales + Customer
Sales Customer complaints complaints

+ =
Time Time Time

Product Product Product

The operation drill across reconciles cells from


several data cubes which share the same scheme.

32
5. Unfold
Sales Location Sales Location

SALES SALES Time

- Value - Value
- Time

Product Product

33
6. Fold
Sales Location
Sales Location

Time SALES
SALES
- Value
- VALUE - Date

Product
Product

34
Data warehousing in business
- Decision support: seeing aggregated data, trends and spotting
recurring or unusual patterns in data
- Data mining: is the process of analyzing hidden patterns of data
according to different perspectives for categorization into useful
information, which is collected and assembled in common areas, such
as data warehouses, for efficient analysis, data mining algorithms,
facilitating business decision making and other information
requirements to ultimately cut costs and increase revenue.
- Common data mining techniques: cluster analyses, classification and
regression trees, and neural networks
35
Data warehouse, data mart, data lake
DATA WAREHOUSE DATA MART DATA LAKE

A data warehouse is the core analytics A data mart is very similar to a data A data lake stores an organization’s raw and
system of an organization. The data warehouse. Like a data warehouse, the data processed (unstructured and structured) data
warehouse will frequently work in mart will maintain and house cleaned data at both large and small scales. Unlike a data
conjunction with an operational data store ready for analysis. However, unlike a data warehouse or database, a data lake captures
(OLTP) to ‘warehouse’ data captured by the warehouse, the scope of visibility is limited. anything the organization deems valuable for
various databases used by the business. future use.

• Stores large quantities of historical data so old • Focuses on one subject matter or business unit • Collects all data from many disparate data
data is not erased when new data is updated sources over an extended period
• Acts as a mini-data warehouse, holding
• Captures data from multiple, disparate aggregated data • Meets the needs of various users in the
databases • Data is limited in scope organization
• Works with ODS to house normalized, cleaned • It is uploaded without an established
• Often uses a star schema or similar structure
data methodology
• Reports and dashboards use the data from the
• Organized by subject • Processes and cleans data and stores it in the
data mart
data lake
• OLAP (online analytical processing) application
• The primary data source for data analytics
• Reports and dashboards use data from data
warehouses

36
A Gartner Magic Quadrant: Who are the leader vendors in Business
Intelligence?
Microsoft Power Bi (2013)

Tableau (2003)

QlikSense / QlikView (1993)

37
Implementation of BI systems
How does the information need of users change the
landscape of business analysis?

39
Requirements for a successful BI implementation

FLEXIBLE
APPLICATION

INSIGHT CUSTOMI-
CENTRIC ZATION

BUSINESS USER
RELIABLE DATA
FOCUS

40
Five challanges of BI implementation projects
CHALLANGES

1. We don’t really know what we should


measure.

2. We have data but there is a huge uncertainty


about the reliability and consistency of data.

3. We have data and the BI platform is available


but there is no one who can write codes.

4. We don’t receive answers for our questions


in time.

5. The system is not scalable with the growth of


the organization.
Changes in business reporting

Self-service
Action
oriented

Continuous
and instant
Dynamic

Histor-
ical data
Static
Business Intelligence EXTENDED AI BASED
Top (BI)
down

Traditional 42
Definition of dashboards

„A dashboard is a visual display of the most important


information needed to achieve one or more objectives
that has been consolidated and arranged on a single
screen so the information can be monitored at a glance.„

Stephen Few
Operational dashboard: aims at tracking on-going operations in a real-time
manner
Analytical dashboard: aims at supporting decision-making based on historical
data
Strategic dashboard: aims at conveying information on the organiztaion as a
whole from the viewpoint of the key performance indicators.
Power BI
Traditional and modern management reporting

ERP
Processing in
Excel
CRM, SCM, MES (Pivot, Chart stb.)
.csv,
.xls,
.txt
Other
tables,systems Manual
data
extraction
in case of any update

Static reports that can only be updated if we go back to the „drawing board”.
Reports cannot be changed flexibly by users.
Reporting evolution at Microsoft: from Excel to Power BI

Manual
Pivot Power Power
reporting
tables Pivot Query
in Excel
Traditional and modern reporting

Dynimic reporting with real-time up-dates.


Reporting evolution at Microsoft: from Excel to Power BI

• Data import from a great variety of


sources.

• Built-in integration settings.

• Splendid visualization tools.

• Real-time report up-dates.

• RLS for securing data privacy and


confidentiality.

• Powerful analysis toolkit.

• Wide options for sharing reports.


Excel versus Power BI: Which is better?
POWER BI EXCEL
Learning curve Not so easy to use (Power Query, Almost everyone knows it.
Power Pivot, DAX etc.)
Costs Power BI Desktop is free, however Part of Office 365
min. 10 EUR/user if we want to share
reports
Flexibility Low Almost everything is possible.
Visualisation Wide variety of visualisation. Only limited.
Dashboard High level (filters, slicers, reporting Limited (filters and slicers)
interactivity filters, dimension filters)
Size of data 1 GB – 100 GB 1 million row, 16 thousand
columns
Formula language DAX -
Sharing mobil, web, laptop, tablet e-mail
Thanks for
your
attention
Presented by: Gábor Dragonya
Q&A [email protected]

Created by: László SZÍVÓS Phd FCCA


[email protected]

You might also like