
Chapter 4 Database and Data Governance

This document provides an overview of Huawei Cloud's database services and data governance strategies. It covers various database types, their applications, advantages of cloud-native databases, and features of both relational and non-relational databases. Additionally, it discusses data processing challenges and governance solutions offered by DataArts Studio.

Uploaded by

adhamayad000

Huawei Cloud Tech Essentials

Chapter 4:
Database and Data Governance

Huawei Cloud | Grow with Intelligence


Objectives
Upon completion of this chapter, you will have learned about:

⚫ The databases of Huawei Cloud.
⚫ Database application scenarios.
⚫ Data governance.

Part 1:
Database



Besides OBS and EVS, are there any other services that can store data?

Different Data Types

Unstructured data
• Documents, packages, and audio/video files
• Stored in OBS or EVS

Structured data
• Fully structured (stored in Relational Database Service, RDS):

  Book Name            Published Year   Author
  The Great Gatsby     1925             F. Scott Fitzgerald
  Lord of the Rings    1968             J. R. R. Tolkien

• Semi-structured (stored in Document Database Service, DDS):

  {
    Year: 2018,
    Author: {
      Byliner: J. R. R. Tolkien,
      Drafter: Internet pal
    }
  }

Relational Database vs Non-Relational Database

Application model
  Relational: Traditional applications require databases that can process transactions, complex queries, and associated queries, and maintain data integrity.
  Non-relational: New Internet applications require databases that can store massive data and handle a large number of concurrent requests.

Access performance
  Relational: Within 10 ms, depending on query complexity and the amount of stored data.
  Non-relational: Milliseconds. The amount of stored data is irrelevant.

Data type
  Relational: Fully structured.
  Non-relational: Semi-structured (key/value pairs and documents).

Scaling mode
  Relational: Vertical, or horizontal (primary DB + read replicas).
  Non-relational: Vertical, or horizontal (cluster + read replicas).

Data consistency
  Relational: Strong consistency.
  Non-relational: Depending on database configurations, strong consistency or eventual consistency.

Storage space limit
  Relational: Depending on the DB engine.
  Non-relational: Depending on the number of deployed DB nodes.

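The contrast between associated queries on fully structured data and key-based lookups on semi-structured documents can be sketched with the Python standard library. This is only an illustration: the table names, the sample rows, and the dict standing in for a document store are assumptions, and nothing here calls an RDS or DDS API.

```python
import sqlite3

# Relational side: fully structured rows, with an associated (join) query.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE authors (id INTEGER PRIMARY KEY, name TEXT NOT NULL);
    CREATE TABLE books (
        id INTEGER PRIMARY KEY,
        title TEXT NOT NULL,
        published_year INTEGER,
        author_id INTEGER NOT NULL REFERENCES authors(id)
    );
    INSERT INTO authors VALUES (1, 'F. Scott Fitzgerald');
    INSERT INTO books VALUES (1, 'The Great Gatsby', 1925, 1);
    """
)
relational_result = conn.execute(
    "SELECT b.title, a.name FROM books b "
    "JOIN authors a ON a.id = b.author_id "
    "WHERE b.published_year < 1950"
).fetchall()

# Non-relational side: each record is a self-contained document found by key.
# A plain dict stands in for the document store (hypothetical).
document_store = {
    "book:1": {
        "title": "The Great Gatsby",
        "year": 1925,
        "author": {"name": "F. Scott Fitzgerald"},
    },
}
document_result = document_store["book:1"]["author"]["name"]

print(relational_result)
print(document_result)
```

The relational query cost depends on the join and the filter, while the document lookup goes straight to one key, which is the trade-off the comparison describes.
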
Diversified Databases

No single service can meet all requirements.

• Document Database Service (DDS)
• Distributed Cache Service (Redis)
• GaussDB(for NoSQL)
• RDS for PostgreSQL
• Relational Database Service (RDS)
• GaussDB(for MySQL)

Advantages of Cloud Native

[Figure: the same stack of responsibilities compared across three deployment modes: traditional DC deployment, cloud host deployment, and cloud-native deployment. Green: Huawei Cloud's responsibilities. Blue: customer's responsibilities.]

Layers shown for each deployment mode, from top to bottom:
• Database optimization
• Database development
• Backup and restoration
• High availability
• Database engine management
• Patch and configuration management
• Software system
• OS
• Device maintenance
• Power supply, cooling, and cabinets
• Buildings, locations

RDS Advantages

Relational Database Service (RDS)

• A service that manages database engines and resources
• Compatible with mainstream relational database engines
  ◼ MySQL, PostgreSQL, SQL Server
• A cloud-native service that facilitates maintenance
• Built-in automated backup and patching
• HA configuration and automatic switchover/failover within seconds
• Read replicas
• Security certifications

Features of Backup

01 Backups are stored in OBS and can be restored to local disks.
02 Automated Binlog backup and manual backup are supported.
03 Binlog backups can be restored to a specific point in time.

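Restoring to a specific point in time can be pictured as taking the latest full backup at or before the target time and replaying the logged changes that follow it. The sketch below is a conceptual model only, not how RDS implements restoration. `FullBackup`, `BinlogEvent`, and `restore_to_point_in_time` are hypothetical names, and timestamps are plain integers.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class FullBackup:
    taken_at: int      # time of the backup (hypothetical integer timestamp)
    snapshot: dict     # key -> value state captured at that time


@dataclass(frozen=True)
class BinlogEvent:
    at: int                 # time of the change
    key: str
    value: Optional[int]    # None models a delete


def restore_to_point_in_time(backups, events, target):
    """Rebuild the state as of `target` from a full backup plus logged changes."""
    candidates = [b for b in backups if b.taken_at <= target]
    if not candidates:
        raise ValueError("no full backup at or before the target time")
    base = max(candidates, key=lambda b: b.taken_at)
    state = dict(base.snapshot)
    # Replay only the changes made after the backup and up to the target time.
    for event in sorted(events, key=lambda e: e.at):
        if base.taken_at < event.at <= target:
            if event.value is None:
                state.pop(event.key, None)
            else:
                state[event.key] = event.value
    return state


backups = [FullBackup(100, {"a": 1}), FullBackup(200, {"a": 2, "b": 5})]
events = [
    BinlogEvent(150, "a", 2),
    BinlogEvent(180, "b", 5),
    BinlogEvent(220, "b", 6),
    BinlogEvent(250, "a", None),
]
print(restore_to_point_in_time(backups, events, 230))
```
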
Building a Simple Architecture Using a Database Service

• A database service, for example RDS, functions as a database resource manager.
• Applications access the database in the original way.
• DB engines and versions are selected while tenants are creating databases.

[Figure: Elastic Load Balance (ELB), two application servers, one RDS database service, and backups in OBS.]

However, this deployment architecture is not safe.

Primary/Standby Deployment of RDS

• If the primary DB instance fails, workloads can be automatically switched to the standby DB instance.
  – The switchover duration varies depending on your DB engine. A switchover can be performed within seconds.
• Primary and standby DB instances can be deployed in different AZs to enable cross-AZ disaster recovery (DR).
• You can complete maintenance operations for the standby database and then perform a switchover to ensure service continuity.
• Primary/standby deployment increases costs.

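The switchover behavior described for primary/standby deployment can be modeled in a few lines of Python. This is a toy model for intuition, not the RDS failover mechanism: `DBInstance`, `PrimaryStandbyPair`, and the `healthy` flag are hypothetical, and real health detection and data synchronization are left out.

```python
from dataclasses import dataclass


@dataclass
class DBInstance:
    name: str
    az: str
    healthy: bool = True


class PrimaryStandbyPair:
    """Two instances in different AZs; only the primary serves workloads."""

    def __init__(self, primary: DBInstance, standby: DBInstance):
        self.primary = primary
        self.standby = standby

    def switchover(self) -> None:
        # Swap roles. This is also how planned maintenance can be handled:
        # maintain the standby first, then switch over.
        self.primary, self.standby = self.standby, self.primary

    def ensure_available(self) -> DBInstance:
        # Automatic failover: promote the standby if the primary is down.
        if not self.primary.healthy:
            if not self.standby.healthy:
                raise RuntimeError("both instances are unavailable")
            self.switchover()
        return self.primary


pair = PrimaryStandbyPair(DBInstance("db-a", "az1"), DBInstance("db-b", "az2"))
serving_before = pair.ensure_available().name
pair.primary.healthy = False          # simulate a failure of the primary
serving_after = pair.ensure_available().name
print(serving_before, serving_after)
```
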
HA Architecture Based on Primary/Standby Deployment

• This architecture eliminates the impact of a single point of failure (SPOF).
  – High availability is achieved.
  – Service continuity can be guaranteed even if a node fails.
• This is a typical way to use AZs.
  – The RDS service deployed in primary/standby mode provides complete functions in either of the AZs.

[Figure: Elastic Load Balance (ELB), two app servers, RDS (primary), RDS (standby), and backups in OBS.]

Other Huawei Database Services

GaussDB(for MySQL)

• Huawei DB engine
• Compatible with MySQL
• Several times higher performance than that of native MySQL databases
• Strong backup and restoration capabilities achieved with decoupled storage and compute
• High availability in cross-AZ deployment

DDM Simplifies Database Use

Distributed Database Middleware (DDM) is a middleware service that works with RDS to horizontally expand database capacity, so that regular users can use distributed databases as easily as a single-node database.

➢ SQL computing and transaction processing
➢ Horizontal database scaling
➢ Unlimited scaling of storage and compute resources
➢ Single-node MySQL database experience

[Figure: the service system connects to DDM, which sits in front of multiple RDS instances. For the service system, it is still like using a single-node database.]

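Horizontal scaling behind a single entry point can be illustrated with a small hash-based router. This is a sketch of the general idea only and does not reflect DDM's actual sharding rules or API. `ShardRouter` and the shard names are hypothetical, and in-memory dicts stand in for the RDS instances.

```python
import hashlib


class ShardRouter:
    """Routes each key to one shard so callers see a single logical store."""

    def __init__(self, shard_names):
        if not shard_names:
            raise ValueError("at least one shard is required")
        self._shards = list(shard_names)
        # One in-memory dict per shard stands in for a database instance.
        self._data = {name: {} for name in self._shards}

    def shard_for(self, key: str) -> str:
        # A stable hash keeps the same key on the same shard across runs.
        digest = hashlib.sha256(key.encode("utf-8")).digest()
        index = int.from_bytes(digest[:8], "big") % len(self._shards)
        return self._shards[index]

    def put(self, key: str, row: dict) -> None:
        self._data[self.shard_for(key)][key] = row

    def get(self, key: str):
        return self._data[self.shard_for(key)].get(key)

    def rows_per_shard(self) -> dict:
        return {name: len(rows) for name, rows in self._data.items()}


router = ShardRouter(["rds-0", "rds-1"])
for i in range(100):
    router.put(f"user-{i}", {"id": i})

print(router.get("user-42"))
print(router.rows_per_shard())
```

The caller only uses `put` and `get`; which instance holds a row is decided by the router, which mirrors the single-node experience the slide describes.
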
What Will Happen If There Are a Large Number of Concurrent Data Writes?

GaussDB(for Mongo) Features

• Huawei-developed DB engine
• Compatible with MongoDB clients
• Compute resources scaled within minutes and storage within seconds
• Cross-AZ HA

If necessary, you can select DDS instances that use the MongoDB engine.

Features of Non-Relational Databases

01 There are no logical relationships between the data records in a collection.
02 Each data record is a structured document.
03 Users can locate data using key values rather than through complex queries.

Precautions for Using Non-Relational Databases

• Define data keys and indexes for faster data location to avoid having to scan entire collections.
  – Do not use hot keys.
  – Keep data records simple.
  – Data can be located in milliseconds no matter how much data there is.
• If necessary, use optimistic locks for transactions in the application layer.
• Use clusters if there is too much traffic.
  – Select an appropriate sharding algorithm to locate data.
  – Distribute data access across all cluster nodes and avoid hot partitions.

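An application-layer optimistic lock is commonly built on a version number: a write succeeds only if the record still has the version that was read. The sketch below shows that pattern with an in-memory store. `DocumentStore`, `VersionConflict`, and `add_stock` are hypothetical names, and a real database would perform the version check as a conditional update on the server side.

```python
class VersionConflict(Exception):
    """Raised when a record changed between the read and the write."""


class DocumentStore:
    def __init__(self):
        self._docs = {}  # key -> (version, document)

    def insert(self, key, doc):
        self._docs[key] = (1, dict(doc))

    def read(self, key):
        version, doc = self._docs[key]
        return version, dict(doc)  # copy so callers cannot mutate the store

    def update_if_version(self, key, expected_version, new_doc):
        current_version, _ = self._docs[key]
        if current_version != expected_version:
            raise VersionConflict(f"{key}: expected v{expected_version}")
        self._docs[key] = (current_version + 1, dict(new_doc))
        return current_version + 1


def add_stock(store, key, delta, max_retries=3):
    """Read, modify, and write back; retry if another writer got in first."""
    for _ in range(max_retries):
        version, doc = store.read(key)
        doc["stock"] += delta
        try:
            return store.update_if_version(key, version, doc)
        except VersionConflict:
            continue
    raise VersionConflict(f"{key}: gave up after {max_retries} attempts")


store = DocumentStore()
store.insert("sku-1", {"stock": 10})
stale_version, _ = store.read("sku-1")
new_version = add_stock(store, "sku-1", 5)
print(new_version, store.read("sku-1"))
```

No lock is held while the application works on the record, so conflicts cost a retry instead of blocking other writers.
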
HA Architecture Using Non-Relational Databases

• GaussDB(for Mongo) is an alternative to RDS.
  – The type of database to use depends on the type of data and how it will be accessed.
• GaussDB(for Mongo) clusters provide high availability.

[Figure: ELB, two app servers, a GaussDB(for Mongo) cluster, and backups in OBS.]

Part 2:
Data Governance



Data Processing Challenges

Large volume
• Terabytes to petabytes

Variety
• Structured and unstructured
• OBS and database
• Logs, transactions, and clickstreams

High velocity
• High data ingestion bandwidth
• Rapid data processing

Typical Data Processing Method

[Figure: data passes through collection, storage, analysis and processing, and visualization to produce insight.]

Data Collection

Transaction data (structured transaction information)
• RDS database
• GaussDB(for Mongo) cluster

Files, logs, and raw data
• OBS

Stream data, IoT data, and clickstreams
• Distributed Message Service (DMS) for Kafka
• Data Ingestion Service (DIS)

Governing Data in Lakes

But is it really good to have so many choices for storing data?

Challenges:
1. Where is data stored?
2. What is the data format?

Avoid data silos:
1. How to avoid repetition and conflict?
2. How to enable cross-component data access?
3. How to transfer data from SaaS?
4. ...

DataArts Studio Helps You Govern Data

• Starts from data governance planning
• Data import
• One platform for data development and governance
• Data quality evaluation based on planning
• Data asset management and metadata splitting

Thank you.

Bring digital to every person, home, and organization for a fully connected, intelligent world.

Copyright © 2024 Huawei Technologies Co., Ltd. All Rights Reserved.

The information in this document may contain predictive statements including, without limitation, statements regarding the future financial and operating results, future product portfolio, new technology, etc. There are a number of factors that could cause actual results and developments to differ materially from those expressed or implied in the predictive statements. Therefore, such information is provided for reference purpose only and constitutes neither an offer nor an acceptance. Huawei may change the information at any time without notice.
