EMC - Greenplum Architecture, Administration, and
Implementation
Code: 4448
Lengt h: 5 days
URL: View Online
In this course, you will learn to design and implement the Greenplum environment and gain the information
needed to install, congure, and manage the Greenplum database system. You will be introduced to the
Greenplum environment, consisting of the Greenplum Database and supported systems. You will learn the
fundamental concepts on data warehousing, business intelligence, and how Greenplum helps to solve business
problems in managing and analyzing big data. You will evaluate logical models and business requirements to
determine the best physical design for a Greenplum database.
Skills Gained
Greenplum features, benets, and architecture in terms of shared nothing and the Massively Parallel
Processing (MPP) design
Support redundancy and high availability with Greenplum
Data models used in data warehousing and how data is stored in Greenplum
Install, initialize, validate, and congure Greenplum Database
Manage database objects and workload management processes by dening and managing roles, privileges,
and resource queues
Use table partitioning as a design methodology for handling large tables
Load data into a Greenplum database instance using external tables, SQL copy, insert commands, and parallel
load utilities
Use data manipulation language and data query language to access, manage, and query data
Perform system administrative tasks, including managing and checking the state of the Greenplum database,
its data, and the distribution of data
Perform backup and restoration of Greenplum data
Distribute and store data in Greenplum using a distribution key and partitioning
Use EXPLAIN and EXPLAIN ANALYZE to help the Greenplum query plan optimizer determine how to handle a
submitted query
Improve query performance by keeping statistics up to date and tuning the database for sampling size and
error conditions
Determine when it's best to use an index and what type of index to use
Improve query performance by following a number of performance enhancement tips
Who Can Benet
This course is intended for any person who presently or plans to:
Install Greenplum Database
Design and develop for Greenplum Database implementation
Administer and manage the Greenplum Database
Course Details
1. Greenplum Fundamental Concepts
Basics of Data Warehousing
Greenplum Concepts, Features, and Benets
Greenplum Architecture
Shared Nothing and MPP Implementation
2. Database Installation and Initialization
Systems Preparation and Verication
Greenplum Database Initialization
3. Greenplum Database Tools, Utilities, and Internals
PSQL Client and Greenplum Utilities
Greenplum Performance Monitor
Greenplum Database Server Conguration
Greenplum Database Internals
4. Dening and Securing the User Database
Data Denition Language
Data Manipulation and Data Query Language
Roles, Privileges, and Resources
5. Data Loading and Distribution
Data Loading
Table Partitioning
6. Database Management and Archiving
Managing the Greenplum Database
Backup and Restores
7. Data Modeling and Design
Data Modeling
Physical Design Decisions
8. Performance Analysis and Tuning
JOIN Tables
Types and Methods
Database Tuning
EXPLAIN Plan
Analyzing Queries
Improve Performance with Statistics
Indexing Strategies
9. Developing Reports Using Advanced SQL
This course includes labs designed to allow practical experience for the participant.
ExitC ertied C orporation and iMVP are registered tradem arks of ExitC ertied ULC and Generated Aug 12 8:55:30 UTC 2017
ExitC ertied C orporation and Tech Data C orporation, respectively
C opyright 2017 Tech Data C orporation and ExitC ertied ULC & ExitC ertied C orporation.
All Rights Reserved.