Tutorial Load
Tutorial Load
Tutorial
© 2019 SAP SE or an SAP affiliate company. All rights reserved.
1 Documentation changes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Tutorial
2 PUBLIC Content
4.9 Ensuring that the Job Server is running. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .46
4.10 Executing the job. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
4.11 Summary and what to do next. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
Tutorial
Content PUBLIC 3
8.1 Adding the SalesFact job, work flow, and data flow. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82
8.2 Creating the SalesFact data flow. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82
8.3 Defining the details of the Query transform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .83
8.4 Using a lookup_ext function for order status. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85
8.5 Validating the SalesFact data flow. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
8.6 Executing the SalesFact job. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
8.7 Viewing Impact and Lineage Analysis for the SALES_FACT target table. . . . . . . . . . . . . . . . . . . . . . .91
8.8 Summary and what to do next. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93
Tutorial
4 PUBLIC Content
Adding the conditional. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 125
Specifying the If-Then work flows. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126
11.5 Creating the script that updates the status. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126
11.6 Verify the job setup. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128
11.7 Executing the job. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .129
11.8 Data Services automated recovery properties. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129
11.9 Summary and what to do next. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130
Tutorial
Content PUBLIC 5
13.5 Repopulating the material dimension table. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 173
Adding the material dimension job, work flow, and data flow. . . . . . . . . . . . . . . . . . . . . . . . . . . 173
Adding ABAP data flow to Material Dimension job. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 174
Defining the DF_SAP_MtrlDim ABAP data flow. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 175
Executing the JOB_SAP_MtrlDim job. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 178
13.6 Repopulating the Sales Fact table. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 179
Adding the Sales Fact job, work flow, and data flow. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 180
Adding ABAP data flow to Sales Fact job. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .180
Defining the DF_ABAP_SalesFact ABAP data flow. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181
Executing the JOB_SAP_SalesFact job. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 187
13.7 Summary. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 188
Tutorial
6 PUBLIC Content
1 Documentation changes
The following table contains changes to the documentation, and the related SAP Data Services version in which
the changes were made. The list begins with the most recent changes.
Added the following topics: Standard topics in all Data Services 4.2 SP 9 Patch 1
documents.
● Welcome
● SAP information resources
Removed the following topics: Topics were under Product overview. 4.2 SP 8 Patch 2
Removed because the concepts were 4.2 SP 9
● System configurations
for advanced users.
● Windows and UNIX implementa
tion
Removed: Environment requirements Topic was under Preparation for this tu 4.2 SP 8 Patch 2
torial. Removed because the concept 4.2 SP 9
was for advanced users.
Removed the following topics: Topics were under Setting up for the tu 4.2 SP 8 Patch 2
torial. Removed because an administra 4.2 SP 9
● BI platform and the Central Man
tor should perform the tasks.
agement Server (CMS)
● Opening the Central Management
Console
● Installing SAP Data Services
● Verifying the Windows service
● Creating a new Data Services user
account
Tutorial
Documentation changes PUBLIC 7
Change Notes Version
Renamed the topic Setting up for the Topic is located under Preparation for 4.2 SP 8 Patch 2
tutorial to Tasks required to prepare for this tutorial. 4.2 SP 9
the tutorial.
Added the following topics to the Tuto These topics make it easier for readers 4.2 SP 8 Patch 2
rial: to find additional documents that we 4.2 SP 9
reference in other topics.
● SAP Information resources
● Accessing documentation from the
Web
● Documentation set for SAP Data
Services
Removed all Terminology topics These topics were not consistent, and 4.2 SP8 Patch 2
many terms were used in more than
one section.
Tutorial
8 PUBLIC Documentation changes
2 Introduction to the tutorial
This tutorial introduces you to the basic use of SAP Data Services Designer by explaining key concepts and
providing a series of related exercises and sample data.
Data Services Designer is a graphical user interface (GUI) development environment in which you extract,
transform, and load batch data from flat-file and relational database sources for use in a data warehouse. You
can also use Designer for real-time data extraction and integration.
The tutorial is for users experienced in many areas of database management, SQL, and Microsoft Windows.
The tutorial introduces core SAP Data Services Designer functionality. We wrote the tutorial assuming that you
have experience in some of the following areas:
After you complete this tutorial, you will be able to extract, transform, and load data from various source and
target types, and understand the concepts and features of SAP Data Services Designer.
You will know about the various Data Services objects such as datastores and transforms, and you will be able
to define a file format, import data, and analyze data results.
You will learn how to use Data Services Designer features and functions to do the following:
Tutorial
Introduction to the tutorial PUBLIC 9
2.3 Product overview
Data Services extracts, transforms, and loads (ETL) data from heterogeneous sources into a target database or
data warehouse. You specify data mappings and transformations by using Data Services Designer.
Data Services combines industry-leading data quality and integration into one platform. It transforms your
data in many ways. For example, it standardizes input data, adds additional address data, cleanses data, and
removes duplicate entries.
Data Services provides additional support for real time data movement and access. It performs predefined
operations in real time, as it receives information. The Data Services real time components also provide
services to Web applications and other client applications.
For a complete list of Data Services resources, see the Designer Guide.
Job Server Application that launches the Data Services processing engine and serves as an interface to the engine
and other components in the Data Services suite.
Engine Executes individual jobs that you define in the Designer to effectively accomplish the defined tasks.
Tutorial
10 PUBLIC Introduction to the tutorial
Component Description
Repository Database that stores Designer predefined system objects and user-defined objects including source and
target metadata and transformation rules. Create a local repository and then a central repository to
share objects with other users and for version control.
Access Server Passes messages between Web applications and the Data Services Job Server and engines. Provides a
reliable and scalable interface for request-response processing.
Administrator Web administrator that provides the following browser-based administration of Data Services resources:
The following diagram illustrates Data Services product components and relationships.
Tutorial
Introduction to the tutorial PUBLIC 11
Related Information
Use the many tools in SAP Data Services Designer to create objects, projects, data flows, and workflows to
process data.
The Designer interface contains key work areas that help you set up and run jobs. The following illustration
shows the key areas of the Designer user interface.
Tutorial
12 PUBLIC Introduction to the tutorial
Related Information
SAP Data Services objects are entities that you create, add, define, modify, or work with in the software.
Each Data Services object has similar characteristics for creating and configuring objects.
Characteristic Description
Properties Text that describes the object. For example, the name, de
scription, and creation date describes aspects of an object.
Attributes Properties that organize objects and make them easier for
you to find. For example, organize objects by attributes such
as object types.
The Designer contains a Local Object Library that is divided by tabs. Each tab is labeled with an object type.
Objects in a tab are listed in groups. For example, the Project tab groups projects by project name and further
by job names that exist in the project.
● Projects
● Jobs
● Workflows
● Data flows
● Transforms
● Datastores
● Formats
● Functions
Related Information
Tutorial
Introduction to the tutorial PUBLIC 13
2.3.3.1 Object hierarchy
Object relationships are hierarchical.
The highest object in the hierarchy is the project. The subordinate objects appear as nodes under a project. You
add subordinate objects to the project in a specific order. For example, A project contains jobs, jobs contain
workflows, and workflows contain data flows.
The following diagram shows the hierarchical relationships for the key object types within Data Services.
Tutorial
14 PUBLIC Introduction to the tutorial
Related Information
A project is the highest-level object in Designer hierarchy. Projects provide a way to organize the subordinate
objects, which are jobs, workflows, and data flows.
A project is open when you can view it in the project area. If you open a different project from the Project tab in
the object library, the project area closes the current project and shows the project that you just opened.
A work flow specifies the order in which SAP Data Services processes subordinate data flows.
Arrange the subordinate data flows under the work flow so that the output from one data flow is ready for input
to the intended data flow.
Tutorial
Introduction to the tutorial PUBLIC 15
A work flow is a reusable object. It executes only within a Job. Use work flows to:
The Data Services objects you can use to create work flows appear as icons on the tool palette to the right of
the workspace. If the object isn't applicable to what you have open in the workspace, the software disables the
icon. The following table contains the programming analogy of each object to describe the role the object plays
in the work flow.
Procedure
Workflow
Data flows process data in the order in which they are arranged in a work flow.
A data flow defines the basic task that Data Services accomplishes. The basic task is moving data from one or
more sources to one or more target tables or files.
You define data flows by identifying the sources from which to extract data, the transformations that the data
should undergo, and the targets.
Tutorial
16 PUBLIC Introduction to the tutorial
Use data flows to:
A data flow is a reusable object. It is always called from a work flow or a job.
A consistent naming convention for Data Services objects helps you easily identify objects listed in an object
hierarchy.
Datastore DS ODS_DS
Related Information
To delete an object, first decide whether to delete the object from the project or delete the object from the
repository.
When you delete an object from a project in the project area, the software removes the object from the project.
The object is still available in the object library and the repository.
Tutorial
Introduction to the tutorial PUBLIC 17
When you delete the object from the object library, the software deletes all occurrences of the object from the
repository. If the object is called in separate data flows, the software deletes the object from each data flow.
The deletion may adversly affect all related objects.
To protect you from deleting objects unintentionally, the software issues a notice before it deletes the object
from the repository. The notice states that the object is used in multiple locations, and it provides the following
options:
● Yes: Continues with the delete of the object from the repository.
● No: Discontinues the delete process.
● View Where Used: Displays a list of the related objects in which the object will be deleted.
The preparation may include some steps that your administrator has already completed. You may need to
contact your administrator for important connection information and access information related to those
tasks.
We have a complete documentation set for SAP Data Services available on our User Assistance Customer
Portal. If you are unclear about a process in the tutorial, or if you don't understand a concept, refer to the online
documentation at http://help.sap.com/bods.
Note
If your administrator has already completed these steps, you may be able to skip the tutorial set up section.
You must have sufficient user permission to perform the exercises in the tutorial. For information about
permissions, see the Administrator Guide.
Tutorial
18 PUBLIC Introduction to the tutorial
You or an administrator sets up your system for this tutorial. Instructions for administrator-only tasks are not
included in the tutorial. The following table lists each task and who performs the task.
Install Central Management Server (CMS) by installing either Administrator. More information in the Installation Guide.
the SAP BusinessObjects Business Intelligence platform (BI
platform) or the Information platform services platform (IPS
platform).
Install SAP Data Services Administrator. Steps are in the Installation Guide.
Create user account for tutorial participants. Administrator. Steps are in the Administrator Guide.
Create tutorial repository, source, and target databases You or a user who has permission to perform these tasks in
your RDBMS. Steps are in the tutorial.
Establish the tutorial repository as your local repository by Administrator or you, if you have sufficient permission.
using the Repository Manager, the Server Manager, and the Steps are in the tutorial.
Central Management Console (CMC)
Run the tutorial scripts to create source and target tables. Administrator or you. Steps are in the tutorial.
1. Creating repository, source, and target databases on an existing RDBMS [page 20]
Create the three databases using your preferred RDBMS.
2. Creating a local repository [page 21]
Use the repository database that you created earlier in your RDBMS to create a local repository.
3. Defining a job server and associating your repository [page 21]
Use the Data Services Server Manager to configure a new job server and associate the job server with
the local repository.
4. Configuring the local repository in the CMC [page 22]
To continue preparing the SAP Data Services local repository, you enter connection information in the
Central Management Console (CMC)
5. Running the provided SQL scripts [page 23]
Run the tutorial SQL scripts to create the sample source and target tables.
Related Information
Tutorial
Introduction to the tutorial PUBLIC 19
2.4.1.1 Creating repository, source, and target databases
on an existing RDBMS
An administrator, or a user with sufficient permissions to your RDBMS must perform these steps.
4. Grant access privileges for the user account. For example, grant connect and resource roles for Oracle.
5. Use the following table as a worksheet to note the connection names, database versions, user names, and
passwords for the three databases that you create. We refer you to this information in several of the
exercises in the tutorial.
Database version
User name
Password
Task overview: Tasks required to prepare for the tutorial [page 18]
Tutorial
20 PUBLIC Introduction to the tutorial