0% found this document useful (0 votes)
34 views4 pages

Ds 4

Top 50 Datastage Interview Questions

Uploaded by

prasanna
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
34 views4 pages

Ds 4

Top 50 Datastage Interview Questions

Uploaded by

prasanna
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

26) Explain Usage Analysis in Datastage?

In Datastage, Usage Analysis is performed within few clicks. Launch Datastage


Manager and right click the job. Then, select Usage Analysis and that’s it.

27) How do you find the number of rows in a sequential file?

To find rows in sequential file, we can use the System variable @INROWNUM.

28) Differentiate between Hash file and Sequential file?

The only difference between the Hash file and Sequential file is that the Hash file
saves data on hash algorithm and on a hash key value, while sequential file
doesn’t have any key value to save the data. Basis on this hash key feature,
searching in Hash file is faster than in sequential file.

29) How to clean the Datastage repository?

We can clean the Datastage repository by using the Clean Up Resources


functionality in the Datastage Manager.

30) How a routine is called in Datastage job?

In Datastage, routines are of two types i.e. Before Sub Routines and After Sub
Routines. We can call a routine from the transformer stage in Datastage.

31) Differentiate between Operational Datastage (ODS) and Data


warehouse?

We can say, ODS is a mini data warehouse. An ODS doesn’t contain information
for more than 1 year while a data warehouse contains detailed information
regarding the entire business.

32) NLS stands for what in Datastage?

NLS means National Language Support. It can be used to incorporate other


languages such as French, German, and Spanish etc. in the data, required for
processing by data warehouse. These languages have same scripts as English
language.

33) Can you explain how could anyone drop the index before loading the
data in target in Datastage?
In Datastage, we can drop the index before loading the data in target by using
the Direct Load functionality of SQL Loaded Utility.

34) Does Datastage support  slowly changing dimensions ?

Yes. Version 8.5 + supports this feature

35) How can one find bugs in job sequence?

We can find bugs in job sequence by using DataStage Director.

36) How complex jobs are implemented in Datstage to improve


performance?

In order to improve performance in Datastage, it is recommended, not to use


more than 20 stages in every job. If you need to use more than 20 stages then it
is better to use another job for those stages.

37) Name the third party tools that can be used in Datastage?

The third party tools that can be used in Datastage, are Autosys, TNG and Event
Co-ordinator. I have worked with these tools and possess hands on experience of
working with these third party tools.

38) Define Project in Datastage?

Whenever we launch the Datastage client, we are asked to connect to a


Datastage project. A Datastage project contains Datastage jobs, built-in
components and Datastage Designer or User-Defined components.

39) How many types of hash files are there?

There are two types of hash files in DataStage i.e. Static Hash File and Dynamic
Hash File. The static hash file is used when limited amount of data is to be loaded
in the target database. The dynamic hash file is used when we don’t know the
amount of data from the source file.

40) Define Meta Stage?

In Datastage, MetaStage is used to save metadata that is helpful for data lineage
and data analysis.
41) Have you have ever worked in UNIX environment and why it is useful in
Datastage?

Yes, I have worked in UNIX environment. This knowledge is useful in Datastage


because sometimes one has to write UNIX programs such as batch programs to
invoke batch processing etc.

42) Differentiate between Datastage and Datastage TX?

Datastage is a tool from ETL (Extract, Transform and Load) and Datastage TX is a
tool from EAI (Enterprise Application Integration).

43) What is size of a transaction and an array means in a Datastage?

Transaction size means the number of row written before committing the records
in a table. An array size means the number of rows written/read to or from the
table respectively.

44) How many types of views are there in a Datastage Director?

There are three types of views in a Datastage Director i.e. Job View, Log View and
Status View.

45) Why we use surrogate key?

In Datastage, we use Surrogate Key instead of unique key. Surrogate key is


mostly used for retrieving data faster. It uses Index to perform the retrieval
operation.

46) How rejected rows are managed in Datastage?

In the Datastage, the rejected rows are managed through constraints in


transformer. We can either place the rejected rows in the properties of a
transformer or we can create a temporary storage for rejected rows with the help
of REJECTED command.

47) Differentiate between ODBC and DRS stage?

DRS stage is faster than the ODBC stage because it uses native databases for
connectivity.

48) Define Orabulk and BCP stages?


Orabulk stage is used to load large amount of data in one target table of Oracle
database. The BCP stage is used to load large amount of data in one target table
of Microsoft SQL Server.

49) Define DS Designer?

The DS Designer is used to design work area and add various links to it.

50) Why do we use Link Partitioner and Link Collector in Datastage?

In Datastage, Link Partitioner is used to divide data into different parts through
certain partitioning methods. Link Collector is used to gather data from various
partitions/segments to a single data and save it in the target table.

You might also like