Welcome to Scribd, the world's digital library. Read, publish, and share books and documents. See more
Download
Standard view
Full view
of .
Look up keyword or section
Like this
207Activity
0 of .
Results for:
No results containing your search query
P. 1
All DataStage FAQs and Tutorials

All DataStage FAQs and Tutorials

Ratings:

4.95

(19)
|Views: 9,675 |Likes:
Published by oraveen
DataStage ETL Tool - FAQs and Tutorials
DataStage ETL Tool - FAQs and Tutorials

More info:

Published by: oraveen on Oct 01, 2008
Copyright:Attribution Non-commercial

Availability:

Read on Scribd mobile: iPhone, iPad and Android.
download as PDF, TXT or read online from Scribd
See more
See less

08/07/2013

pdf

text

original

 
Email: oraveen@gmail.com ☻Page 1 of 210☻
DATASTAGE FAQ’s & TUTORIAL’sINDEX
1.
 
DATASTAGE QUESTIONS...........................................................................22.
 
DATASTAGE FAQ........................................................................................133.
 
TOP 10 FEATURES IN DATASTAGE HAWK .........................................294.
 
DATASTAGE NOTES...................................................................................315.
 
DATASTAGE TUTORIAL...........................................................................426.
 
LEARN FEATURES OF DATASTAGE......................................................487.
 
PERFORMANCE TUNING IN PARALLEL ENVIRONMENTS............808.
 
INFORMATICA vs DATASTAGE...............................................................839.
 
BEFORE YOU DESIGN YOUR APPLICATION......................................9410.
 
DATASTAGE 7.5x1 GUI FEATURES.......................................................10111.
 
DATASTAGE & DWH INTERVIEW QUESTIONS...............................10412.
 
DATASTAGE ROUTINES..........................................................................11613.
 
SET_JOB_PARAMETERS_ROUTINE.....................................................177
Version 1.4Prepared by:Raveen Ollalwar Email:oraveen@gmail.com
 
Email: oraveen@gmail.com ☻Page 2 of 210☻
DATASTAGE QUESTIONS
1. What is the flow of loading data into fact & dimensional tables?
A)
Fact table
- Table with Collection of Foreign Keys corresponding to the PrimaryKeys in Dimensional table. Consists of fields with numeric values.
Dimension table
- Table with Unique Primary Key.
Load
- Data should be first loaded into dimensional table. Based on the primary keyvalues in dimensional table, the data should be loaded into Fact table.
2. What is the default cache size? How do you change the cache size if needed?
A. Default cache size is
256 MB
. We can increase it by going into DatastageAdministrator and selecting the Tunable Tab and specify the cache size over there.
3. What are types of Hashed File?
A) Hashed File is classified broadly into 2 types.a)
Static
- Sub divided into 17 types based on Primary Key Pattern. b)
Dynamic
- sub divided into 2 typesi) Generic ii) Specific.Dynamic files do not perform as well as a well, designed static file, but do perform better than a badly designed one. When creating a dynamic file you can specify the followingAlthough all of these have default values)By Default Hashed file is "Dynamic - Type Random 30 D"
4. What does a Config File in parallel extender consist of?
A) Config file consists of the following.a) Number of Processes or Nodes. b) Actual Disk Storage Location.
5. What is Modulus and Splitting in Dynamic Hashed File?
A. In a Hashed File, the size of the file keeps changing randomly.If the size of the file increases it is called as "Modulus".If the size of the file decreases it is called as "Splitting".
6. What are Stage Variables, Derivations and Constants?A. Stage Variable
- An intermediate processing variable that retains value during readand doesn’t pass the value into target column.
Derivation
- Expression that specifies value to be passed on to the target column.
Constant
- Conditions that are either true or false that specifies flow of data with a link.
7. Types of views in Datastage Director?
There are 3 types of views in Datastage Director a)
Job View
- Dates of Jobs Compiled. b)
Log View
- Status of Job last run
 
Email: oraveen@gmail.com ☻Page 3 of 210☻
c)
Status View
- Warning Messages, Event Messages, Program Generated Messages.
8. Types of Parallel Processing?
A) Parallel Processing is broadly classified into 2 types.a)
SMP
- Symmetrical Multi Processing. b)
MPP
- Massive Parallel Processing.
9. Orchestrate Vs Datastage Parallel Extender?
A) Orchestrate itself is an ETL tool with extensive parallel processing capabilities andrunning on UNIX platform. Datastage used Orchestrate with Datastage XE (Beta versionof 6.0) to incorporate the parallel processing capabilities. Now Datastage has purchasedOrchestrate and integrated it with Datastage XE and released a new version Datastage 6.0i.e Parallel Extender.
10. Importance of Surrogate Key in Data warehousing?
A) Surrogate Key is a Primary Key for a Dimension table. Most importance of using it isit is independent of underlying database. i.e. Surrogate Key is not affected by the changesgoing on with a database.
11. How to run a Shell Script within the scope of a Data stage job?
A) By using "ExcecSH" command at Before/After job properties.
12. How to handle Date conversions in Datastage? Convert a mm/dd/yyyy format toyyyy-dd-mm?
A) We use a) "Iconv" function - Internal Conversion. b) "Oconv" function - External Conversion.Function to convert mm/dd/yyyy format to yyyy-dd-mm isOconv(Iconv(Filedname,"D/MDY[2,2,4]"),"D-MDY[2,2,4]")
13 How do you execute datastage job from command line prompt?
A) Using "dsjob" command as follows.dsjob -run -jobstatus projectname jobname
14. Functionality of Link Partitioner and Link Collector?Link Partitioner:
It actually splits data into various partitions or data flows usingvarious partition methods.
Link Collector:
It collects the data coming from partitions, merges it into a single dataflow and loads to target.
15. Types of Dimensional Modeling?
A) Dimensional modeling is again sub divided into 2 types.a) Star Schema - Simple & Much Faster. Denormalized form. b) Snowflake Schema - Complex with more Granularity. More normalized form.
16. Differentiate Primary Key and Partition Key?

Activity (207)

You've already reviewed this. Edit your review.
1 hundred reads
1 thousand reads
krmchinna liked this
talk2parimi liked this
seenuguddu liked this
ajax248590 liked this
SatishRamabhotla liked this
addubey2 liked this
Rahul Jain liked this

You're Reading a Free Preview

Download
/*********** DO NOT ALTER ANYTHING BELOW THIS LINE ! ************/ var s_code=s.t();if(s_code)document.write(s_code)//-->