
Teradata Questions and Answers:

1. What is the difference between FastLoad and MultiLoad?


FastLoad uses multiple sessions to quickly load large amounts of data into an
empty table.
MultiLoad is used for high-volume maintenance on tables and views. It also works
with non-empty tables. A maximum of 5 tables can be used in MultiLoad.
2. Which is faster?
FastLoad.
3. What is the difference between an inner join and an outer join?
An inner join gets data from both tables where the specified data exists in both
tables.
An outer join gets data from the source table at all times, and returns data from
the outer joined table ONLY if it matches the criteria.
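A minimal sketch of the difference, run against an in-memory SQLite database rather than Teradata. The `customer` and `loan` tables and their rows are made up for illustration; the join syntax shown is standard SQL.

```python
import sqlite3

# Hypothetical sample data: customer 2 (Bob) has no loan.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customer (cust_id INTEGER, name TEXT);
    CREATE TABLE loan (cust_id INTEGER, amount INTEGER);
    INSERT INTO customer VALUES (1, 'Ann'), (2, 'Bob'), (3, 'Cid');
    INSERT INTO loan VALUES (1, 500), (3, 900);
""")

# Inner join: only customers that have a matching loan row.
inner_rows = conn.execute("""
    SELECT c.name, l.amount
    FROM customer c INNER JOIN loan l ON c.cust_id = l.cust_id
    ORDER BY c.cust_id
""").fetchall()

# Left outer join: every customer; loan columns are NULL when there is no match.
outer_rows = conn.execute("""
    SELECT c.name, l.amount
    FROM customer c LEFT OUTER JOIN loan l ON c.cust_id = l.cust_id
    ORDER BY c.cust_id
""").fetchall()

print(inner_rows)  # [('Ann', 500), ('Cid', 900)]
print(outer_rows)  # [('Ann', 500), ('Bob', None), ('Cid', 900)]
```

Bob appears only in the outer join result, with `None` standing in for the missing loan amount.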
4. What is multi Insert?
Inserting data records into tables using multiple INSERT statements in a single
request. It is achieved by putting a semicolon in front of the keyword INSERT in
the next statement rather than terminating the first statement with a semicolon.
Insert into Sales select * from customer
; Insert into Loan select * from customer;
5. Is multi insert ANSI standard?
No.
6. How do you create a table with the structure of an existing table, with data
and with no data?
Create table Customerdummy as Customer with data / with no data;
7. What is the opening step in a Basic Teradata Query (BTEQ) script?
.Logon tdpid/username, password
8. You are calling a BTEQ script, which drops a table and creates a table. It
will throw an error if the table does not exist. How can you do it without
throwing the error?
You can do it by setting the error level to zero before dropping and resetting the
error level to 8 after dropping.
You can do it like this:
.SET ERRORLEVEL (3807) SEVERITY 0;
DROP TABLE EMPLOYEE;
.SET ERRORLEVEL (3807) SEVERITY 8;
10. Can you FastExport a field which is a primary key by putting equality on
that key?
No.
11. Did you write stored procedures in Teradata?
No, because they become a single-AMP operation and my company didn't
encourage that.
12. What is the use of having indexes on a table?
For faster record search.
13. Did you use Queryman or SQL Assistant?
SQL Assistant 6.1
14. I am updating a table in BTEQ. It has to update a large number of rows,
so it's really slow. What do you suggest?
In Teradata it is not recommended to update more than 1 million rows due to
journal space problems. If it is less than that and it's slow in BTEQ, you might
want to add a COLLECT STATISTICS statement before the UPDATE statement.
15. Is it necessary to add a QUIT statement after a BTEQ query when I am
calling it in a Unix environment?
Not necessary, but it is good to add a QUIT statement after a query.
16. There is a column with a date in it. If I want to get just the month, how
can it be done? Can I use substring?
Substring is used with char fields, so it cannot be used. To extract the month
from a date column, use e.g. select extract (month from <column name>). The same
works for year or day, or for hour or minute if it's a timestamp
(select extract (minute from <column name>)).
Confidential property of Gantec Corporation.
17. What's the syntax of substring?
SUBSTRING (string_expression FROM n1 [FOR n2]), or the Teradata form
SUBSTR (string_expression, n1 [, n2])
18. Did you use the CASE WHEN statement? Can you tell us a little about it?
Yes. It is used when a result has to be selected depending upon the value of an
expression.
19. While creating a table my DBA has FALLBACK or NO FALLBACK in his DDL.
What is that?
FALLBACK requests that a second copy of each row inserted into a table be stored
on another AMP in the same cluster. This protects the data when an AMP goes down
or a disk fails.
20. My table got locked during MLOAD due to a failed job. What do I do to
perform other operations on it?
Use RELEASE MLOAD. It removes access locks from the target tables in
Teradata. It must be entered from BTEQ and not from MultiLoad.
To proceed, you can do RELEASE MLOAD <table name>
21. How to find duplicates in a table?
Group by those fields, e.g. select id, count(*) from table group by id having
count(*) > 1
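The GROUP BY / HAVING pattern above can be tried out with a small sketch. It uses an in-memory SQLite database as a stand-in for Teradata, and the `employee` table and its rows are hypothetical.

```python
import sqlite3

# Hypothetical table in which id 2 occurs three times.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE employee (id INTEGER, name TEXT);
    INSERT INTO employee VALUES
        (1, 'Ann'), (2, 'Bob'), (2, 'Bob'), (3, 'Cid'), (2, 'Bob');
""")

# Group on the candidate key and keep only groups with more than one row.
dupes = conn.execute("""
    SELECT id, COUNT(*) AS n
    FROM employee
    GROUP BY id
    HAVING COUNT(*) > 1
""").fetchall()

print(dupes)  # [(2, 3)]
```

Each result row gives the duplicated value and how many times it occurs.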
22. How do you verify a complicated SQL?
I use the EXPLAIN statement to check if the query is doing what I wanted it to do.
23. How many tables can you join in V2R5?
Up to 64 tables.
24. Did you ever use the UPPER function?
The UPPER function is used to convert all characters in a column to the same
characters in upper case.
25. What does the LOWER function do?
The LOWER function is used to convert all characters in a column to lower case
characters.
26. How do you see the DDL for an existing table?
By using the SHOW TABLE command.
27. Which is more efficient, GROUP BY or DISTINCT, to find duplicates?
With more duplicates GROUP BY is more efficient; if only a few duplicates exist
DISTINCT is more efficient.
28. Syntax for the CASE WHEN statement?
CASE value_expression_1 WHEN value_expression_n THEN scalar_expression_n
END;
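As an illustration of the valued CASE form, here is a short sketch run on an in-memory SQLite database. The `orders` table and the status codes are invented for the example; the CASE syntax itself is standard SQL.

```python
import sqlite3

# Hypothetical orders table with one-letter status codes.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (order_id INTEGER, status_code TEXT);
    INSERT INTO orders VALUES (1, 'N'), (2, 'S'), (3, 'X');
""")

# CASE compares status_code with each WHEN value and returns the matching
# THEN expression; ELSE covers values that match none of them.
rows = conn.execute("""
    SELECT order_id,
           CASE status_code
               WHEN 'N' THEN 'New'
               WHEN 'S' THEN 'Shipped'
               ELSE 'Unknown'
           END AS status
    FROM orders
    ORDER BY order_id
""").fetchall()

print(rows)  # [(1, 'New'), (2, 'Shipped'), (3, 'Unknown')]
```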
29. What's the difference between TIMESTAMP (0) and TIMESTAMP (6)?
TIMESTAMP (0) is CHAR (19) and TIMESTAMP (6) is CHAR (26).
Everything is the same except that TIMESTAMP (6) has microseconds too.
30. How do you determine the number of sessions?
The number of sessions depends on:
Teradata performance and workload.
Client platform type, performance and workload.
Channel performance for channel-attached systems.
Network topology and performance for network-attached systems.
Volume of data to be processed by the application.
31. What is a node? How many nodes and AMPs were used in your previous project?
A node is a database running in a server. We used 318 nodes and each node has 2
to 4 AMPs.
32. What is a clique?
A clique is a group of disk arrays physically cabled to a group of nodes.
33. Interviewer explained about their project (environment, nature of work)
Listen to them carefully so that at the end of the interview you can ask questions
about the project when you are given a chance to ask questions.
34. Tell us something about yourself?
Describe your project experience and technical skill sets, and that you are hard
working, a good team player, a self-learner and self-motivated.
35. What is the best project you ever worked on and why is it the best
project?
All the projects I have worked on so far are best projects. I treat every project
as equal and work hard for the success of the project.
36. What makes a project successful and how have you contributed to the
success of the project?
Good team members, technical knowledge of team members, hard work, sharing
knowledge among the team, and each individual's contribution to the project.
Explain to them that you possess all the skills you mentioned above.
37. Have you worked under stress and how did you handle it?
Yes. Many times, to deliver the project on schedule, we were under a lot of
pressure. During those times we worked extra hours and helped each other in the
team to deliver the project on schedule. Team effort is a key factor for the
success of the project.
38. What is the difference between FastLoad and MultiLoad?
Refer to the answer for question 1.
39. Have you used procedures?
No, I have not used procedures, but I have expert knowledge of writing
procedures. My company did not encourage me to write procedures because a
procedure becomes a single-AMP operation and as such is expensive in terms of
resources and time.
40. What is the purpose of indexes?
An index is a mechanism that can be used by the SQL query optimizer to make
table access more performant. Indexes enhance data access by providing a
more-or-less direct path to stored data and avoiding the necessity to perform
full table scans to locate the small number of rows you typically want to
retrieve or update.
41. What is primary index and secondary index?
The primary index is the mechanism for assigning a data row to an AMP and a
location on the AMP's disks. Indexes are also used to access rows from a table
without having to search the entire table.
Secondary indexes enhance set selection by specifying access paths less
frequently used than the primary index path. Secondary indexes are also used to
facilitate aggregate operations. If a secondary index covers a query, then the
Optimizer determines that it would be less costly to access its rows directly
rather than using it to access the base table rows it points to. Sometimes multiple
secondary indexes with low individual selectivity can be overlapped and bit
mapped to provide enhanced
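To make the idea of a primary index assigning rows to AMPs concrete, here is a rough sketch. It is only an illustration: MD5 and the modulo step are stand-ins chosen for the example, not Teradata's actual hashing algorithm or hash map, and `NUM_AMPS` and `amp_for` are hypothetical names.

```python
import hashlib
from collections import Counter

NUM_AMPS = 4  # hypothetical system size


def amp_for(primary_index_value, num_amps=NUM_AMPS):
    """Map a primary index value to an AMP number (illustrative stand-in only)."""
    digest = hashlib.md5(str(primary_index_value).encode("utf-8")).digest()
    row_hash = int.from_bytes(digest[:4], "big")  # pretend 32-bit row hash
    return row_hash % num_amps


# A unique column spreads rows over all AMPs.
rows_per_amp = Counter(amp_for(cust_id) for cust_id in range(1, 10001))

# A column with only two distinct values can reach at most two AMPs.
skewed = Counter(amp_for(gender) for gender in ["M", "F"] * 5000)

print(sorted(rows_per_amp.items()))
print(sorted(skewed.items()))
```

The same value always hashes to the same AMP, which is why a primary index with few distinct values leaves most AMPs idle in this model.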
42. Why are primary and secondary indexes used?
Refer to the answer for the previous question.
43. What are the things to be considered while creating a secondary index?
Creating a secondary index causes Teradata to build a sub-table to contain its
index rows, thus adding another set of rows that requires updating each time a
table row is inserted, deleted, or updated. Secondary index sub-tables are also
duplicated whenever a table is defined with FALLBACK, so the maintenance
overhead is effectively doubled.
44. What is collect statistics?
Collects demographic data for one or more columns of a table, hash index, or join
index, computes a statistical profile of the collected data, and stores the synopsis
in the data dictionary. The Optimizer uses the synopsis data when it generates its
table access and join plans.
45. Can we collect statistics on multiple columns?
Yes, we can collect statistics on multiple columns.
46. Can we collect statistics on table level?
Yes, we can collect statistics on table level. The syntax is COLLECT STAT ON
TAB_A;
47. What is inner join and outer join?
Refer to the answer for question 3.
48. When is TPump used instead of MultiLoad?
TPump provides an alternative to MultiLoad for the low-volume batch
maintenance of large databases under control of a Teradata system. Instead of
updating Teradata databases overnight, or in batches throughout the day, TPump
updates information in real time, acquiring every bit of data from the client
system with low processor utilization. It does this through a continuous feed of
data into the data warehouse, rather than the traditional batch updates.
Continuous updates result in more accurate, timely data. And, unlike most load
utilities, TPump uses row hash locks rather than table level locks. This allows you
to run queries while TPump is running. This also means that TPump can be
stopped instantaneously. As a result, businesses can make better decisions that
are based on the most current data.
49. What is spool space and when running a job if it reaches the maximum
spool space how do you solve the problem?
Spool space is used to hold intermediate rows during processing, and to hold the
rows in the answer set of a transaction. Spool space reaches maximum when the
query is not properly optimized. Use appropriate conditions in the WHERE clause
of the query to limit the answer set.
50. What is your level of expertise in using MS Office suite?
Expert level. I have been using it for the last 8 years for documentation.
51. Have you used NetMeeting?
Yes. I used NetMeeting for team meetings when members of the team were
geographically in different locations.
52. Do you have any questions?
What is the team size going to be? What is the current status of the project? What
is the project schedule?
53. What is your available date?
Immediate. Or your available date for the project.
54. How much experience do you have in MVS?
Intermediate. In my previous two projects I used MVS to submit JCL jobs.
55. Have you created a JCL script from scratch?
Yes. I have created JCL scripts from scratch while creating jobs in the
development environment.
56. Have you modified any JCL script and used it?
Yes, I have modified JCL scripts. In my previous projects many applications were
re-engineered, so the existing JCL scripts were modified according to the company
coding standards.
57. Rate yourself on using Teradata tools like BTEQ, Queryman, FastLoad,
MultiLoad and TPump.
Intermediate to expert level. I have been using them extensively for the last 4
years. I am also certified in Teradata.
58. Which is your favorite area in the project?
I enjoy working on every part of the project. I volunteer my time for my peers so
that I can also learn and contribute more towards the project's success.
59. What is a data mart?
A data mart is a special-purpose subset of enterprise data used by a particular
department, function or application. Data marts may have both summary and
detail data; however, usually the data has been pre-aggregated or transformed
in some way to better handle the particular type of requests of a specific user
community. Data marts are categorized as independent, logical and dependent
data marts.
60. Difference between star and snowflake schemas?
A star schema is de-normalized and a snowflake schema is normalized.
61. Why should you put your data warehouse in a different system other
than the OLTP system?
Relational Data Modeling (OLTP design) | Dimensional Data Modeling (OLAP design)
Data is stored in RDBMS | Data is stored in RDBMS or multidimensional databases
Tables are units of storage | Cubes are units of storage
Data is normalized and used for OLTP. Optimized for OLTP processing | Data is de-normalized and used in data warehouse and data mart. Optimized for OLAP
Several tables and chains of relationships among them | Few tables, and fact tables are connected to dimensional tables
Volatile (several updates) and time variant | Non-volatile and time invariant
SQL is used to manipulate data | MDX is used to manipulate data
Detailed level of transactional data | Summary of bulky transactional data (aggregates and measures) used in business decisions
Normal reports | User-friendly, interactive, drag-and-drop multidimensional OLAP reports
62. Why are OLTP database designs not generally a good idea for a Data
Warehouse?
OLTP designs are for real-time transactional data; they are normalized and not
pre-aggregated, so they are not good for decision support systems.
63. What type of indexing mechanism do we need to use for a typical data
warehouse?
The primary index mechanism is the ideal type of index for a data warehouse.
64. What is VLDB?
Very Large Databases. Please find more information on it.
65. What is the difference between OLTP and OLAP?
Refer to the answer for question 61.
66. What is real time data warehousing?
Real-time data warehousing is a combination of two things: 1) real-time activity
and 2) data warehousing. Real-time activity is activity that is happening right now.
The activity could be anything such as the sale of widgets. Once the activity is
complete, there is data about it. Data warehousing captures business activity
data. Real-time data warehousing captures business activity data as it occurs. As
soon as the business activity is complete and there is data about it, the completed
activity data flows into the data warehouse and becomes available instantly. In
other words, real-time data warehousing is a framework for deriving information
from data as the data becomes available.
67. What is ODS?
An operational data store (ODS) is primarily a "dump" of relevant information
from a very small number of systems (often just one), usually with little or no
transformation. The benefit is an ad hoc query database which does not affect
the operation of systems required to run the business. ODSs usually deal with
"raw" and "current" data and can answer a limited set of queries as a result.
68. What is real time and near real time data warehousing?
The difference between real time and near real time can be summed up in one
word: latency. Latency is the time lag between an activity's completion and
the completed activity data being available in the data warehouse. In real time,
the latency is negligible, whereas in near real time the latency is a tangible time
frame such as two hours.
69. What are Normalization, First Normal Form, Second Normal Form and
Third Normal Form?
Normalization is the process of efficiently organizing data in a database. The two
goals of the normalization process are to eliminate redundant data (storing the
same data in more than one table) and to ensure data dependencies make sense
(only storing related data in a table).
First normal form:
    Eliminate duplicate columns from the same table.
    Create separate tables for each group of related data and identify each
    row with a unique column or set of columns (primary key).
Second normal form:
    Remove subsets of data that apply to multiple rows of a table and place
    them in separate tables.
    Create relationships between these new tables and their predecessors
    through the use of foreign keys.
Third normal form:
    Remove columns that are not dependent upon the primary key.
70. What is a fact table?
The central table in a star schema is called the FACT table, i.e. a table that
contains facts and is connected to dimensions. A fact table typically has two types
of columns: those that contain facts and those that are foreign keys to dimension
tables. The primary key of a fact table is usually a composite key that is made up
of all of its foreign keys. A fact table might contain either detail-level facts or facts
that have been aggregated (fact tables that contain aggregated facts are often
instead called summary tables). In the real world, it is possible to have a fact
table that contains no measures or facts. These tables are called factless
fact tables.
71. What is ETL?
Extract, transformation, and loading. ETL refers to the methods involved in
accessing and manipulating source data and loading it into the target database.
The first step in the ETL process is mapping the data between source systems and
the target database (data warehouse or data mart). The second step is cleansing
of source data in the staging area. The third step is transforming cleansed source
data and then loading it into the target system. Note that ETT (extract,
transformation, transportation) and ETM (extraction, transformation, move) are
sometimes used instead of ETL.
72. What is an ER diagram?
It is an entity relationship diagram. It describes the relationships among the
entities in the database model.
73. What is data mining?
Analyzing large volumes of relatively simple data to extract important trends
and new, higher-level information. For example, a data-mining program might
analyze millions of product orders to determine trends among top-spending
customers, such as their likelihood to purchase again, or their likelihood to switch
to a different vendor.
74. What is Star schema?
Star schema is a relational database schema for representing multi-dimensional
data. It is the simplest form of data warehouse schema and contains one or more
dimensions and fact tables. It is called a star schema because the
entity-relationship diagram between dimensions and fact tables resembles a star
where one fact table is connected to multiple dimensions. The center of the star
schema consists of a large fact table and it points towards the dimension tables.
The advantages of star schema are slicing down, performance increase and easy
understanding of data.
75. What is a lookup table?
Refer to the answer for question 77. Dimension tables are sometimes called lookup
or reference tables.
76. What is the level of granularity of a fact table?
The components that make up the granularity of the fact table correspond
directly with the dimensions of the data model. Thus, when you define the
granularity of the fact table, you identify the dimensions of the data model. The
granularity of the fact table also determines how much storage space the
database requires. For example, consider the following possible granularities for a
fact table:
    Product by day by region
    Product by month by region
The size of a database that has a granularity of product by day by region would
be much greater than a database with a granularity of product by month by
region because the database contains records for every transaction made each
day as opposed to a monthly summation of the transactions. You must carefully
determine the granularity of your fact table because too fine a granularity could
result in an astronomically large database. Conversely, too coarse a granularity
could mean the data is not detailed enough for users to perform meaningful
queries against the database.
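A back-of-the-envelope sketch of the storage effect described above. The product and region counts are made-up figures, and the calculation assumes one summary row per product, region and period, which gives an upper bound on row counts.

```python
# Hypothetical dimension sizes; adjust to your own model.
products = 1000
regions = 20
days_per_year = 365
months_per_year = 12

# One row per (product, period, region) combination over one year.
rows_by_day = products * days_per_year * regions
rows_by_month = products * months_per_year * regions

print(rows_by_day)    # 7300000
print(rows_by_month)  # 240000
print(round(rows_by_day / rows_by_month, 1))  # 30.4
```

Under these assumptions the daily grain needs roughly thirty times as many rows per year as the monthly grain.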
77. What is a dimension table?
A dimension table is one that describes the business entities of an enterprise,
represented as hierarchical, categorical information such as time, departments,
locations, and products. Dimension tables are sometimes called lookup or
reference tables. In relational data modeling, for normalization purposes,
country lookup, state lookup, county lookup, and city lookups are not merged as a
single table. In dimensional data modeling (star schema), these tables would be
merged as a single table called LOCATION DIMENSION for performance and slicing
data requirements. This location dimension helps to compare the sales in one
region with another region. We may see good sales profit in one region and loss
in another region. If it is a loss, the reasons for that may be a new competitor in
that area, or failure of our marketing strategy etc.
78. What are the various Reporting tools in the Market?
Crystal Reports, Business Objects, MicroStrategy, etc.
79. What are the various ETL tools in the Market?
Ab Initio, Informatica, etc.
80. What are the different methods of loading Dimension tables?
81. What are Semi-additive and factless facts and in which scenario will you
use such kinds of fact tables?
82. What is a three-tier data warehouse?
The three-tier differs from the two-tier architecture by strictly enforcing a logical
separation of the graphical user interface, business logic, and data. The three-tier
is widely used for data warehousing today. For organizations that require greater
performance and scalability, the three-tier architecture may be more appropriate.
In this architecture, data extracted from legacy systems is cleansed, transformed,
and stored in high-speed database servers, which are used as the target
database for front-end data access.
83. What are the various transformations available?
84. Importance of Surrogate Key in Data warehousing?
A Surrogate Key is a Primary Key for a Dimension table. The main importance of
using it is that it is independent of the underlying database, i.e. a Surrogate Key is
not affected by the changes going on with a database.
85. Differentiate Primary Key and Partition Key?
A Primary Key is a combination of unique and not null. It can be a collection of key
values, called a composite primary key. A Partition Key is just a part of the Primary
Key. There are several methods of partitioning like Hash, DB2, and Random etc.
While using Hash partitioning we specify the Partition Key.
86. Differentiate Database data and Data warehouse data?
Data in a database is detailed or transactional, both readable and writable,
and current.
Data in a data warehouse is detailed or summarized; it is a storage place for
historical data.
87. What are OLAP, MOLAP, ROLAP, DOLAP and HOLAP? Examples?
OLAP:
OLAP stands for Online Analytical Processing. It uses database tables (fact and
dimension tables) to enable multidimensional viewing, analysis and querying of
large amounts of data. E.g. OLAP technology could provide management with fast
answers to complex queries on their operational data or enable them to analyze
their company's historical data for trends and patterns.
MOLAP:
Stands for Multidimensional OLAP. In MOLAP cubes the data aggregations and a
copy of the fact data are stored in a multidimensional structure on the Analysis
Server computer. It is best when extra storage space is available on the Analysis
Server computer and the best query performance is desired. MOLAP local cubes
contain all the necessary data for calculating aggregates and can be used offline.
MOLAP cubes provide the fastest query response time and performance but
require additional storage space for the extra copy of data from the fact table.
ROLAP:
Stands for Relational OLAP. In ROLAP cubes a copy of data from the fact table is
not made and the data aggregates are stored in tables in the source relational
database. A ROLAP cube is best when there is limited space on the Analysis
Server and query performance is not very important. ROLAP local cubes contain
the dimensions and cube definitions but aggregates are calculated when they are
needed. A ROLAP cube requires less storage space than MOLAP and HOLAP
cubes.
HOLAP:
Stands for Hybrid OLAP. A HOLAP cube has a combination of the ROLAP and
MOLAP cube characteristics. It does not create a copy of the source data;
however, data aggregations are stored in a multidimensional structure on the
Analysis Server computer. HOLAP cubes are best when storage space is limited
but faster query responses are needed.
DOLAP:
88. What is OLTP?
OLTP stands for Online Transaction Processing. OLTP uses normalized tables to
quickly record large amounts of transactions while making sure that these
updates of data occur in as few places as possible. Consequently OLTP databases
are designed for recording the daily operations and transactions of a business.
E.g. a timecard system that supports a large production environment must
successfully record a large number of updates during critical periods like lunch
hour, breaks, startup and close of work.
89. Hierarchy of DWH?
90. What is aggregate awareness?
91. Explain reference cursor?
92. What are parallel queries and query hints?
93. DWH architecture?
94. What are cursors?
95. Advantages of de-normalized data?
96. What is Meta data and system catalog?
97. What is conformed dimension?
98. What is the capacity of power cube?
99. What are the differences between macros and prompts?
100. What is hash partition?
101. What is DTM session?
102. What is staging area?
The data staging area is a system that stands between the legacy systems and
the analytics system, usually a data warehouse and sometimes an ODS. The data
staging area is considered the "back room" portion of the data warehouse
environment. The data staging area is where the extract, transform and load
(ETL) takes place and is out of bounds for end users. Some of the functions of the
data staging area include:
    Extracting data from multiple legacy systems
    Cleansing the data, usually with a specialized tool
    Integrating data from multiple legacy systems into a single data
    warehouse
    Transforming legacy system keys into data warehouse keys, usually
    surrogate keys
    Transforming disparate codes for gender, marital status, etc., into the data
    warehouse standard
    Transforming the heterogeneous legacy data structures to the data
    warehouse data structures
    Loading the various data warehouse tables via automated jobs in a
    particular sequence through the bulk loader provided with the data
    warehouse database or a third-party bulk loader
103. What are data merging, data cleansing and sampling?
104. OLAP architecture?
105. What is subject area?
Subject area means the fundamental entities that make up the major components
of the business, e.g. customer, product, employee.
106. Why do we use DSS database for OLAP tools?
Refer to the answer for question 61.
107. What is tenacity?
The number of hours a Teradata utility will try to establish a connection to the
system. Default is 4 hours.
108. What is a checkpoint?
Checkpoints are entries posted to a restart log table at regular intervals during
the data transfer operation. If processing stops while a job is running, you can
restart the job at the most recent checkpoint.
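The checkpoint and restart idea can be sketched in a few lines. This is a toy model only, not how any Teradata utility is implemented: `load_rows`, the dictionary used as a restart log, and the simulated failure are all hypothetical.

```python
def load_rows(rows, target, restart_log, checkpoint_every=3, fail_at=None):
    """Copy rows into target, recording a checkpoint every few rows."""
    start = restart_log.get("last_checkpoint", 0)
    # On restart, discard anything applied after the last checkpoint.
    del target[start:]
    for i in range(start, len(rows)):
        if i == fail_at:
            raise RuntimeError("simulated failure")
        target.append(rows[i])
        if (i + 1) % checkpoint_every == 0:
            restart_log["last_checkpoint"] = i + 1


rows = list(range(10))
target, restart_log = [], {}

try:
    load_rows(rows, target, restart_log, fail_at=7)  # job dies at row index 7
except RuntimeError:
    pass

checkpoint_after_failure = restart_log["last_checkpoint"]
print(checkpoint_after_failure)  # 6

load_rows(rows, target, restart_log)  # restart resumes from the checkpoint
print(target == rows)  # True
```

The restart redoes only the work after the last checkpoint instead of reloading from the first row.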
109. What is slowly changing dimension?
In a slowly changing dimension the attribute for a record varies over time. There
are three ways to solve this problem.
    Type 1 - Replace an old record with a new record. No historical data
    available.
    Type 2 - Keep the old record and insert a new record. Historical data
    available but resource intensive.
    Type 3 - In the existing record, maintain extra columns for the new values.
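The three approaches above can be sketched with plain Python dictionaries standing in for dimension rows. The column names (`city`, `new_city`, `start_date`, `end_date`, `is_current`) and the helper functions are hypothetical, chosen only to illustrate the idea.

```python
from datetime import date


def scd_type1(row, new_city):
    """Type 1: overwrite in place; the old value is lost."""
    return {**row, "city": new_city}


def scd_type2(history, new_city, change_date):
    """Type 2: close the current row and append a new current row."""
    current = next(r for r in history if r["is_current"])
    closed = [
        {**r, "end_date": change_date, "is_current": False} if r["is_current"] else r
        for r in history
    ]
    new_row = {**current, "city": new_city, "start_date": change_date,
               "end_date": None, "is_current": True}
    return closed + [new_row]


def scd_type3(row, new_city):
    """Type 3: same row, with an extra column holding the new value."""
    return {**row, "new_city": new_city}


customer = {"cust_id": 1, "city": "Austin"}
history = [{"cust_id": 1, "city": "Austin", "start_date": date(2020, 1, 1),
            "end_date": None, "is_current": True}]

t1 = scd_type1(customer, "Dallas")
t2 = scd_type2(history, "Dallas", date(2024, 6, 1))
t3 = scd_type3(customer, "Dallas")

print(t1)       # {'cust_id': 1, 'city': 'Dallas'}
print(len(t2))  # 2
print(t3)       # {'cust_id': 1, 'city': 'Austin', 'new_city': 'Dallas'}
```

Type 2 is the only variant that grows the table, which matches the note above that it is resource intensive.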
110. What is sleep?
The number of minutes a Teradata utility will wait between logon attempts.
Default is 6 minutes.
111. Difference between MultiLoad and TPump?
TPump provides an alternative to MultiLoad for low-volume batch maintenance of
large databases under control of a Teradata system. TPump updates information
in real time, acquiring every bit of data from the client system with low
processor utilization. It does this through a continuous feed of data into the data
warehouse, rather than the traditional batch updates. Continuous updates result
in more accurate, timely data. TPump uses row hash locks rather than table level
locks. This allows you to run queries while TPump is running.
112. Different phases of MultiLoad?
    Preliminary phase
    DML phase
    Acquisition phase
    Application phase
    End phase
113. Explain modifier.
The EXPLAIN modifier generates an English translation of the parser's plan. The
request is fully parsed and optimized but not executed. EXPLAIN returns:
    Text showing how a statement will be processed.
    An estimate of how many rows will be involved.
    A relative cost of the request in units of time.
This information is useful for predicting row counts, predicting performance,
testing queries before production and analyzing various approaches to a problem.
114. Explain how hash distribution is done.
115. Difference between Oracle and Teradata warehouse.
Teradata can handle multiple terabytes of data. Teradata is linearly expandable,
and uses a mature optimizer and a shared-nothing architecture. It uses data
parallelism.
Teradata DBAs never have to reorganize data or index space, pre-allocate
table/index space, format partitions, tune buffer space, ensure the queries run in
parallel, pre-process data for loading, or write or run programs to split the input
data into partitions for loading.
116. What is dimensional modeling?
Dimensional data modeling comprises one or more dimension tables and fact
tables. Good examples of dimensions are location, product, time, promotion,
organization etc. Dimension tables store records related to that particular
dimension, and no facts (measures) are stored in these tables.
117. How will you solve the problem that occurs during update?
When there is an error during the update process, an entry is posted in the error
log table. Query the log table, fix the error and restart the job.
118. How is data distributed in a Teradata system?
119. Can you connect MultiLoad from Ab Initio?
Yes, we can connect.
120. What interface is used to connect to Windows-based applications?
WinCLI interface.
121. What is data warehousing?
A data warehouse is a subject-oriented, integrated, time-variant, non-volatile
collection of data in support of management's decision-making process.
122. What is data modeling?
A data model is a conceptual representation of data structures (tables) required
for a database and is very powerful in expressing and communicating the
business requirements.
123. What is logical data model?
A logical data model is the version of a data model that represents the business
requirements (entire or part) of an organization and is developed before
the physical data model. A sound logical design should streamline the physical
design process by clearly defining data structures and the relationships between
them. A good data model is created by clearly thinking about the current and
future business requirements. A logical data model includes all required entities,
attributes, key groups, and relationships that represent business
information and define business rules.
124. Tell us something about data modeling tools?
Data modeling tools are used to transform business requirements into a logical
data model, and a logical data model into a physical data model. From the physical
data model, these tools can be instructed to generate SQL code for creating
database entities.
125. Steps to create a data model?
    Get business requirements.
    Create High Level Conceptual Data Model.
    Create Logical Data Model.
    Select target DBMS where data-modeling tool creates the physical
    schema.
    Create standard abbreviation document according to business standard.
126. What is the maximum number of DML that can be coded in a MultiLoad
script?
A maximum of 5 DML can be coded in a MultiLoad script.