1. What is the difference between FastLoad and MultiLoad?
FastLoad uses multiple sessions to quickly load large amounts of data into an empty table. MultiLoad is used for high-volume maintenance on tables and views, and it also works with non-empty tables. A maximum of 5 tables can be used in MultiLoad.

2. Which is faster?
FastLoad.

3. Difference between inner join and outer join?
An inner join gets data from both tables where the specified data exists in both tables. An outer join gets data from the source table at all times, and returns data from the outer joined table ONLY if it matches the criteria.

4. What is multi insert?
Inserting data records into tables using multiple insert statements in one request. It is achieved by putting the semicolon in front of the keyword INSERT in the next statement rather than terminating the first statement with a semicolon:
INSERT INTO Sales (SELECT * FROM customer)
;INSERT INTO Loan (SELECT * FROM customer);

5. Is multi insert ANSI standard?
No.

6. How do you create a table with an existing structure of another table with data and with no data?
CREATE TABLE Customerdummy AS Customer WITH DATA; (or WITH NO DATA; to copy only the structure)

7. What is the opening step in a Basic Teradata Query script?
.LOGON tdpid/username, password

8. You are calling a BTEQ script, which drops a table and creates a table. It will throw an error if the table does not exist. How can you do it without throwing the error?
You can do it by setting the error level to zero before dropping and resetting the error level to 8 after dropping, like this:
.SET ERRORLEVEL (3807) SEVERITY 0;
DROP TABLE EMPLOYEE;
.SET ERRORLEVEL (3807) SEVERITY 8;

10. Can you FastExport a field, which is a primary key, by putting equality on that key?
No.

11. Did you write stored procedures in Teradata?
No, because they become a single-AMP operation and my company didn't encourage that.

12. What is the use of having indexes on a table?
Faster record search.

13. Did you use Queryman or SQL Assistant?
SQL Assistant 6.1.

14. I am updating a table in BTEQ.
It has to update a large number of rows, so it's really slow. What do you suggest?
In Teradata it is not recommended to update more than 1 million rows because of journal space problems. If it is fewer than that and it is still slow in BTEQ, you might want to add a COLLECT STATISTICS statement before the UPDATE statement.

15. Is it necessary to add a QUIT statement after a BTEQ query when I am calling it in a Unix environment?
Not necessary, but it is good to add a QUIT statement after a query.

16. There is a column with a date in it. If I want to get just the month, how can it be done? Can I use substring?
Substring is used with character fields, so it cannot be used. To extract the month from a date column use, for example, SELECT EXTRACT (MONTH FROM column_name). The same works for year or day, or for hour or minute if it is a timestamp: SELECT EXTRACT (MINUTE FROM column_name).

Confidential property of Gantec Corporation. Teradata Questions and Answers:

17. What's the syntax of substring?
SUBSTR (string_expression, n1 [, n2]) or, in ANSI form, SUBSTRING (string_expression FROM n1 [FOR n2])

18. Did you use the CASE WHEN statement? Can you tell us a little about it?
Yes. It is used when a result has to be selected depending upon the value of an expression.

19. While creating a table my DBA has FALLBACK or NO FALLBACK in his DDL. What is that?
FALLBACK requests that a second copy of each row inserted into a table be stored on another AMP in the same cluster. The copy protects the data when an AMP goes down or a disk fails.

20. My table got locked during MLOAD due to a failed job. What do I do to perform other operations on it?
Use RELEASE MLOAD. It removes access locks from the target tables in Teradata. It must be entered from BTEQ and not from MultiLoad. To proceed, you can run RELEASE MLOAD tablename;

21. How to find duplicates in a table?
Group by the fields in question and keep the groups that occur more than once: SELECT id, COUNT(*) FROM table GROUP BY id HAVING COUNT(*) > 1;

22. How do you verify a complicated SQL?
I use the EXPLAIN statement to check whether the query is doing what I wanted it to do.

23. How many tables can you join in V2R5?
Up to 64 tables.
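
The GROUP BY / HAVING pattern from question 21 can be tried out without a Teradata system. The following is only a minimal sketch: it uses Python's built-in sqlite3 module as a stand-in for Teradata, and the `customer` table, its columns, and the helper names `find_duplicates` and `demo_duplicates` are hypothetical, chosen just for illustration.

```python
import sqlite3


def find_duplicates(conn, table, column):
    """Return (value, count) pairs for values of `column` that occur more than once."""
    # Table and column names are interpolated directly into the SQL text,
    # so they must come from trusted code, never from user input.
    query = (
        f"SELECT {column}, COUNT(*) AS cnt "
        f"FROM {table} "
        f"GROUP BY {column} "
        f"HAVING COUNT(*) > 1 "
        f"ORDER BY {column}"
    )
    return conn.execute(query).fetchall()


def demo_duplicates():
    """Build a small in-memory table (hypothetical data) and list its duplicate ids."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE customer (id INTEGER, name TEXT)")
    conn.executemany(
        "INSERT INTO customer (id, name) VALUES (?, ?)",
        [(1, "Ann"), (2, "Bob"), (2, "Bob"), (3, "Cy"), (3, "Cy"), (3, "Cy")],
    )
    rows = find_duplicates(conn, "customer", "id")
    conn.close()
    return rows
```

Calling `demo_duplicates()` returns one row per duplicated id together with its occurrence count; ids that appear only once are filtered out by the HAVING clause. The same SELECT text should carry over to Teradata, but that is an assumption rather than something verified here.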
24. Did you ever use the UPPER function?
The UPPER function is used to convert all characters in a column to the same characters in upper case.

25. What does the LOWER function do?
The LOWER function is used to convert all characters in a column to lower case.

26. How do you see the DDL for an existing table?
By using the SHOW TABLE command.

27. Which is more efficient, GROUP BY or DISTINCT, to find duplicates?
With many duplicates GROUP BY is more efficient; if only a few duplicates exist DISTINCT is more efficient.

28. Syntax for the CASE WHEN statement?
CASE value_expression_1 WHEN value_expression_n THEN scalar_expression_n END;

29. What's the difference between TIMESTAMP (0) and TIMESTAMP (6)?
TIMESTAMP (0) is CHAR (19) and TIMESTAMP (6) is CHAR (26). Everything is the same except that TIMESTAMP (6) has microseconds too.

30. How do you determine the number of sessions?
It depends on:
- Teradata performance and workload
- Client platform type, performance and workload
- Channel performance for channel-attached systems
- Network topology and performance for network-attached systems
- Volume of data to be processed by the application

31. What is a node? How many nodes and AMPs were used in your previous project?
A node is a server running the Teradata database software. We used 318 nodes and each node has 2 to 4 AMPs.

32. What is a clique?
A clique is a group of disk arrays physically cabled to a group of nodes.

33. Interviewer explained about their project (environment, nature of work)
Listen carefully so that at the end of the interview you can ask questions about the project when you are given the chance.

34. Tell us something about yourself?
Describe your project experience and technical skill sets, and that you are hard working, a good team player, a self-learner and self-motivated.

35. What is the best project you ever worked on and why is it the best project?
All the projects I have worked on so far are best projects.
I treat every project equally and work hard for the success of the project.

36. What makes a project successful and how have you contributed to the success of the project?
Good team members, technical knowledge of team members, hard work, sharing knowledge among the team, and each individual's contribution to the project. Explain to them that you possess all the skills you mentioned above.

37. Have you worked under stress and how did you handle it?
Yes. Many times, to deliver the project on schedule, we were under a lot of pressure. During those times we worked extra hours and helped each other in the team to deliver the project on schedule. Team effort is a key factor for the success of the project.

38. What is the difference between FastLoad and MultiLoad?
Refer to the answer for question 1.

39. Have you used procedures?
No, I have not used procedures, but I have expert knowledge of writing them. My company has not encouraged me to write procedures because a procedure becomes a single-AMP operation and as such is expensive in terms of resources and time.

40. What is the purpose of indexes?
An index is a mechanism that can be used by the SQL query optimizer to make table access more performant. Indexes enhance data access by providing a more-or-less direct path to stored data and avoiding the necessity to perform full table scans to locate the small number of rows you typically want to retrieve or update.

41. What is primary index and secondary index?
The primary index is the mechanism for assigning a data row to an AMP and a location on the AMP's disks. Indexes are also used to access rows from a table without having to search the entire table. Secondary indexes enhance set selection by specifying access paths less frequently used than the primary index path.
Secondary indexes are also used to facilitate aggregate operations. If a secondary index covers a query, then the Optimizer determines that it would be less costly to access its rows directly rather than using it to access the base table rows it points to. Sometimes multiple secondary indexes with low individual selectivity can be overlapped and bit mapped to improve retrieval.

42. Why are primary and secondary indexes used?
Refer to the answer for the previous question.

43. What are the things to be considered while creating a secondary index?
Creating a secondary index causes Teradata to build a sub-table to contain its index rows, thus adding another set of rows that requires updating each time a table row is inserted, deleted, or updated. Secondary index sub-tables are also duplicated whenever a table is defined with FALLBACK, so the maintenance overhead is effectively doubled.

44. What is collect statistics?
It collects demographic data for one or more columns of a table, hash index, or join index, computes a statistical profile of the collected data, and stores the synopsis in the data dictionary. The Optimizer uses the synopsis data when it generates its table access and join plans.

45. Can we collect statistics on multiple columns?
Yes, we can collect statistics on multiple columns.

46. Can we collect statistics on table level?
Yes, we can collect statistics on table level. The syntax is COLLECT STAT ON TAB_A;

47. What is inner join and outer join?
Refer to the answer for question 3.

48. When is TPump used instead of MultiLoad?
TPump provides an alternative to MultiLoad for the low-volume batch maintenance of large databases under control of a Teradata system.
Instead of updating Teradata databases overnight, or in batches throughout the day, TPump updates information in real time, acquiring every bit of data from the client system with low processor utilization. It does this through a continuous feed of data into the data warehouse, rather than the traditional batch updates. Continuous updates result in more accurate, timely data. And, unlike most load utilities, TPump uses row hash locks rather than table-level locks. This allows you to run queries while TPump is running. This also means that TPump can be stopped instantaneously. As a result, businesses can make better decisions that are based on the most current data.

49. What is spool space, and when running a job, if it reaches the maximum spool space, how do you solve the problem?
Spool space is used to hold intermediate rows during processing, and to hold the rows in the answer set of a transaction. Spool space reaches its maximum when the query is not properly optimized. Use appropriate conditions in the WHERE clause of the query to limit the answer set.

50. What is your level of expertise in using the MS Office suite?
Expert level. I have been using it for the last 8 years for documentation.

51. Have you used NetMeeting?
Yes. I used NetMeeting for team meetings when members of the team were in geographically different locations.

52. Do you have any questions?
What is the team size going to be? What is the current status of the project? What is the project schedule?

53. What is your available date?
Immediate, or your available date for the project.

54. How much experience do you have in MVS?
Intermediate. In my previous two projects I used MVS to submit JCL jobs.

55. Have you created JCL scripts from scratch?
Yes. I have created JCL scripts from scratch while creating jobs in the development environment.

56. Have you modified any JCL script and used it?
Yes, I have modified JCL scripts.
In my previous projects many applications were re-engineered, so the existing JCL scripts were modified according to the company coding standards.

57. Rate yourself on using Teradata tools like BTEQ, Queryman, FastLoad, MultiLoad and TPump.
Intermediate to expert level. I have been using them extensively for the last 4 years. I am also certified in Teradata.

58. Which is your favorite area in the project?
I enjoy working on every part of the project. I volunteer my time for my peers so that I can also learn and contribute more towards the project's success.

59. What is a data mart?
A data mart is a special-purpose subset of enterprise data used by a particular department, function or application. Data marts may have both summary and detail data; however, usually the data has been pre-aggregated or transformed in some way to better handle the particular type of requests of a specific user community. Data marts are categorized as independent, logical and dependent data marts.

60. Difference between star and snowflake schemas?
Star schema is de-normalized and snowflake schema is normalized.

61. Why should you put your data warehouse in a different system other than the OLTP system?
Relational Data Modeling (OLTP design) | Dimensional Data Modeling (OLAP design)
Data is stored in RDBMS | Data is stored in RDBMS or multidimensional databases
Tables are units of storage | Cubes are units of storage
Data is normalized and used for OLTP. Optimized for OLTP processing | Data is de-normalized and used in data warehouse and data mart. Optimized for OLAP
Several tables and chains of relationships among them | Few tables; fact tables are connected to dimension tables
Volatile (several updates) | Non-volatile and time variant
SQL is used to manipulate data | MDX is used to manipulate data
Detailed level of transactional data | Summary of bulky transactional data (aggregates and measures) used in business decisions
Normal reports | User-friendly, interactive, drag-and-drop multidimensional OLAP reports

62. Why are OLTP database designs not generally a good idea for a data warehouse?
OLTP designs are for real-time data; they are normalized and not pre-aggregated, so they are not good for decision support systems.

63. What type of indexing mechanism do we need to use for a typical data warehouse?
The primary index mechanism is the ideal type of index for a data warehouse.

64. What is VLDB?
Very Large Databases. Please find more information on it.

65. What is the difference between OLTP and OLAP?
Refer to the answer for question 61.

66. What is real-time data warehousing?
Real-time data warehousing is a combination of two things: 1) real-time activity and 2) data warehousing. Real-time activity is activity that is happening right now. The activity could be anything, such as the sale of widgets. Once the activity is complete, there is data about it. Data warehousing captures business activity data. Real-time data warehousing captures business activity data as it occurs. As soon as the business activity is complete and there is data about it, the completed activity data flows into the data warehouse and becomes available instantly. In other words, real-time data warehousing is a framework for deriving information from data as the data becomes available.

67. What is ODS?
An operational data store (ODS) is primarily a "dump" of relevant information from a very small number of systems (often just one), usually with little or no transformation.
The benefit is an ad hoc query database that does not affect the operation of the systems required to run the business. ODSs usually deal with data that is "raw" and "current", and as a result can answer only a limited set of queries.

68. What is real time and near real time data warehousing?
The difference between real time and near real time can be summed up in one word: latency. Latency is the time lag between the completion of an activity and the completed activity data being available in the data warehouse. In real time the latency is negligible, whereas in near real time the latency is a tangible time frame such as two hours.

69. What are Normalization, First Normal Form, Second Normal Form and Third Normal Form?
Normalization is the process of efficiently organizing data in a database. The two goals of the normalization process are to eliminate redundant data (storing the same data in more than one table) and to ensure data dependencies make sense (only storing related data in a table).
First normal form: Eliminate duplicate columns from the same table. Create separate tables for each group of related data and identify each row with a unique column or set of columns (the primary key).
Second normal form: Remove subsets of data that apply to multiple rows of a table and place them in separate tables. Create relationships between these new tables and their predecessors through the use of foreign keys.
Third normal form: Remove columns that are not dependent upon the primary key.

70. What is a fact table?
The central table in a star schema is called the FACT table, i.e. a table that contains facts and is connected to dimensions. A fact table typically has two types of columns: those that contain facts and those that are foreign keys to dimension tables. The primary key of a fact table is usually a composite key that is made up of all of its foreign keys. A
fact table might contain either detail-level facts or facts that have been aggregated (fact tables that contain aggregated facts are often called summary tables instead). In the real world, it is possible to have a fact table that contains no measures or facts. These tables are called factless fact tables.

71. What is ETL?
Extract, transformation, and loading. ETL refers to the methods involved in accessing and manipulating source data and loading it into a target database. The first step in the ETL process is mapping the data between the source systems and the target database (data warehouse or data mart). The second step is cleansing the source data in the staging area. The third step is transforming the cleansed source data and then loading it into the target system. Note that ETT (extract, transformation, transportation) and ETM (extraction, transformation, move) are sometimes used instead of ETL.

72. What is an ER diagram?
It is an entity relationship diagram. It describes the relationships among the entities in the database model.

73. What is data mining?
Analyzing large volumes of relatively simple data to extract important trends and new, higher-level information. For example, a data-mining program might analyze millions of product orders to determine trends among top-spending customers, such as their likelihood to purchase again, or their likelihood to switch to a different vendor.

74. What is a star schema?
Star schema is a relational database schema for representing multi-dimensional data. It is the simplest form of data warehouse schema and contains one or more dimensions and fact tables. It is called a star schema because the entity-relationship diagram between dimensions and fact tables resembles a star where one fact table is connected to multiple dimensions. The center of the star schema consists of a large fact table, and it points towards the dimension tables.
The advantages of a star schema are slicing down, performance increase and easy understanding of data.

75. What is a lookup table?
Refer to the answer for question 77. Dimension tables are sometimes called lookup or reference tables.

76. What is the level of granularity of a fact table?
The components that make up the granularity of the fact table correspond directly with the dimensions of the data model. Thus, when you define the granularity of the fact table, you identify the dimensions of the data model. The granularity of the fact table also determines how much storage space the database requires. For example, consider the following possible granularities for a fact table:
- Product by day by region
- Product by month by region
The size of a database that has a granularity of product by day by region would be much greater than a database with a granularity of product by month by region, because the database contains records for every transaction made each day as opposed to a monthly summation of the transactions. You must carefully determine the granularity of your fact table, because too fine a granularity could result in an astronomically large database. Conversely, too coarse a granularity could mean the data is not detailed enough for users to perform meaningful queries against the database.

77. What is a dimension table?
A dimension table is one that describes the business entities of an enterprise, represented as hierarchical, categorical information such as time, departments, locations, and products. Dimension tables are sometimes called lookup or reference tables. In relational data modeling, for normalization purposes, country lookup, state lookup, county lookup, and city lookup are not merged into a single table. In dimensional data modeling (star schema), these tables would be merged into a single table called LOCATION DIMENSION for performance and data-slicing requirements. This location dimension helps to compare the sales in one region with another region.
We may see good sales profit in one region and a loss in another region. If it is a loss, the reason may be a new competitor in that area, or the failure of our marketing strategy, etc.

78. What are the various reporting tools in the market?
Crystal Reports, Business Objects, MicroStrategy, etc.

79. What are the various ETL tools in the market?
Ab Initio, Informatica, etc.

80. What are the different methods of loading dimension tables?

81. What are semi-additive and factless facts, and in which scenario will you use such kinds of fact tables?

82. What is a three-tier data warehouse?
The three-tier architecture differs from the two-tier architecture by strictly enforcing a logical separation of the graphical user interface, business logic, and data. The three-tier architecture is widely used for data warehousing today. For organizations that require greater performance and scalability, the three-tier architecture may be more appropriate. In this architecture, data extracted from legacy systems is cleansed, transformed, and stored in high-speed database servers, which are used as the target database for front-end data access.

83. What are the various transformations available?

84. Importance of surrogate key in data warehousing?
A surrogate key is a primary key for a dimension table. Its main advantage is that it is independent of the underlying database, i.e. a surrogate key is not affected by the changes going on in the database.

85. Differentiate primary key and partition key?
A primary key is a combination of unique and not null. It can be a collection of key values, called a composite primary key. A partition key is just a part of the primary key. There are several methods of partitioning, such as Hash, DB2, and Random. When using Hash partitioning we specify the partition key.

86. Differentiate database data and data warehouse data?
Data in a database is detailed or transactional, both readable and writable, and current.
Data in a data warehouse is detailed or summarized; it is the storage place for historical data.

87. What are OLAP, MOLAP, ROLAP, DOLAP and HOLAP? Examples?
OLAP: OLAP stands for Online Analytical Processing. It uses database tables (fact and dimension tables) to enable multidimensional viewing, analysis and querying of large amounts of data. E.g. OLAP technology could provide management with fast answers to complex queries on their operational data or enable them to analyze their company's historical data for trends and patterns.
MOLAP: Stands for Multidimensional OLAP. In MOLAP cubes the data aggregations and a copy of the fact data are stored in a multidimensional structure on the Analysis Server computer. It is best when extra storage space is available on the Analysis Server computer and the best query performance is desired. MOLAP local cubes contain all the necessary data for calculating aggregates and can be used offline. MOLAP cubes provide the fastest query response time and performance but require additional storage space for the extra copy of data from the fact table.
ROLAP: Stands for Relational OLAP. In ROLAP cubes a copy of data from the fact table is not made, and the data aggregates are stored in tables in the source relational database. A ROLAP cube is best when there is limited space on the Analysis Server and query performance is not very important. ROLAP local cubes contain the dimensions and cube definitions, but aggregates are calculated when they are needed. A ROLAP cube requires less storage space than MOLAP and HOLAP cubes.
HOLAP: Stands for Hybrid OLAP. A HOLAP cube has a combination of the ROLAP and MOLAP cube characteristics. It does not create a copy of the source data; however, data aggregations are stored in a multidimensional structure on the Analysis Server computer. HOLAP cubes are best when storage space is limited but faster query responses are needed.
DOLAP:

88. What is OLTP?
OLTP stands for Online Transaction Processing.
OLTP uses normalized tables to quickly record large amounts of transactions while making sure that these updates of data occur in as few places as possible. Consequently, OLTP databases are designed for recording the daily operations and transactions of a business. E.g. a timecard system that supports a large production environment must successfully record a large number of updates during critical periods like lunch hour, breaks, startup and close of work.

89. Hierarchy of DWH?

90. What is aggregate awareness?

91. Explain reference cursor?

92. What are parallel queries and query hints?

93. DWH architecture?

94. What are cursors?

95. Advantages of de-normalized data?

96. What is metadata and system catalog?

97. What is conformed dimension?

98. What is the capacity of power cube?

99. What are the differences between macros and prompts?

100. What is hash partition?

101. What is DTM session?

102. What is staging area?
The data staging area is a system that stands between the legacy systems and the analytics system, usually a data warehouse and sometimes an ODS. The data staging area is considered the "back room" portion of the data warehouse environment. The data staging area is where the extract, transform and load (ETL) takes place and is out of bounds for end users.
Some of the functions of the data staging area include:
- Extracting data from multiple legacy systems
- Cleansing the data, usually with a specialized tool
- Integrating data from multiple legacy systems into a single data warehouse
- Transforming legacy system keys into data warehouse keys, usually surrogate keys
- Transforming disparate codes for gender, marital status, etc., into the data warehouse standard
- Transforming the heterogeneous legacy data structures to the data warehouse data structures
- Loading the various data warehouse tables via automated jobs in a particular sequence through the bulk loader provided with the data warehouse database or a third-party bulk loader

103. What are data merging, data cleansing and sampling?

104. OLAP architecture?

105. What is subject area?
Subject area means the fundamental entities that make up the major components of the business, e.g. customer, product, employee.

106. Why do we use DSS database for OLAP tools?
Refer to the answer for question 61.

107. What is tenacity?
The number of hours a Teradata utility will try to establish a connection to the system. The default is 4 hours.

108. What is a checkpoint?
Checkpoints are entries posted to a restart log table at regular intervals during the data transfer operation. If processing stops while a job is running, you can restart the job at the most recent checkpoint.

109. What is slowly changing dimension?
In a slowly changing dimension the attribute for a record varies over time. There are three ways to solve this problem.
Type 1 - Replace the old record with a new record. No historical data is available.
Type 2 - Keep the old record and insert a new record. Historical data is available, but this is resource intensive.
Type 3 - In the existing record, maintain extra columns for the new values.

110. What is sleep?
The number of minutes the Teradata utility will wait between logon attempts. The default is 6 minutes.

111.
Difference between MultiLoad and TPump?
TPump provides an alternative to MultiLoad for low-volume batch maintenance of large databases under control of a Teradata system. TPump updates information in real time, acquiring every bit of data from the client system with low processor utilization. It does this through a continuous feed of data into the data warehouse, rather than the traditional batch updates. Continuous updates result in more accurate, timely data. TPump uses row hash locks rather than table-level locks. This allows you to run queries while TPump is running.

112. Different phases of MultiLoad?
- Preliminary phase
- DML phase
- Acquisition phase
- Application phase
- End phase

113. Explain modifier.
The EXPLAIN modifier generates an English translation of the parser's plan. The request is fully parsed and optimized but not executed. EXPLAIN returns:
- Text showing how a statement will be processed
- An estimate of how many rows will be involved
- A relative cost of the request in units of time
This information is useful for predicting row counts, predicting performance, testing queries before production and analyzing various approaches to a problem.

114. Explain how hash distribution is done.

115. Difference between Oracle and Teradata warehouse.
Teradata can handle multiple terabytes of data. Teradata is linearly expandable, uses a mature optimizer and a shared-nothing architecture, and uses data parallelism. Teradata DBAs never have to reorganize data or index space, pre-allocate table/index space, format partitions, tune buffer space, ensure the queries run in parallel, pre-process data for loading, or write or run programs to split the input data into partitions for loading.

116. What is dimensional modeling?
Dimensional data modeling comprises one or more dimension tables and fact tables. Good examples of dimensions are location, product, time, promotion, organization, etc.
Dimension tables store records related to that particular dimension, and no facts (measures) are stored in these tables.

117. How will you solve the problem that occurs during update?
When there is an error during the update process, an entry is posted in the error log table. Query the log table, fix the error and restart the job.

118. How is data distributed in a Teradata system?

119. Can you connect MultiLoad from Ab Initio?
Yes, we can connect.

120. What interface is used to connect to Windows-based applications?
WinCLI interface.

121. What is data warehousing?
A data warehouse is a subject-oriented, integrated, time-variant, non-volatile collection of data in support of management's decision-making process.

122. What is data modeling?
A data model is a conceptual representation of the data structures (tables) required for a database and is very powerful in expressing and communicating the business requirements.

123. What is a logical data model?
A logical data model is the version of a data model that represents the business requirements (entire or part) of an organization and is developed before the physical data model. A sound logical design should streamline the physical design process by clearly defining data structures and the relationships between them. A good data model is created by clearly thinking about the current and future business requirements. A logical data model includes all required entities, attributes, key groups, and relationships that represent business information and define business rules.

124. Tell us something about data modeling tools?
Data modeling tools are used to transform business requirements into a logical data model, and the logical data model into a physical data model. From the physical data model, these tools can be instructed to generate SQL code for creating database entities.

125. Steps to create a data model?
- Get business requirements.
- Create a high-level conceptual data model.
- Create the logical data model.
- Select the target DBMS where the data-modeling tool creates the physical schema.
- Create a standard abbreviation document according to the business standard.

126. What is the maximum number of DML statements that can be coded in a MultiLoad script?
A maximum of 5 DML statements can be coded in a MultiLoad script.
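
The step described in question 124, generating SQL for database entities from a data model, can be sketched in a few lines. This is only an illustration under stated assumptions: the `LOGICAL_MODEL` dictionary, its entities and attributes, and the functions `generate_ddl` and `build_physical_schema` are all hypothetical, and SQLite (through Python's built-in sqlite3 module) stands in for the target DBMS. Real data modeling tools do far more than this.

```python
import sqlite3

# Hypothetical logical model: entity name -> list of (attribute, SQL type, is_key).
LOGICAL_MODEL = {
    "customer": [
        ("customer_id", "INTEGER", True),
        ("name", "TEXT", False),
    ],
    "sales": [
        ("sale_id", "INTEGER", True),
        ("customer_id", "INTEGER", False),
        ("amount", "REAL", False),
    ],
}


def generate_ddl(model):
    """Turn each entity of the model into one CREATE TABLE statement."""
    statements = []
    for entity, attributes in model.items():
        columns = [f"{name} {sql_type}" for name, sql_type, _ in attributes]
        keys = [name for name, _, is_key in attributes if is_key]
        if keys:
            # Key attributes become the table's primary key constraint.
            columns.append(f"PRIMARY KEY ({', '.join(keys)})")
        statements.append(f"CREATE TABLE {entity} ({', '.join(columns)})")
    return statements


def build_physical_schema(model):
    """Run the generated DDL in an in-memory database and list the created tables."""
    conn = sqlite3.connect(":memory:")
    for statement in generate_ddl(model):
        conn.execute(statement)
    rows = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    conn.close()
    return [name for (name,) in rows]
```

Calling `generate_ddl(LOGICAL_MODEL)` yields the DDL text, and `build_physical_schema(LOGICAL_MODEL)` confirms that the statements actually create the tables. Keeping generation and execution separate mirrors the logical-to-physical split described in the answers above.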