You are on page 1of 14

Name -Jatin San deep Naik MT

Page Ho:
W

MCA- B C- 230+ DAte: YOUVA

Asignrment No. 2
Explain Dach wan'ehouse 'achi leetuse in dletcil

Daa Wanchousing is the cpository of inkegneked


ivtowahon data will be extuckd Arom tte heteoqne
Sownces Data cwoe housing.acliteche Contoins the
cli feuent sousics ike ate iles, and ERP Hen altey
aoea auol-data wasehousing
ate that t has te ci fPeeet data masLk
ten, it have the difoent date it also
hawe tue GDs-O pes.atioa Data Store-Tis
Complete acitectuse is colled t e Data waehousing
aseitectse
Tue data wasrehouses ascitectue based on velationa
databae Managemeut systan (KDBMs) Tle anclutetev
Can be detumined ormodified ,after inplonentakie
La Subjcct-oneted ,iuteqrated tine -Vaxiènt
Collection of data to enabledecision making
Ore
a dispahate qroup cfusees
thi no'st basic concepks of. cata. Cwaiehousis is
to cleanilte,
clean anstom, Sumprasi2e cnd
anel
Hue dlatu aud Hhin put it inih
Shic hse foy
tor Casy accss and analysis by tese
usees iSut, hat shutue nst first be
ceined and Hat is te task of te data
wanchouse model Te modelius a dala wasehew
we, beqin by asedilectingHhe olata y we
srvce aad locale it acording toits
AeEordins: chanatteistics
W

Page No,

C-23oRt YOUVA

Tiecl Ties2 Tiec3

(Analysis
Extnal
Sources Data Repor
Wachoe Serot Date

Opckodioa
Databuses
Dota
Shorage
Data Data OLAP front End
Sosces Siexaqe tools

In His auci tectuse, He opeBahional data processing


is entirelly sperte from data waehouse pYCEsS0 ng
H sowce of data for the data cwaehouse
CoMes fom opeation
opcahonalal dotebase

Diefeset Heas of Dota woehouse.

0 Boto Tteu : Botom iLa deals with e meuing


xeleted data aY informaton from yaious
informatton posi tories by using SQLThe bottom
Hen is a wachose database Sewee that is
alost aweys a oelational database system
data fm opectional debasec and fm extenal
Souice extacted using applu cation pregam
MT W
Paqe No.
AVUo YOUVA
C-23o 44 Date

intenteces known qaleuays


( pata Exrachon ,- Datoa extsaction is He act or
pncesS retieving data_out of data sosK ces
for funtuH dala proc essing or data Storage
(data miqratan)
(3) Data Cleansiny 3- The task of coYreching and
data is called data
prtpaocasing
@Data transfornmahon s- A deta transfomation
Conu ts data bom a SowrC data forat
fnto destinatiou datoa

SLoading-loading oten implets physical


MouenenP of the data trom Hhe coriputer storing
Hhe sowHce dotabase to tiat wich will stor
sfo
tHe dota Cwasehouse dota bese, assumine
jt is dilfeeent

6) Midd le Tieu Midd l HeH exists between t e wsey


wnteuace on thehe cient síde and dae base
Manageemt systeMs (DBMs on the siwer side
he iddleex is also called the appucaton
Swer Tt conteinsa cenalzedprucessine
Loic wich ped litates Manase Meut ovnd
lagic
adhinistoation -
The mi dd le He is an OLA Seweh tHlat
is ypielly impla a teletonal
OLAP(R OLAP) oY mti dlinention al (MOLAP)
odeb
YOUYA
C-2302 Date

Heng- The top Heu is Hue wey sglem inkofat


(clieont) he top He is c client, wich contins
qubuy ad portiag tools ,analipis tools and /or
dota 'ming teols -

Q2 lwite a Shot not ETL Process


ETL is pnceas tat extracts e date from
dieferent sousce Systestnen toansform tae
data like agpying caleulahiong concontenction,
et and Hinally loads the data into tne
Data wasrehous syste m Ful( form of ETL
is Extyact Tranfor and Load
Tue ETL proces tquires achveinputs
Pom vaious
,analysiSkeste
stekehaldess
inclucing
dlopens
to exROuthves and technically
clallluging
ETL Puces in Dat Wonehowses

Oraele
SAL
Sewe

Sagiag hrea Deta


Tea Wesehous
dota

ETL Process
File
M

Page No
C-2307 Date YOUYA

O Exhrachon
Lu His slep of ETL ancitechue, dta is
Ta
skep
exracte ol foom the sowce system into Hhe
aeaTransfomations
Staging
done
Siagng
B
aea so Hhet peutorance
So wiCe is not cleqraded" Also f
Comupkcd data is eplcd divecHy from thc
Patacwanehouse datase
Yoll back will be a chalenge Staging
gives an Pportunity to vaiud
extacted data before it mou nto ata waehase

2 Transformati on
’ Deta extcted from So see sesw on is aw and not
usable in its ognod Form Thenefore it needs to
be cleansed mapped and tragfeved It is one
of e mportant ETL Concepts whee you q
cpply 3et af Punctton one extacte d date
De Huat olees not equreA any trayfemnton
is celled as direc t m

3 Loadiy &
- Loacing data indo Hhe totgetted tovg eked
data wcn hoe datebse is Hhe lo
ETL proces: Tu aa typicad Pale cwane houehus
Hypiccl Date
veLume of daute needd to be loaoled n a relatiuely
skort peiad (riats) kence, load poveis should
be opinÜzod or pchfermance
TW
Page No,
YOUVA
C-23o DATe

Q.3Short note Pata Mart

AA Data mat is a siyple form of data woKehowse


Hhat is focwsed on a single subiect (or Punchiona
Sales Pinance or
aria) Such
Paa Mats ae often built ancd maslcehg
controlled
by a a Single de pas tment withinan OTqani2ation
Eahdata mat Can Contain dilfesnt
Combinaious of tablescolums cud ws from te
Enteprise data wosehouse
Lind oF data Mauts:

ODependeut Datoa Macct: A dependet data Mat is


One uwhose Sousce is a dota wreloue

Tudepencleut Data Maut: An indepencdent data


mast is One whose Sousrceis He
appli cation enuisonment
Advautages e
when you dont have He fll data w
0eloesiny
experti'se available to youÝt Company
2) Tt s faste to iMplin
Concentrating on asubset of a
as yau
e data yeu
you only
ned foy appiccution aea
Ttearatng date from melkiple Sowr ces -
of
Pesormin new
analysis isto al date -
Reduciny Cast to access
Page No
C-23O77 Date YOUVA

Disad vandagesá
OData Nats have Mauyissues inckuding funchonaliy
ydata siz sralailbiy,befos Mane,da aecess
andCo njolicdation:

Q. kOLAP VS MOLAP

RoLAP
OROLAP sBandls or OMoLAP 3taundy for
Telatiouol onliueuliciension al online
Axalytial przcessi Analytcod proeasiy
OTt uualliy wed weu Tt used when data uandae
data wReuse contiu Conting relationod as wel]
velational dateasnou-relatioucul data
3Tt caaains Analytcad 3) t conteins the MDD8.
SwA Sewe
t creates aalti- A) Tt Contains prefabicated
dimensionad vie of data cubes -
dada dynamically
S Tt is vesy eay Tt is cuPficlt to
to iapleent implenentiA
Tt haj c 6) Tt has less oesponse
espense tine Hne due to prefticakd
Cubes

of meory anout of memorys


MT W
Paye No
YOUVA
C-2307 Date

0:S OLAP VS OLTP

Arta OLAP OLTP


Compasision

Time This stores historicoul This stores cutent


Scale data for analyis data

Sowree of Cperotond dat Consolidatou date


deta OLTPs oe hhe OLAP data com e
Origina
oe He
So
data
ce fnM Hhe vOuous
OLTP Databases.

Tadeing Opiizes adhoc Opinizes update


queies by incudinglpefmance buy
lots of inolexes miimizing He
number of dexes

Nornalzation Possibly pauticlly


denornalized fo
Tus is
noTMaiced.

as thisis used for


Lporting
Orqauzaion Douta stored esolves Data steved evolvey
aoUnd informahon asLOUnd busiess
topics funchons
Stored Stores descrphe Stort
ply
Values datec Codod ata
M T W
Page No.:

C-23047 Date: YOUVA

Aveas of GLAP OLTP


Compais ov
Processiug Dependson tue ypically Vey
Speed auouht of data 'Past
Mvalved

fuspose of To help wik 0 contl and un


data pnblau
planning fundamntal
Solving and decisio business tasks
Suppert
|Backyp of Tiuledd of rxqulan |Backop seligiously
opotioul data
Recovey backups, scnd.
environments may is ciical o uo
considos siMHe
rloading
Hhe businiss ,data loss
is likely fo ontail
OLTP dae as 3lqniticant monctauy
a ecovey melod toss aud lega
Uability
D8 Size l00 GB- TB

Hundeds Thouaudi

Applcato n Managut
Trtoonton systonm CRM, leqau
docisiov apps
M

Page No.:
YOUVA
C-23077 Date:

GExplain Stat scema , growftake schoma.


and fact constellatton in dotails

>0Star ScheMa!he stcUu schema is te


Sinplest style of cata nsehouse schema
The ston schena consists o Oneor or
fact takey (FT) Per encing ouny num bes
of dimension tables (oT)'He 's fas
Sche ma an important Spedal case of
the snow take schema and is ore
ePféc ive for haudting siMpleu queries
qUe ries Tun
otun sense te stas schema is a
elationaldata base schema for epseuting
ulti dinntional dote
A st1 Schema is caled such as
it esunbles a constellation of stas
gansially stuetal bHgut stros (facts).
SUr1o Unded by diameA ones [dimentious)
The fact table holds, HL netic yalues
Corded for a speci ic event. Becae of
the dosit to hold autonic level data
ey ae a
of oline ntion tabll LLSUlly
hawe few corcs compaed to fact
taldes but ay have
numbee atti butes tat decbes te
fact data o
MTWTF
Page No

C-230a7 Date YOUVA

TiME (p) PRODWT(oT)


day
Productky
Product-nane
day fHwek Categery
nlon braud
Suatees Color
yea Suppieunanee

SALES,FT (F)T
ne-key
Productkey
locatt once
Mass

LoCATON (or)
Location-kay
3tore
Street address
city
lostalteo
Bregion
delhi"gion
Ston Schena tor biq Lop Maket
Shap
M TWTES

Page No.
YOUVA
Date:

Advautages of
prouide
Stn Schea s
a diret cud iututive nappins
bewoen
loy eud
lbusiness.
uses
enthes being
aud t e scma
aualyed
design
Suow Flake
TeSnow Hake schma is a vaient
of the sta sclema odelthe next kind
cY odel is called a Snoo Plake modal
and is vey similan to He aboue Scuma
expect that sou of te edunan cy in te
dimentiens is emoved by wsing what is
called dat norimalized tables he dlate
the snowake scheM a is an extension of
Hhe stay schem a ,wlene each point of He
stan explodes into nore points In a ste
sc hema each diension is preseted by
a siugle dimention a
single tablé, whee as
a Snow' fake schena Hat dimentionad
table nomaiz d into u ltiple look up
tables each prscutin a levelin t e
dierional
Liehancy:
Aduautages -
Sauing Storage Space.
easieH to maint
imost swtble for
quey poCeSS ing:
MT W

Page No.:
Date: YOUVA

Dis aduantages
Due to lage numben ot
Cemplex to navigate .
Bouosing woo le be
Denormalized
oliPicult.
Highaly Dealeu (pT)
TneloT)) dealekey
deale_name
day Pozcuct DT)
cayofthe_week
montu
Productkey
Prouetname
quateu
SALES (DT) brañd
Hme key Color

Producty Sepplies
|location y
branch ku
Units So ld
Branch (on) Onount
LocaHeu (DT)
yanhne location key
branchype Store
esteetaddress.

Citty (oT)

Snow Foake ScleMa of


big shop Monket. Site
Coont
MT W T E

Page No.:
YOUVA
Date.

Fact Constell ation -


Tis Schema is more Complex than st
Or SnOuo fHaee achitecue be cause it con tains
Nultple fact ables.is allow dimension
tables to be shased fact
tables A fac oamengst
Mny alo
Constellaton gchema
Dimension tables to be shaxed bttween
fact tables.
The main olis advantqe of te Fact
Canstel lation chia is a more complicakd
desiqn because many vaiants of agsreaqtion
mst be considereTn a fact constelloticn
Schema, dillexent fact tables ae explici
assigned to te dimengions,ahich ae expliify
for qivenfaacts nelevant Tis My be sefu
Cases cohen ame fa cts a associa ted
wit a qiven oliension level and otie
facts uih oeepe climensi on level.

You might also like