Welcome to Scribd. Sign in or start your free trial to enjoy unlimited e-books, audiobooks & documents.Find out more
Download
Standard view
Full view
of .
Look up keyword
Like this
36Activity
0 of .
Results for:
No results containing your search query
P. 1
Satish Dwh Concept

Satish Dwh Concept

Ratings:
(0)
|Views: 449|Likes:
Published by api-3841240

More info:

Published by: api-3841240 on Oct 18, 2008
Copyright:Attribution Non-commercial

Availability:

Read on Scribd mobile: iPhone, iPad and Android.
download as DOC, PDF, TXT or read online from Scribd
See more
See less

03/18/2014

pdf

text

original

1) What are the advantages data mining over tradition...

A) Data Mining is used for the estimation of future. For example, if we take a company/business organization, by using the concept of Data Mining, we can predict the future of business in terms of Revenue (or) Employees (or) Customers (or) Orders etc.

Traditional approaches use simple algorithms for estimating the future. But,

it does not give accurate results when compared to Data Mining.
2) What is the difference between views and materialized Views?
2A) View - stores the SQL statement in the database and let you use it as a

table. Every time you access the view, the SQL statement executes.

Materialized view - stores the results of the SQL in table form in the
database. SQL statement only executes once and after that every time you
run the query, the stored result set is used. Pros include quick query results.

2B) VIEW: This is a PSEUDO table that is not stored in the database and it
is just a query.

MATERIALIZED VIEWS: These are similar to a view but these are
permantely stored in the database and often refreshed. This is used in
optimization for the faster data retrieval and is useful in aggregation and
summarization of data.
3) What is the main difference between Inmon and Kimball...?

3A) basically speaking, Inmon professes the Snowflake Schema while
Kimball relies on the Star Schema

3B) both differed in the concept of building the data warehouse...
According to Kimball...
Kimball views data warehousing as a constituency of data marts. Data marts

are focused on delivering business objectives for departments in the
organization. And the data warehouse is a conformed dimension of the data
marts. Hence a unified view of the enterprise can be obtained from the
dimension modeling on a local departmental level.

Inmon beliefs in creating a data warehouse on a subject-by-subject area
basis. Hence the development of the data warehouse can start with data from
the online store. Other subject areas can be added to the data warehouse as
their needs arise. Point-of-sale (POS) data can be added later if management
decides it is necessary.

i.e.,
Kimball--First Data Marts--Combined way ---Data warehouse
Inmon---First Data warehouse--Later----Data marts

3C) the main difference b/w the Kimball and inmon technologies is...
Kimball --- creating data marts first then combining them up to form a data
warehouse
Inmon----Creating data warehouse --- then data marts

3D) actually, the main difference is
Kimball: fallows Dimensional Modeling
Inmon: fallows ER Modeling bye Mayee

3E) Ralf Kimball: he follows bottom-up approach i.e., first creates
individual Data Marts from the existing sources and then create Data
Warehouse.

BillImmon: he follows top-down approach i.e., first creates Data Warehouse
from the existing sources and then create individual Data Marts.
4) What is junk dimension? What is the difference between junk dimension
and degenerate dimension?

4A) a junk dimension is a collection of random transactional codes, flags and text attributes that are unrelated to any particular dimension. The junk dimension is simply a structure that provides the convenient place to store the junk dimension.

4B) a "junk" dimension is a collection of random transactional codes, flags
and/or text attributes that are unrelated to any particular dimension. The junk
dimension is simply a structure that provides a convenient place to store the
junk attributes. Where as A degenerate dimension is data that is dimensional
in nature but stored in a fact table.

4C) junk dimension:
The column which we are using rarely or not used, these columns are

formed a dimension is called junk dimension
Degenerative dimension
The column which we use in dimension is degenerative dimension
Example:
EMP table has empno, ename, sal, job, deptno
But
We are talking only the column empno, ename from the EMP table and

forming a dimension this is called degenerative dimension
5) What is the definition of normalized and denormalization?
5A) Normalization is the process of removing redundancies.
Denormalization is the process of allowing redundancies.

Activity (36)

You've already reviewed this. Edit your review.
1 hundred reads
1 thousand reads
maheshumbarkar liked this
stevester liked this
hemaghsr liked this
EmperorAkshay liked this
AIWA23 liked this
vijay550 liked this

You're Reading a Free Preview

Download
scribd
/*********** DO NOT ALTER ANYTHING BELOW THIS LINE ! ************/ var s_code=s.t();if(s_code)document.write(s_code)//-->