) and collecting statistics and information about that data. The purpose of these statistics may be to:
Find out whether existing data can easily be used for other purposes
Give metrics on data quality, including whether the data conforms to company standards
Assess the risk involved in integrating data for new applications, including the challenges of joins
Track data quality
Assess whether metadata accurately describes the actual values in the source database
Understand data challenges early in any data-intensive project, so that late project surprises are avoided. Finding data problems late in the project can incur time delays and project cost overruns.
Have an enterprise view of all data, for uses such as Master Data Management, where key data is needed, or Data governance for improving data quality

Data governance is a quality control discipline for assessing, managing, using, improving, monitoring, maintaining, and protecting organizational information. It is a system of decision rights and accountabilities for information-related processes, executed according to agreed-upon models which describe who can take what actions with what information, and when, under what circumstances, using what methods.
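As a rough illustration of the profiling statistics described earlier in this section, the sketch below computes a few common per-column measures (row count, null count, distinct count, minimum, maximum, most frequent value). The function name profile_column, the sample rows, and the column names are assumptions made up for this example; a real profiling tool would collect many more statistics.

```python
from collections import Counter


def profile_column(values):
    """Return basic profiling statistics for one column (a list of values)."""
    non_null = [v for v in values if v is not None]
    counts = Counter(non_null)
    return {
        "row_count": len(values),
        "null_count": len(values) - len(non_null),
        "distinct_count": len(counts),
        "min": min(non_null) if non_null else None,
        "max": max(non_null) if non_null else None,
        "most_common": counts.most_common(1)[0][0] if counts else None,
    }


# Hypothetical sample rows; the column names are illustrative only.
rows = [
    {"customer_id": 1, "country": "US"},
    {"customer_id": 2, "country": None},
    {"customer_id": 3, "country": "DE"},
    {"customer_id": 3, "country": "US"},
]

# Profile every column of the sample table.
profile = {col: profile_column([r[col] for r in rows]) for col in rows[0]}
```

In this sample, the repeated customer_id and the null country are the kind of findings that would feed the data quality metrics mentioned above.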
What are derived facts and cumulative facts?
There are two kinds of derived facts. The first kind is additive and can be calculated entirely from the other facts in the same fact table row; such facts can be shown in a user view as if they existed in the real data. The user will never know the difference.
The second kind of derived fact is a non-additive calculation, such as a ratio, or a cumulative fact that is typically expressed at a different level of detail than the base facts themselves. A cumulative fact might be a year-to-date or month-to-date fact. In any case, these kinds of derived facts cannot be presented in a simple view at the DBMS level because they violate the grain of the fact table. They need to be calculated at query time by the BI tool.
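The difference between the two kinds can be sketched in a few lines of Python. The fact rows, the column names (sales_amount, cost_amount, profit_amount), and the helper functions are all assumptions invented for this illustration; they stand in for a fact table at monthly grain and for what a BI tool might compute at query time.

```python
from itertools import accumulate

# Hypothetical fact rows at a monthly grain; names and numbers are illustrative.
fact_rows = [
    {"month": "2024-01", "sales_amount": 100.0, "cost_amount": 60.0},
    {"month": "2024-02", "sales_amount": 150.0, "cost_amount": 90.0},
    {"month": "2024-03", "sales_amount": 250.0, "cost_amount": 200.0},
]

# First kind: an additive derived fact, computed from facts in the same row.
# It could equally be exposed through a simple view.
for row in fact_rows:
    row["profit_amount"] = row["sales_amount"] - row["cost_amount"]


def margin_ratio(rows):
    """Second kind: a non-additive ratio, recomputed from summed components."""
    total_sales = sum(r["sales_amount"] for r in rows)
    total_profit = sum(r["profit_amount"] for r in rows)
    return total_profit / total_sales if total_sales else None


def year_to_date(rows, measure):
    """Second kind: a cumulative fact, a running total over rows in month order."""
    ordered = sorted(rows, key=lambda r: r["month"])
    running = accumulate(r[measure] for r in ordered)
    return [(r["month"], total) for r, total in zip(ordered, running)]


overall_margin = margin_ratio(fact_rows)
ytd_sales = year_to_date(fact_rows, "sales_amount")
```

Note that the margin is computed as a ratio of sums, not as an average of per-row ratios. Storing the ratio in each row and aggregating it would give a different and misleading number, which is one reason such facts are left to query time.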
Question: What is the data type of the surrogate key?
Answer: The data type of the surrogate key is integer, numeric, or number.
Question: What is a hybrid slowly changing dimension?
Answer: A hybrid SCD is a combination of SCD Type 1 and SCD Type 2. In a given table, some columns may be important enough that we need to track changes to them, i.e. capture their historical data, whereas for other columns we do not care even if the data changes. For such tables we implement a hybrid SCD, in which some columns are Type 1 and some are Type 2.
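A minimal sketch of this idea in Python follows. It assumes a customer dimension held in memory as a list of dictionaries, with email treated as a Type 1 column and city as a Type 2 column. The column names, the apply_change helper, and the validity fields (valid_from, valid_to, is_current) are hypothetical choices for this example. The choice to overwrite Type 1 columns on every historical version, rather than only on the current row, is also an assumption.

```python
from datetime import date

TYPE1_COLS = {"email"}  # assumed: overwrite in place, no history kept
TYPE2_COLS = {"city"}   # assumed: a change creates a new row version


def apply_change(dim_rows, natural_key, changes, today):
    """Apply a hybrid SCD update to an in-memory dimension (list of dicts)."""
    versions = [r for r in dim_rows if r["customer_id"] == natural_key]
    current = next(r for r in versions if r["is_current"])

    # Type 2: if a tracked column changed, close the current row and add a version.
    if any(current[c] != v for c, v in changes.items() if c in TYPE2_COLS):
        new_row = dict(current)
        new_row.update({c: v for c, v in changes.items() if c in TYPE2_COLS})
        # New integer surrogate key for the new version.
        new_row["customer_key"] = max(r["customer_key"] for r in dim_rows) + 1
        new_row["valid_from"] = today
        current["valid_to"] = today
        current["is_current"] = False
        dim_rows.append(new_row)
        versions.append(new_row)

    # Type 1: overwrite the value on every version of this customer.
    for row in versions:
        for c, v in changes.items():
            if c in TYPE1_COLS:
                row[c] = v
    return dim_rows


dim = [{
    "customer_key": 1, "customer_id": "C100",
    "email": "a@example.com", "city": "Pune",
    "valid_from": date(2023, 1, 1), "valid_to": None, "is_current": True,
}]

apply_change(dim, "C100", {"email": "b@example.com"}, date(2024, 1, 1))
apply_change(dim, "C100", {"city": "Delhi"}, date(2024, 6, 1))
```

After the two calls, the email change has simply been overwritten, while the city change has produced a second row with a new surrogate key and has closed the original row.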