Professional Documents
Culture Documents
Includes tools for organizing, managing, and accessing the data in the database
Data definition, dictionary and manipulation
Query ability
Controlling of redundancy
Computation
Backup and replication
Automated optimization
Rule enforcement and security
Provides multiple user interfaces
4. Normalization
Normalization is the process of reorganizing data in a database so that it all data is stored in
just one place and all related data items are stored together. Normalization is important
mostly because it allows databases to cover up as little disk space as possible, resulting in
increased performance.
5. Hadoop
Apache Hadoop is a freely licensed software framework developed by the Apache Software
Foundation and used to develop data-intensive, distributed computing. Hadoop is designed to
scale from a single machine up to thousands of computers. The concept was inspired by
Google MapReduce and Google File System papers.
6. A) Data Warehouse B) Data Mart
A data warehouse is a collection of corporate information and data derived from operational
systems and external data sources. It is designed to support business decisions by allowing
data consolidation, analysis and reporting at different aggregate levels through extraction,
transformation and loading.
A data mart is a subject-oriented archive that stores data and uses the retrieved set of
information to assist and support the requirements involved within a particular business
function or department. They exist within a single organizational data warehouse repository.
7. Online analytical processing (OLAP)
Online analytical processing (OLAP) is a high-level concept that describes a category of tools
that helps in the analysis multi-dimensional queries It became relevant in the 1970s as the
volume of business data became too heavy for adequate analysis through SQL queries. OLAP
can uncover data relationships between seemingly unrelated events and trends, thus
enhancing business decision making.
8. Data Mining
Data mining is the process of analyzing hidden patterns of data according to different
perspectives for categorizing into useful information, which is collected and assembled in
common areas, such as data warehouses, for efficient analysis, data mining algorithms,
facilitating business decision making and other information requirements to ultimately cut
costs and increase revenue. Data mining is also known as data discovery and knowledge
discovery.