Professional Documents
Culture Documents
Big Data Technology can be defined as a Software-Utility that is designed to Analyse, Process and Extract the
information from an extremely complex and large data sets which the Traditional Data
Processing Software could never deal with.
DATA DASHBOARD
DATA
In computing, data is information that has been translated into a form that is
efficient for movement or processing. Relative to today's computers and
transmission media, data is information converted into binary digital form. It
is acceptable for data to be used as a singular subject or a plural subject. Raw
data is a term used to describe data in its most basic digital format.
The concept of data in the context of computing has its roots in the work of Claude
Shannon, an American mathematician known as the father of information theory.
DATABASE
Data or information is in raw format. With increasing data size, it has become a need for inspecting, cleaning,
transforming, and modeling data with the goal of finding useful information, making conclusions, and supporting
decision making. This process is known as data analysis.
Data mining is a particular data analysis technique where modeling and knowledge discovery for predictive rather
than purely descriptive purposes is focused. Business intelligence covers data analysis that relies heavily on
aggregation, focusing on business information. In statistical applications, some people
divide business analytics into descriptive statistics, exploratory data analysis (EDA), and
confirmatory data analysis (CDA).
EDA focuses on discovering new features in the data and CDA focuses on confirming or
falsifying existing hypotheses.
Data mining is a process used by companies to turn raw data into useful information. By using software to look for
patterns in large batches of data, businesses can learn more about their customers to develop more effective
marketing strategies, increase sales and decrease costs. Data mining depends on effective data collection,
warehousing, and computer processing.
How Data Mining Works
Data mining involves exploring and analyzing large blocks of information to glean meaningful patterns and trends.
It can be used in a variety of ways, such as database marketing, credit risk management, fraud detection, spam
Email filtering, or even to discern the sentiment or opinion of users.
The data mining process breaks down into five steps. First, organizations collect data and load it into their data
warehouses. Next, they store and manage the data, either on
in-house servers or the cloud. Business analysts, management
teams and information technology professionals access the
data and determine how they want to organize it. Then,
application software sorts the data based on the user's results,
and finally, the end-user presents the data in an easy-to-share
format, such as a graph or tables
DATA WAREHOUSE
Warehousing is an important aspect of data mining. Warehousing is when companies centralize their data into one
database or program. With a data warehouse, an organization may spin off segments of the data for specific users
to analyze and use.
However, in other cases, analysts may start with the data they want and create a data warehouse based on those
specs. Regardless of how businesses and other entities organize their data, they use it to support management's
decision-making processes.
Enterprise Data Warehouse is a centralized warehouse. It provides decision support service across
the enterprise. It offers a unified approach for organizing and representing data. It also provide
the ability to classify data according to the subject and give access according to those divisions.
Operational Data Store, which is also called ODS, are nothing but data store required when
neither Data warehouse nor OLTP systems support organizations reporting needs. In ODS, Data
warehouse is refreshed in real time. Hence, it is widely preferred for routine activities like storing
records of the Employees.
3. Data Mart:
A data mart is a subset of the data warehouse. It specially designed for a particular line of
business, such as sales, finance, sales or finance. In an independent data mart, data can collect
directly from sources.
Load manager
Warehouse Manager
Query Manager
INFORMATION