
What is Big Data Technology?

Big Data Technology can be defined as a software utility designed to analyse, process and extract
information from extremely large and complex data sets that traditional data processing software
cannot handle.

We need Big Data processing technologies to analyse this huge amount of real-time data and come up with
conclusions and predictions to reduce risks in the future.
Types of Big Data Technologies:
Big Data Technology is mainly classified into two types:

Operational Big Data Technologies


Operational Big Data is the ordinary day-to-day data that we generate, such as online transactions, social media
activity, or the data of a particular organisation. It can be thought of as the raw data that feeds the Analytical
Big Data Technologies.

Analytical Big Data Technologies


Analytical Big Data is the more advanced side of Big Data Technologies and is somewhat more complex than Operational
Big Data. In short, Analytical Big Data is where the real analytical work happens: crucial real-time business
decisions are made by analyzing the Operational Big Data.

DATA DASHBOARD

A data dashboard is an information management tool that visually tracks, analyzes and displays key performance
indicators (KPIs), metrics and key data points to monitor the health of a business, department or specific process.
Dashboards are customizable to meet the specific needs of a department and company. Behind the scenes, a dashboard
connects to your files, attachments, services and APIs, but on the surface it displays all this data in the form of
tables, line charts, bar charts and gauges.
A data dashboard is an efficient way to track multiple data sources because it provides a central location for
businesses to monitor and analyze performance. Real-time monitoring reduces the hours of analysis and the long lines
of communication that previously challenged businesses.

Dashboards are fueled by Business Questions


The best data dashboards answer important questions about your business. Unlike
advanced business intelligence tools, dashboards are designed for quick analysis and
informational awareness. The most common approach to designing a business
dashboard is to build it using a question-answer format.
Dashboards can focus on presenting operational and analytical data
Dashboards present interactive data visualizations

DATA
In computing, data is information that has been translated into a form that is
efficient for movement or processing. Relative to today's computers and
transmission media, data is information converted into binary digital form. It
is acceptable for data to be used as a singular subject or a plural subject. Raw
data is a term used to describe data in its most basic digital format.
The concept of data in the context of computing has its roots in the work of Claude
Shannon, an American mathematician known as the father of information theory.

How data is stored


Computers represent data, including video, images, sounds and text, as binary values using patterns of just two
digits: 1 and 0. A bit is the smallest unit of data and holds a single binary value. A byte is eight bits
long. Storage and memory are measured in units such as megabytes and gigabytes.
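
As a small illustration of the paragraph above, the following Python sketch shows how text becomes bytes and bits. The helper names `to_bits` and `from_bits` are made up for this example, and UTF-8 is assumed as the text encoding.

```python
def to_bits(text: str) -> str:
    """Return the UTF-8 bytes of `text` as space-separated 8-bit groups."""
    return " ".join(format(byte, "08b") for byte in text.encode("utf-8"))


def from_bits(bits: str) -> str:
    """Invert to_bits: parse the 8-bit groups back into text."""
    return bytes(int(group, 2) for group in bits.split()).decode("utf-8")


encoded = to_bits("Hi")
print(encoded)             # 01001000 01101001
print(from_bits(encoded))  # Hi
```

Each group of eight digits is one byte, so the two-character string "Hi" occupies two bytes, or sixteen bits.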

DATABASE

A database is a collection of information that is organized so that it can be easily accessed, managed and updated.
Computer databases typically contain aggregations of data records or files, such as sales transactions or
interactions with specific customers.
Typically, the database manager provides users with the ability to control read/write access, specify report
generation and analyze usage. Some databases offer ACID (atomicity, consistency, isolation and durability)
compliance to guarantee that data is consistent and that transactions are complete.
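
To make the idea of a complete transaction concrete, here is a minimal sketch using SQLite through Python's built-in `sqlite3` module. The `accounts` table, the `transfer` helper and the sample balances are invented for illustration. The sketch only demonstrates atomicity and a consistency rule, not every ACID property.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# The CHECK constraint is the consistency rule: no balance may go negative.
conn.execute(
    "CREATE TABLE accounts (name TEXT PRIMARY KEY, "
    "balance INTEGER CHECK (balance >= 0))"
)
conn.executemany("INSERT INTO accounts VALUES (?, ?)",
                 [("alice", 100), ("bob", 50)])
conn.commit()


def transfer(conn, src, dst, amount):
    """Move `amount` from src to dst; both updates apply together or not at all."""
    try:
        with conn:  # commits on success, rolls back if an exception is raised
            conn.execute("UPDATE accounts SET balance = balance + ? WHERE name = ?",
                         (amount, dst))
            conn.execute("UPDATE accounts SET balance = balance - ? WHERE name = ?",
                         (amount, src))
        return True
    except sqlite3.IntegrityError:
        return False


def balances(conn):
    """Return the current balances as a {name: balance} dictionary."""
    return dict(conn.execute("SELECT name, balance FROM accounts ORDER BY name"))


ok = transfer(conn, "alice", "bob", 30)       # succeeds
failed = transfer(conn, "alice", "bob", 500)  # would overdraw alice, so it is rolled back
print(ok, failed, balances(conn))
```

In the failed transfer the credit to `bob` is executed first, but it is undone when the debit violates the constraint, so the stored data never reflects a half-finished transaction.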
Relational database
A relational database, invented by E.F. Codd at IBM in 1970, is a tabular database in which data is defined so that it
can be reorganized and accessed in a number of different ways.
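
The following sketch illustrates what it means for the same tabular data to be accessed in different ways. It uses Python's built-in `sqlite3` module. The `sales` table and its rows are made-up sample data.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (customer TEXT, product TEXT, amount INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [("ann", "laptop", 900), ("ann", "mouse", 20),
     ("raj", "laptop", 950), ("raj", "mouse", 25)],
)

# The same rows, reorganized by two different questions.
by_customer = conn.execute(
    "SELECT customer, SUM(amount) FROM sales GROUP BY customer ORDER BY customer"
).fetchall()
by_product = conn.execute(
    "SELECT product, SUM(amount) FROM sales GROUP BY product ORDER BY product"
).fetchall()

print(by_customer)  # [('ann', 920), ('raj', 975)]
print(by_product)   # [('laptop', 1850), ('mouse', 45)]
```

The table is stored once, and each query defines its own view of the data without changing how it is stored.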
Distributed database
A distributed database is a database in which portions of the database are stored in multiple physical locations,
and in which processing is dispersed or replicated among different points in a network.
Cloud database
A cloud database is a database that has been optimized or built for a virtualized environment, either in a hybrid
cloud, public cloud or private cloud. Cloud databases provide benefits such as the ability to pay for storage capacity
and bandwidth on a per-use basis, and they provide scalability on demand, along with high
availability.
NoSQL database
NoSQL databases are useful for large sets of distributed data.
Object-oriented database
Items created using object-oriented programming languages are often stored in relational databases, but object-
oriented databases are well-suited for those items.
Graph database
A graph-oriented database, or graph database, is a type of NoSQL database that uses graph theory to store, map
and query relationships.
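
The idea of storing and querying relationships can be sketched without a real graph database. The example below keeps a hypothetical "follows" graph in a plain Python dictionary and answers one relationship query with a breadth-first search. A real graph database has its own storage engine and query language, which this sketch does not try to imitate.

```python
from collections import deque

# Hypothetical "follows" relationships stored as an adjacency list.
follows = {
    "ana": ["ben", "cy"],
    "ben": ["dee"],
    "cy": ["dee"],
    "dee": [],
}


def shortest_path(graph, start, goal):
    """Breadth-first search: return the shortest node list from start to goal, or None."""
    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        node = path[-1]
        if node == goal:
            return path
        for neighbour in graph.get(node, []):
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append(path + [neighbour])
    return None


print(shortest_path(follows, "ana", "dee"))  # ['ana', 'ben', 'dee']
print(shortest_path(follows, "dee", "ana"))  # None
```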
What is Data Analytics?

Data usually starts out in raw form. As data volumes grow, so does the need to inspect, clean, transform, and model
data with the goal of finding useful information, drawing conclusions, and supporting decision making. This process
is known as data analysis.
Data mining is a particular data analysis technique that focuses on modeling and knowledge discovery for predictive
rather than purely descriptive purposes. Business intelligence covers data analysis that relies heavily on
aggregation, focusing on business information. In statistical applications, some people divide data analysis into
descriptive statistics, exploratory data analysis (EDA), and confirmatory data analysis (CDA).
EDA focuses on discovering new features in the data, and CDA focuses on confirming or falsifying existing hypotheses.
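
As a rough illustration of the first two categories, the sketch below computes descriptive statistics for a small made-up sample and then runs one simple exploratory check. The data and the two-standard-deviation threshold are assumptions chosen for the example. Confirmatory analysis would need a formal hypothesis test, which is not shown here.

```python
import statistics

daily_orders = [12, 15, 11, 14, 13, 40, 12]  # made-up sample data

# Descriptive statistics: summarize what the data looks like.
summary = {
    "mean": statistics.mean(daily_orders),
    "median": statistics.median(daily_orders),
    "stdev": statistics.stdev(daily_orders),
}

# A simple exploratory check: flag values far from the mean.
outliers = [
    x for x in daily_orders
    if abs(x - summary["mean"]) > 2 * summary["stdev"]
]

print(summary["median"])  # 13
print(outliers)           # [40]
```

The large gap between the mean and the median is the kind of feature that exploratory analysis is meant to surface for further investigation.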

What Is Data Mining?

Data mining is a process used by companies to turn raw data into useful information. By using software to look for
patterns in large batches of data, businesses can learn more about their customers to develop more effective
marketing strategies, increase sales and decrease costs. Data mining depends on effective data collection,
warehousing, and computer processing.
How Data Mining Works
Data mining involves exploring and analyzing large blocks of information to glean meaningful patterns and trends.
It can be used in a variety of ways, such as database marketing, credit risk management, fraud detection, spam
email filtering, or even to discern the sentiment or opinion of users.
The data mining process breaks down into five steps. First, organizations collect data and load it into their data
warehouses. Next, they store and manage the data, either on in-house servers or in the cloud. Business analysts,
management teams and information technology professionals access the data and determine how they want to organize
it. Then, application software sorts the data based on the user's results, and finally, the end user presents the
data in an easy-to-share format, such as a graph or table.
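
To give a feel for what "looking for patterns" can mean in practice, here is a minimal sketch that counts which pairs of items are bought together. The shopping baskets and the `min_support` threshold are invented for this example, and pair counting is only one of many possible mining techniques.

```python
from collections import Counter
from itertools import combinations

# Made-up purchase records: each set is one customer's basket.
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"butter", "jam"},
    {"bread", "milk", "jam"},
]

# Count how often each pair of items appears in the same basket.
pair_counts = Counter()
for basket in transactions:
    for pair in combinations(sorted(basket), 2):
        pair_counts[pair] += 1

# Keep only the pairs seen at least `min_support` times.
min_support = 2
frequent_pairs = {pair: n for pair, n in pair_counts.items() if n >= min_support}

print(frequent_pairs)  # {('bread', 'milk'): 3}
```

A result like this could feed a marketing decision, for example placing or promoting the two products together.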

DATA WAREHOUSE
Warehousing is an important aspect of data mining. Warehousing is when companies centralize their data into one
database or program. With a data warehouse, an organization may spin off segments of the data for specific users
to analyze and use.
However, in other cases, analysts may start with the data they want and create a data warehouse based on those
specs. Regardless of how businesses and other entities organize their data, they use it to support management's
decision-making processes.

Three main types of Data Warehouses are:


1. Enterprise Data Warehouse:

An Enterprise Data Warehouse is a centralized warehouse. It provides decision support services across
the enterprise. It offers a unified approach for organizing and representing data. It also provides
the ability to classify data according to subject and to give access according to those divisions.

2. Operational Data Store:

An Operational Data Store, also called an ODS, is the data store required when neither the data
warehouse nor OLTP systems support an organization's reporting needs. In an ODS, the data store is
refreshed in real time. Hence, it is widely preferred for routine activities like storing
records of employees.

3. Data Mart:

A data mart is a subset of the data warehouse. It is specially designed for a particular line of
business, such as sales or finance. In an independent data mart, data can be collected
directly from sources.
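
The "subset of the data warehouse" idea can be sketched in a few lines. In the example below the `warehouse` rows and the `build_data_mart` helper are hypothetical. A real data mart would normally be a separate database or schema rather than an in-memory list.

```python
# Made-up warehouse rows covering several lines of business.
warehouse = [
    {"dept": "sales", "region": "north", "amount": 120},
    {"dept": "finance", "region": "north", "amount": 300},
    {"dept": "sales", "region": "south", "amount": 80},
]


def build_data_mart(rows, dept):
    """Return only the rows that belong to one line of business."""
    return [row for row in rows if row["dept"] == dept]


sales_mart = build_data_mart(warehouse, "sales")
sales_total = sum(row["amount"] for row in sales_mart)

print(len(sales_mart), sales_total)  # 2 200
```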

Four components of Data Warehouses are:

Load Manager

Warehouse Manager

Query Manager

End-User Access Tools

INFORMATION
