ABOUT ‘BIG DATA & ANALYTICS’

• Big data analytics is the often complex process of examining large and varied data sets to
uncover information, including hidden patterns, unknown correlations, market trends and
customer preferences, that can help organizations make informed business decisions.
• On a broad scale, data analytics technologies and techniques provide a means to analyse data
sets and draw conclusions about them to help organizations make informed business
decisions. Business intelligence (BI) queries answer basic questions about business operations and performance.
EMERGENCE AND GROWTH OF BIG DATA ANALYTICS

• The term big data was first used to refer to increasing data volumes in the mid-1990s. In
2001, Doug Laney, then an analyst at consultancy Meta Group Inc., expanded the notion of
big data to also include increases in the variety of data being generated by organizations and
the velocity at which that data was being created and updated. Those three factors -- volume,
velocity and variety -- became known as the 3Vs of big data.
• Separately, the Hadoop distributed processing framework was launched as an Apache open
source project in 2006, planting the seeds for a clustered platform built on top of commodity
hardware and geared to run big data applications.
WHAT IS HADOOP

• Hadoop is an open source distributed processing framework that manages data processing
and storage for big data applications running in clustered systems. It is at the center of a
growing ecosystem of big data technologies that are primarily used to support advanced
analytics initiatives, including predictive analytics, data mining and machine learning
applications. Hadoop can handle various forms of structured and unstructured data, giving
users more flexibility for collecting, processing and analysing data than relational databases
and data warehouses provide.
STORY BEHIND THE LOGO OF HADOOP

• The project’s creator, Doug Cutting, explains how the name came about:

“The name my kid gave a stuffed yellow elephant. Short, relatively easy to spell and pronounce,
meaningless, and not used elsewhere: those are my naming criteria. Kids are good at generating such.
Googol is a kid’s term.”
• This also explains the yellow elephant in the logo.

Source : Hadoop: Toddler Talk Provides Big Data Name


SOME IMPORTANT COMPONENTS OF HADOOP

• AMBARI
• PIG
• HIVE
• HBASE
• OOZIE
• MAPREDUCE
• HDFS
• SQOOP
• ZOOKEEPER
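
To give a feel for what a component such as MapReduce does, here is a minimal single-machine sketch in plain Python of the map, shuffle and reduce phases applied to a word count. It does not use any Hadoop API: the function names (map_phase, shuffle_phase, reduce_phase, word_count) and the sample documents are hypothetical, chosen only for illustration. On a real cluster the framework would spread these phases across many nodes.

```python
from collections import defaultdict


def map_phase(document):
    """Emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle_phase(pairs):
    """Group emitted values by key, as happens between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Sum the grouped counts for each word."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(documents):
    """Run map, shuffle and reduce in sequence on a list of documents."""
    pairs = []
    for doc in documents:
        pairs.extend(map_phase(doc))
    return reduce_phase(shuffle_phase(pairs))


if __name__ == "__main__":
    docs = ["big data is big", "data is everywhere"]  # made-up sample input
    print(word_count(docs))
```

The point of the split is that each map call and each reduce call only needs its own slice of the data, which is what makes the work easy to distribute.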
LET’S SEE BIG DATA FROM ANOTHER
PERSPECTIVE

• THROUGH AN EXAMPLE OF A RICE BOWL!


Hobbyist
  Byte : one grain of rice
  Kilobyte : cup of rice

Desktop
  Megabyte : 8 bags of rice
  Gigabyte : 3 semi trucks

Internet
  Terabyte : 2 container ships
  Petabyte : Covers a peninsula

Big Data
  Exabyte : Blankets South Asia
  Zettabyte : Fills the Pacific Ocean

The Future?
  Yottabyte : An Earth-sized rice ball!
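
The jump between units in the rice analogy can also be shown with numbers. The sketch below assumes decimal (SI) prefixes, where each unit is 1,000 times the previous one; operating systems often use 1,024 instead. The names UNITS and bytes_in are hypothetical helpers, not part of any library.

```python
# Unit names in increasing order; each step is a factor of 1,000 (SI assumption).
UNITS = [
    "byte", "kilobyte", "megabyte", "gigabyte", "terabyte",
    "petabyte", "exabyte", "zettabyte", "yottabyte",
]


def bytes_in(unit):
    """Return how many bytes (grains of rice) make up one of the given unit."""
    return 1000 ** UNITS.index(unit.lower())


if __name__ == "__main__":
    for name in UNITS:
        print(f"1 {name} = {bytes_in(name):,} bytes")
```

Under this assumption one yottabyte is 10 to the power 24 bytes, which is why the analogy ends with an Earth-sized rice ball.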
THE FUTURE OF BIG DATA (PREDICTIONS)

• By 2020, every person in the world will be creating 7 MB of data every
second. We have already created more data in the past couple of years than in
the entire history of humankind.
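
To put the per-second figure in perspective, the sketch below converts it into a daily volume per person. It takes the 7 MB per second quoted above at face value and assumes decimal units (1 GB = 1,000 MB); the function name daily_volume_gb is a hypothetical helper.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds


def daily_volume_gb(rate_mb_per_second):
    """Convert a steady data rate in MB/s into GB per day (1 GB = 1,000 MB)."""
    return rate_mb_per_second * SECONDS_PER_DAY / 1000


if __name__ == "__main__":
    # Figure quoted on the slide, assumed to hold around the clock.
    print(f"{daily_volume_gb(7):.1f} GB per person per day")
```

At that rate a single person would account for roughly 600 GB of data every day.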
1. MACHINE LEARNING WILL BE THE NEXT BIG THING
IN BIG DATA

• One of the hottest technology trends today is machine learning, and it will play a big part in
the future of big data as well. According to Ovum, machine learning will be at the forefront of
the big data revolution. It will help businesses prepare data and conduct predictive
analysis so that they can overcome future challenges more easily.
2. CHIEF DATA OFFICER: A NEW POSITION WILL EMERGE

• According to Forrester, chief data officer will emerge as a new position, and more
businesses will appoint one. Although the appointment of a chief data officer depends on
the type of business and its data needs, with the wider adoption of big data technologies
across enterprises, hiring a chief data officer will become the norm.
3. DATA SCIENTISTS WILL BE IN HIGH DEMAND

• As the volume of data grows, demand for data scientists, analysts and data management
experts will shoot up. The gap between the demand for data professionals and their
availability will widen.
4. INVESTMENTS IN BIG DATA TECHNOLOGIES WILL
SKYROCKET

• According to IDC analysts, “Total revenues from big data and business
analytics will rise from $122 billion in 2015 to $187 billion in 2019.” Business
spending on big data will surpass $57 billion this year. Although
business investments in big data might vary from industry to industry, the
increase in big data spending will remain consistent overall.
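
The two IDC revenue figures imply an average yearly growth rate, which can be worked out as a compound annual growth rate (CAGR). The sketch below assumes steady compounding over the four years from 2015 to 2019; the rate itself is derived here and is not quoted by IDC in the source. The function name cagr is a hypothetical helper.

```python
def cagr(start_value, end_value, years):
    """Compound annual growth rate: (end / start) ** (1 / years) - 1."""
    return (end_value / start_value) ** (1 / years) - 1


if __name__ == "__main__":
    # $122 billion in 2015 growing to $187 billion in 2019 (four years).
    rate = cagr(122, 187, 2019 - 2015)
    print(f"Implied growth: {rate:.1%} per year")
```

Under that assumption the forecast works out to roughly 11% growth per year.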
5. BIG DATA WILL BE REPLACED BY FAST AND
ACTIONABLE DATA

• According to some big data experts, big data is dead. They argue that businesses do not use
even a small portion of the data they have access to, and that bigger does not always mean
better. Sooner rather than later, big data will be replaced by fast and actionable data, which
will help businesses make the right decisions at the right time. Having tremendous amounts
of data will not give you a competitive advantage; how effectively and quickly you analyse
the data and extract actionable information from it will.
THANK YOU FOR YOUR TIME

OPEN FOR QUESTIONS.
