Professional Documents
Culture Documents
Apache Flink
Apache Flink is called 4G of Big Data. It is an open source
framework that can handle streaming as well as batch data.
6. Figures Of Big Brands
6.1. Facebook
Because of more than 950 million users, Facebook is collecting a
huge amount of data. Every time whenever you are clicking a
notification, visiting a page, uploading a photo, or checking out a
friend’s link, you’re generating data for the company to track
various records.Users shared 2.5 billion content items daily (status
updates + wall posts + photos + videos + comments). 300 million
photos are uploaded by users per day. 105 terabytes of data scanned
via Hive, Facebook’s Hadoop query language in every 30 minutes.
70,000 queries executed on these databases per day. 500+terabytes
of new data ingested into the databases every day.
6.2. Twitter
Twitter – the second biggest social network generating less social
data as compared to dating app, Tinder. Tinder users swipe 290,278
matches per minute – that is potentially 35 million lovers per hour!
on the other hand, twitter users generate 347,222 Tweets each
minute – or 21 million Tweets per hour.
6.3. Youtube
The video is a big part of our everyday lives on the internet, and
although Facebook is also trying really hard to fit in and it is
succeeding, with over 3 billion video views per day but YouTube is
still the king. Every minute users are uploading over 300 hours of
new video on YouTube.
3. Skill enhancement
Learning Big Data can be your best investment and can reward you
with skills that you require not only working for big data but in your
day to day life. In general, the domain of Big Data Analytics is full of
unsolved problems and puzzles to solve, which can greatly enhance
your analytical skills and reasoning. Big Data involves statistics and
problem-solving skills which are useful and highly practical for you
even if you don’t intend to make a career in Big Data.
Data Engineer
Data Engineer is a link between Data Scientists and business
executives. They are responsible for communicating data scientists
about business goals so that they can work accordingly to achieve
the objectives. Data Engineer also handles a vast amount of raw
data and evaluate new data sources.
Data Analyst
The problem solver who analyzes data systems, creates automated
systems to retrieve information from the database and compile
reports is the Data Analyst.
Data Scientist
The work of a data scientist is to analyze the raw data which can be
structured and unstructured to derive information which is used by
business leaders to take important decisions impacting business
growth.
Data Architect
Those members of the Big Data team who understand all the aspects
of database design are Data Architects. They collaborate with big
data engineers to create data workflows and are responsible for
designing and testing new database prototypes.