Professional Documents
Culture Documents
Page 1 of 9
Golam Kaderye
Lecturer (Dept. of CSE), IUS
Website: https://sites.google.com/view/golamkaderyeLecture No 1
VOLUME
Within the Social Media space for example, Volume refers to the amount of data generated through
websites, portals and online applications. Especially for B2C companies, Volume encompasses the available
data that are out there and need to be assessed for relevance. Consider the following -Facebook has 2 billion
users, YouTube 1 billion users, Twitter 350 million users and Instagram 700 million users. Every day, these
users contribute to billions of images, posts, videos, tweets etc. You can now imagine the insanely large amount
-or Volume- of data that is generated every minute and every hour.
VELOCITY
With Velocity we refer to the speed with which data are being generated. Staying with our social media
example, every day 900 million photos are uploaded on Facebook, 500 million tweets are posted on Twitter, 0.4
million hours of video are uploaded on YouTube and 3.5 billion searches are performed in Google. This is like
a nuclear data explosion. Big Data helps the company to hold this explosion, accept the incoming flow of data
and at the same time process it fast so that it does not create bottlenecks.
VARIETY
Variety in Big Data refers to all the structured and unstructured data that has the possibility of getting
generated either by humans or by machines. The most commonly added data are structured -texts, tweets,
pictures & videos. However, unstructured data like emails, voicemails, hand-written text, ECG reading, audio
recordings etc., are also important elements under Variety. Variety is all about the ability to classify the
incoming data into various categories.
Page 2 of 9
Golam Kaderye
Lecturer (Dept. of CSE), IUS
Website: https://sites.google.com/view/golamkaderyeLecture No 1
Page 3 of 9
Golam Kaderye
Lecturer (Dept. of CSE), IUS
Website: https://sites.google.com/view/golamkaderyeLecture No 1
FIGURE 1-2 Examples of what can be learned through genotyping, from 23andme.com
Page 4 of 9
Golam Kaderye
Lecturer (Dept. of CSE), IUS
Website: https://sites.google.com/view/golamkaderyeLecture No 1
Page 5 of 9
Golam Kaderye
Lecturer (Dept. of CSE), IUS
Website: https://sites.google.com/view/golamkaderyeLecture No 1
Unstructured data:
Data that has no inherent structure.
Example:
Text documents
PDFs
Images
Video
Page 6 of 9
Golam Kaderye
Lecturer (Dept. of CSE), IUS
Website: https://sites.google.com/view/golamkaderyeLecture No 1
Page 7 of 9
Golam Kaderye
Lecturer (Dept. of CSE), IUS
Website: https://sites.google.com/view/golamkaderyeLecture No 1
Page 8 of 9
Golam Kaderye
Lecturer (Dept. of CSE), IUS
Website: https://sites.google.com/view/golamkaderyeLecture No 1
Structured data:
Data containing a defined data type, format and structure.
Examples:
✓ Transaction data
✓ Online Analytical Processing (OLAP) data cubes
✓ RDBMS
✓ CSV files
✓ Spread-sheets (MS Excel)
ThAnKyOU
Page 9 of 9