You are on page 1of 20

A Data Driven Approach for Exploring

YouTube Data Set

Batch 18:-
Under the guidance of:- K.L.Venkata Kumar(15761A0579)
Mr.G.V.Suresh M.Sai Kiran(15761A0582)
Associate Professor G. Harsha Sri(15761A0569)
A.Mohit Pavan(15761A0560)
Contents:-
 Introduction
 What is YouTube Data Analysis?
 Technique used
 Connection Establishment
 Results
 Advantages
 Applications
 Code
 Conclusion
 References
Introduction:-
The popularity of YouTube is because of ease of use and simplicity of
these systems for the creation, collaboration and sharing of resources
(images, videos) even from non-technical users . Current YouTube usage
statistics indicate the approximate scale of the site: at the time of this
writing there are more than 1 billion unique users viewing video content,
watching over 6 billion hours of video each month . Also, YouTube
accounts for 20% of web traffic and 10% of total internet traffic. YouTube
provides many social mechanisms to judge user opinion and views about a
video by means of voting, rating, favorites, sharing and negative
comments, etc. It is important to note that YouTube provides more than just
video sharing; beyond uploading and viewing videos, users can subscribe
to video channels and can interact with other users through comments.
What is YouTube Data Analysis?:-
.

YouTube is a video sharing service where users can watch , like ,


share , comment and upload their own videos. The video service can be
accessed on PCs , laptops , tablets and via mobile phones. Data analytics
(DA) is the process of examining data sets in order to draw conclusions
about the information they contain, increasingly with the aid of specialized
systems and software. Data analytics technologies and techniques are
widely used in commercial industries to enable organizations to make
more-informed business decisions and by scientists and researchers to
verify or disprove scientific models, theories and hypotheses.
Technique used:-
Cluster Analysis:
Clustering Analysis refers to the process of detecting data sets with similar attributes to learn
their similarities as well as differences in the data.

Regression Analysis:
In Regression Analysis, you can try to determine the dependency between attributes.
There is an assumption of a single-way causal effect from one attribute to the
response of another attribute.

Classification Analysis:
Classification Analysis refers to a systematic approach for gathering crucial and
relevant information about data. This data science technique can help the
business to determine which set of data can be used for further analysis

Association Analysis:
Association Analysis will allow the business to discover relevant associations between
different variables in a large-scale database.
Connection Establishment:-
Step 1:Extracting the API key from Google Developer Console
The first thing you need to do is to extract the API key from Google Developer
Console. In order to do so, you will need to activate your account for GDC. Once
you have done this, you need to create a new Project
Once the project is created, you will automatically be re-directed to the dashboard
for the project
Once at this stage, you need to create a key by going in APIs & auth section. In the
APIs sub-section, switch on YouTube Data API v3, in case this API is OFF. Also,
please note the limit Google has put on the number of units / day – it should be
sufficient for any non-commercial data extraction. Next, go to the Credentials
sub-section and select “Create new key” to create a Public API access.

You should keep the key generated handy, as this would be required to be replaced in
your Python code.
Results:-
Univariate Analysis:-
Results(cont..):-
Results(cont..):-
Bivariate Analysis:-
Results(cont..):-
Results(cont..):-
Results(cont..):-
Multivariate Analysis:-
Advantages:-
We can get an overview of the following data:
1.Likes
2.Dislikes
3.Views
4.Comments
5.Subscribers
YouTube Analytics also gives you insights, So that it can be helpful to see if
we are getting to the Right people.
Applications:-

1.Communcations,Media & Entertainment


2.Education
3.Manufacturing
4.Transportation
Code:-
Code(Cont..):-
Conclusion:-
With the help of YouTube Analytics, organizations can
identify social leads, influencers on a daily basis. The potential leads
can be segregated into different segments based on the conversation
themes and tonality. This helps to understand the demographics of
prospects and influencers. Generally, the social media experts / analysts
verify the leads and sort them into different segments as per business
requirement and create personalized communication strategies.
YouTube is a powerful way to grow your followers, gain trust, and
increase overall revenue by acquiring more Subscribers, Likes. Using
YouTube Analytics, one can definitely increase the ability to grow to the
maximum potential through social networks.
References:-
1.Alhabash, S.; Baek, J.-h.; Cunningham, C.; and Hagerstrom, A.2015. To Comment or
Not to Comment?: How Virality, Arousal Level, and Commenting Behavior on YouTube
Videos Affect Civic Behavioral Intentions. Computers in Human Behavior

2. Arapakis, I.; Lalmas, M.; Cambazoglu, B. B.; Marcos, M.-C.; and Jose, J. M. 2014.
User Engagement in Online News: Under the Scope of Sentiment, Interest, Affect, and
Gaze. Journal of the Association for Information Science and Technology.

3. Baym, N. K. 2013. Data Not Seen: The Uses and Shortcomings of Social Media
Metrics. First Monday.

4. Berger, J., and Milkman, K. L. 2012. What Makes Online Content Viral? Journal of
Marketing Research.

5. Chatzopoulou, G.; Sheng, C.; and Faloutsos, M. 2010. A First Step Towards
Understanding Popularity in YouTube. In Proc. Of INFOCOM.

You might also like