
A TWITTER-BASED ANALYSIS OF THE MOST IN-DEMAND
PROGRAMMING LANGUAGE FOR JOB RECRUITMENT

Prepared for: Zain Saleem

Muhammad Daniyal Mateen (BITF19A517)
Muhammad Basiq Naeem (BITF19A542)
Hamza Habib (BITF19A540)
Hussain Attique (BITF19A523)

Executive Summary

The purpose of this report is to identify which programming language offers the greatest job
recruitment opportunities, based on an analysis of Twitter. Because thousands of programming
languages exist, many people choose one that is poorly suited to their career; this report aims
to guide them toward a better choice. We selected six widely demanded languages: Java, PHP,
C-Family, JavaScript, Python and Objective-C. In the methodology, various software tools and
websites were used to collect tweets about the demand for each language, and books and
conference proceedings were consulted for background information. A statistical analysis was
also carried out to produce interest graphs and a table of the total tweets collected for each
language. The results indicate that Java is the dominant and most in-demand language for job
recruitment. We conclude that, on Twitter, the largest share of job-related tweets concern Java;
the remaining languages also have a large number of tweets, but Java ranks first. To our
knowledge, no research of this kind has previously been done using social platforms. We
recommend further work to check whether social platforms are an acceptable basis for measuring
the popularity of programming languages.

Table of Contents

1 Introduction
1.1 Background Information
1.2 Statement of the problem
1.3 Significance of the study
1.4 Scope of the study
1.5 Limitations of the study

2 Review of related literature

3 Methods of study

4 Results and Discussions
4.1 Table
4.2 Interest graph relevant to job demands
4.3 Demanding language Pie Chart

5 Conclusion

6 Recommendations

7 References

8 Appendices
8.1 Developers in world column chart

1. Introduction:

Computer programming languages allow us to give instructions to a computer in a language
the computer understands. Just as many human languages exist, there is an array of computer
languages that programmers can use to communicate with a computer.

1.1. Background Information:

With so many computer programming languages available, which one is used the most? Which
programming language to learn is a question that many developers face during their careers.
The appeal of a programming language is largely determined by the market demand for
developers who use it. In the last nine years, social networks have been used as a recruitment
option. Twitter is a social networking service that enables its users to exchange text-based
messages known as tweets, of up to 140 characters (280 since November 2017). The Twitter
stream contains useful information for tracking or even forecasting behaviour, provided it is
extracted in an appropriate manner. Twitter currently has 350 million active users.

1.2. Statement of the problem:

• How popular are the various programming languages in relation to the search for software
developers on Twitter?
• Is it possible to make a predictive analysis using data collected from Twitter?

1.3. Significance of the Study:

This report will inform readers which programming language is most widely used and in
demand, so that they can build their careers more easily. Many people choose a programming
language without knowing much about it; this report gives them a working knowledge of several
in-demand programming languages. The report also makes readers aware of social recruitment
options and presents the information about in-demand programming languages that is available
on social networks.

1.4. Scope of the study:

Because thousands of programming languages exist, information cannot be provided for all
of them. This report covers only the following languages:

a) Java b) Objective-C
c) JavaScript d) PHP
e) C-Family f) Python

1.5. Limitations of the Study:

Several studies have analyzed the diffusion, trends, and popularity of programming
languages. Some define indexes that try to measure the popularity of programming languages,
but very few have tried to analyze popularity using social networks. The TIOBE Programming
Community Index is a measure of the popularity of programming languages, calculated from the
number of search engine results for queries containing the name of the language. The index
covers searches in Google, Google Blogs, MSN, Yahoo!, Baidu, Wikipedia and YouTube, and is
updated once a month. It does not report popularity for SQL or HTML. The TIOBE index is not
about the best programming language; it is about the popularity of, or demand for, a
programming language.

2. Review of related Literature:

The following work has already been done in our area of research:

❖ PYPL is a popularity index created by analyzing how often language tutorials are
searched for on Google; the more often tutorials for a language are searched, the more
popular the language is assumed to be.
❖ Delorey et al. used survey methodology to identify the factors that lead to
language adoption. They analyzed large datasets, including over three hundred
thousand SourceForge projects and multiple surveys of two thousand to twenty
thousand programmers.
❖ Chen et al. reviewed the history of a group of programming languages and
showed that two kinds of factors, intrinsic and extrinsic, can affect the
evolution of programming languages.
❖ Karus et al. investigated the revisions of twenty-two OSS projects and tracked
the evolution of multiple programming language usage.

These indexes and studies are used to determine the popularity of programming languages.
Software in IEEE.

3. Methods of Study:

To analyze the popularity of programming languages, tweets about job recruiting activity
were collected from Twitter between 3 January 2016 and 3 September 2019 using Twitter4J, a
Java library for the Twitter Application Programming Interface (API) available from
Twitter4J.org. Tweets related to job recruitment were kept, and tweets not connected to job
recruitment were deleted. Tweets were collected using the following pattern:

name_of_programming_language+developer+job

An example of an irrelevant tweet is:

A recruiter just pinged me about an opportunity as a #Java application developer.
Tempting, but I think I’ll stick with the current job.
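
The collection pattern and the removal of irrelevant tweets can be sketched as follows. This is a minimal illustration in Python, not the Twitter4J code used in the study. The function names `build_query` and `looks_like_job_offer` and both keyword lists are hypothetical, since the report does not state the actual filtering rule.

```python
# Hypothetical phrases suggesting a personal remark rather than a job offer.
PERSONAL_MARKERS = ("pinged me", "my current job", "the current job")

# Hypothetical phrases suggesting a recruitment announcement.
OFFER_MARKERS = ("hiring", "we are looking for", "apply", "vacancy", "job opening")


def build_query(language: str) -> str:
    """Build a search string following the pattern language+developer+job."""
    return f"{language}+developer+job"


def looks_like_job_offer(tweet: str) -> bool:
    """Keep a tweet only if it reads like an announcement, not a personal remark."""
    text = tweet.lower()
    if any(marker in text for marker in PERSONAL_MARKERS):
        return False
    return any(marker in text for marker in OFFER_MARKERS)


if __name__ == "__main__":
    print(build_query("Java"))
    print(looks_like_job_offer("We are hiring a Java developer, apply now"))
```

A keyword rule of this kind is only a rough first pass; a real filter would need to be checked against a hand-labelled sample of tweets.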

We performed a statistical analysis using Google Trends to show the interest in
each programming language from 3 January 2016 to 3 September 2019. The interest in every
programming language is calculated worldwide.
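
Google Trends reports interest on a relative scale, where 100 marks the highest point of the series for the chosen period and region. The sketch below shows that kind of scaling in Python. The monthly values are invented for illustration and are not data from this study, and `scale_to_peak` is a hypothetical helper, not part of any Google API.

```python
def scale_to_peak(values: list[float]) -> list[int]:
    """Rescale a series so that its highest point becomes 100."""
    peak = max(values)
    if peak == 0:
        # A series with no interest at all stays at zero.
        return [0 for _ in values]
    return [round(100 * v / peak) for v in values]


if __name__ == "__main__":
    # Invented example series (not study data).
    raw_interest = [40.0, 52.0, 80.0, 60.0]
    print(scale_to_peak(raw_interest))
```

Because each series is scaled to its own peak, values from separately scaled graphs are not directly comparable between languages.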

4. Results and Discussions:

This section presents the collected tweets, the interest graphs and a chart to show the
popularity of each programming language.

4.1. Table:

Language       Collected Tweets    Period
Java           1,125,878           3 Jan 2016 - 3 Sep 2019
JavaScript       102,139           3 Jan 2016 - 3 Sep 2019
C-Family         239,885           3 Jan 2016 - 3 Sep 2019
Objective-C        3,850           3 Jan 2016 - 3 Sep 2019
PHP              417,346           3 Jan 2016 - 3 Sep 2019
Python            57,667           3 Jan 2016 - 3 Sep 2019

Job-related tweets collected for each programming language.
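
The counts in the table can be turned into a total and a share per language. The sketch below, in Python, uses only the figures from the table; the helper name `tweet_shares` is hypothetical. The total it computes, 1,946,765 tweets, matches the dataset size reported in the conclusion.

```python
# Job-related tweet counts copied from the table above.
TWEETS = {
    "Java": 1_125_878,
    "JavaScript": 102_139,
    "C-Family": 239_885,
    "Objective-C": 3_850,
    "PHP": 417_346,
    "Python": 57_667,
}


def tweet_shares(counts: dict[str, int]) -> dict[str, float]:
    """Return each language's share of all collected tweets, in percent."""
    total = sum(counts.values())
    return {language: 100 * n / total for language, n in counts.items()}


total = sum(TWEETS.values())
shares = tweet_shares(TWEETS)
# Languages ordered from the largest share to the smallest.
ranking = sorted(shares, key=shares.get, reverse=True)

if __name__ == "__main__":
    print(f"Total tweets: {total:,}")
    for language in ranking:
        print(f"{language:<12} {shares[language]:6.2f}%")
```

On these figures Java accounts for a little under 58% of the collected tweets, followed by PHP and the C-Family.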

4.2. Interest Graphs relevant to Job demands:

Each graph has a vertical axis running from 0 to 120.

[Figure: Interest graph of Java from January 2016 to September 2019.]

[Figure: Interest graph of C-Family from January 2016 to September 2019.]

[Figure: Interest graph of Objective-C from January 2016 to September 2019.]

[Figure: Interest graph of PHP from January 2016 to September 2019.]

[Figure: Interest graph of JavaScript from January 2016 to September 2019.]

[Figure: Interest graph of Python from January 2016 to September 2019.]

4.3. Chart:

[Figure: Pie chart of the most in-demand programming languages, 2016-2019.]


5. Conclusion:

In this report, we described the findings of our empirical study, which analysed a dataset
of 1,946,765 tweets. These tweets are announcements requesting developers with expertise in a
particular programming language. We considered the following programming languages: Java,
JavaScript, C-Family, Objective-C, PHP and Python. The main goal of the study was to find the
most popular language on Twitter, a popular social network, by analyzing specific tweets.
According to our analysis and statistical findings, Java is dominant, and the pie chart shows
clearly that it is the most in-demand language. Various assumptions about the popularity of
programming languages exist in the literature, but none of them considers social network
analysis.

6. Recommendations:

Our study shows that long-established programming languages such as the C-Family are still
strongly required by software companies. We confirm that Java is the most popular programming
language, but JavaScript, Python and PHP are also gaining much attention. With this report, we
have demonstrated that it is possible to obtain accurate results on programming language
popularity using Twitter. Further work is needed to establish whether a social network factor
can be included in a programming language popularity index. Further research might explore
Twitter geolocation data in depth to understand in which parts of the world particular
developers are required, and a future study investigating tweets written in different languages
would provide a more detailed analysis.

7. References:

Paulson, Linda Dailey. (2007). “Developers Shift to Dynamic Programming Languages”.
New York, USA: IEEE.

(2013) “Computer Software and Applications Conference (COMPSAC)”, Kyoto, Japan.

(2016) “Computer Software and Applications Conference (COMPSAC)”, Atlanta, Georgia, USA.

(2018) “Computer Software and Applications Conference (COMPSAC)”, Tokyo, Japan.

URL http://www.wikicfp.com/cfp/program?id=548&f=Computer.

(2016) “Object-oriented Programming, Systems, Languages, and Applications (OOPSLA)”,
Amsterdam, Netherlands.

(2017) “Object-oriented Programming, Systems, Languages, and Applications (OOPSLA)”,
Vancouver, Canada.

(2018) “Object-oriented Programming, Systems, Languages, and Applications (OOPSLA)”,
Boston, MA.

(2019) “Object-oriented Programming, Systems, Languages, and Applications (OOPSLA)”,
Athens, Greece.

URL http://www.sigplan.org/Conferences/OOPSLA/.

Miryung Kim, Romain Robbes, Christian Bird. (2016) “IEEE Working Conference on
Mining Software Repositories (MSR)”, TX, USA.

Jesus M. Gonzalez-Barahona, Abram Hindle, Lin Tan. (2017) “IEEE Working Conference on
Mining Software Repositories (MSR)”, Buenos Aires, Argentina.

Andy Zaidman, Yasutaka Kamei, Emily Hill, (2018) “IEEE Working Conference on Mining
Software Repositories (MSR)”, Gothenburg, Sweden.

Margaret-Anne D. Storey, Bram Adams, Sonia Haiduc (2019) “IEEE Working Conference
on Mining Software Repositories (MSR)”, Montreal, Canada.

URL https://dblp.org/db/conf/msr/index.

Scott Christley, Maria Letizia Jaccheri, Charles M. Schweik, Robert English, Audris
Mockus, L. Harjumaa, Mikko Rajanen, Netta Iivari, Henrik Hedberg, Greg Madey. (2007)
“FLOSS Research and Development”.

Antonio Pecchia (2009) “International Journal of Open Source Software and Processes
(IJOSSP)”. Quarterly.

Charles S. Davis (2002) “Statistical Methods for the Analysis of Repeated Measurements”.
Springer.

8. Appendices:

Intrinsic factors are the factors that can be used to describe the general design criteria of
programming languages. Extrinsic factors are the factors that are not directly related to the general
attributes of programming languages, but still can affect their evolution.

8.1. Column Chart of total developers in the world:


According to the column chart, Java has 10 million developers, PHP has 9 million, the
C-Family (C, C++, C#) has 8 million, JavaScript has 4.5 million, Python has 3.5 million and
Objective-C has 2.8 million.

[Figure: Column chart titled "Developers of every language in world". Bars: Java 10,000,000;
PHP 9,000,000; C Family 8,000,000; JavaScript 4,500,000; Python 3,500,000;
Objective C 2,800,000.]

Total active developers of each language in the world.
