
Involvement of machine learning in improving information access: A Systematic Review

Abstract

Data and its management are very important for the effective growth of industries because they support effective decision-making. Machine learning should be implemented to improve information access, since data quality must be handled carefully. Every industry needs to implement suitable algorithms through which machine learning can be put into practice. This research suggests two common algorithms that can be implemented in almost every field and industry: Random Forest and Support Vector Machine (SVM). The areas in which machine learning can be implemented are also explored so that better results can be achieved. Education, medicine, business and many other areas are identified as sectors in which machine learning on data is effective. Qualitative data is collected through a literature review, and the relevant information is analyzed using content analysis, thematic analysis and discourse analysis.

Keywords: Machine Learning, Random Forest, Support Vector Machine, content analysis, thematic analysis, discourse analysis.

INTRODUCTION

Background

In today's competitive world, because of advancement in information technology and telecommunication, the need for data is increasing, and the expanded channels have brought several opportunities as well as challenges. Data plays an important role in handling all tasks of different organizations efficiently. Reaching customers through data can enhance marketing campaigns and other efforts related to personalization and fundraising (Cioffi et al., 2020), but the data acquired must be correct and relevant. Managing data quality is equally important for companies. There are so many data entry points that it becomes difficult for companies to ensure that no inaccurate data enters. If no appropriate algorithms exist to ensure that the data entered is accurate, it becomes difficult to obtain the advantages of data. Thus, artificial intelligence or machine learning-based algorithms must be implemented to enhance the overall performance of companies by managing the data appropriately and ensuring that the accuracy rate of the data entered is higher when compared with other services. Data quality is important for resolving problems and enhancing decision-making processes. It also helps fulfil the requirements of different companies in terms of security. Different dimensions are recognized so that the quality of data can be defined using different attributes. The data need to be complete so that they meet expectations and support effective decisions. Machine learning algorithms help ensure that data is complete. These algorithms are also helpful in maintaining the consistency of the data, so that everyone accesses the same information, which helps in reaching better decisions. Accuracy is one of the important
qualities of data, and the main focus of all machine learning algorithms is on data accuracy. Data availability is also important if decisions are to be made based on the data. An appropriate form of data is necessary for the algorithms, and it must be ensured that the proper format, range and definition are followed. Thus, data quality and effective management are very important for the growth of a company in any sector. Machine learning and master data management can be linked together so that better cooperation between AI and machine learning can be achieved. Machine learning algorithms are therefore helpful in enhancing accuracy, consistency and manageability across different sectors (Bansal et al., 2015). It is therefore important to conduct research that can provide the required algorithm for all industries, since a common algorithm is easy to handle and makes data management an easier task. This research document proposes an algorithm that enhances information accessibility and performs better with accurate data. The proposed algorithm is helpful for industries such as healthcare, retail, banking, supply chain, education, social media and other sectors. The common algorithm is analyzed against the algorithms that different authors have provided, each for a single industrial sector. The research conducted by different authors is explored first, and then the gap between these studies is analyzed; to fill the gap, this research is proposed and a better answer to the issues is provided by suggesting a common algorithm.

Related work

Many researchers have carried out research in this area in order to provide better algorithms for different applications and companies. Agaoglu (2016) presented a classifier through which real and fake data can be differentiated to enhance data accuracy. Wuest et al. (2016) focused on the current situation of social media and on algorithms that help detect spam, which is important given the increasing number of users on social media and the greatly reduced chance of accurate data. Najafabadi et al. (2015) provided an analysis for CRM so that a better understanding of demography can be achieved. Sentiment analysis and classification approaches are used so that data can be reviewed and decisions made effectively. This shows that machine learning for data management plays an important role in handling the activities of different organizations. Smart devices are associated with the organization, as customers access information from different devices. This has increased the need for proper management of data to ensure that the company has quality data for its decision-making process. Calandra et al. (2012) also analyzed different applications related to voice and video recognition so that better algorithms are provided for advancement. The dimensions of networking in e-commerce and net banking can be improved so that better outputs can be obtained through data management. The user experience is also enhanced and customer needs are better met. Moreover, access to information through machine learning has helped to enhance the performance of the
customers to a greater extent. Cusumano (2005) focused on the healthcare sector, stating that data need to be handled with appropriately streamlined machine learning algorithms under which the data is analyzed properly. Even in the current Covid-19 scenario, effective data management would make it easy to calculate distances and collect related information. This has increased the need for machine learning, so that information can be obtained from the data more easily and enhanced outputs are provided. Salakhutdinov & Hinton (2009) provided algorithms that are applicable in the education sector, where different activities of the industry are handled with the algorithm provided. Quality assurance needs to be achieved across private and public schools, and a learning-based model can be achieved with the help of machine learning when it is implemented in different scenarios related to the education sector. Many works have focused on the handling of data through machine learning in the current Covid-19 scenario. Hinton & Salakhutdinov (2010) stated that information obtained using ML algorithms can be used to capture sentiment as well as psychological signals. This has also increased the need for the implementation of artificial intelligence, through which understanding can be enhanced.

Gap Analysis

The above description of different articles shows that research has been conducted on machine learning for better management of data. Many studies have been conducted by different authors, and they have presented algorithms through which machine learning can be implemented successfully for data management. However, they have all provided algorithms either for an individual industry or for an individual sector. Therefore, the need arises for research under which a single algorithm can be implemented across various industrial sectors (Salton & Buckley, 1988). This increases the capability of resources and makes data management more efficient. Access to information becomes easier, and development in any sector can be achieved more readily. The quality of data can also be maintained more easily if a single algorithm provides complete information on how to handle data accurately. This also helps to enhance the accuracy rate of the data. Additional resources are not required if communication across industries is maintained (Bengio, 2013). Therefore, a single algorithm is more important when different industries are considered. This research focuses on a single algorithm that can be implemented in different sectors such as healthcare, retail, banking, supply chain, education, social media and others. This helps to close the gap recognized in the studies conducted by various authors. This research also identifies different areas in which machine learning algorithms can be implemented, to enhance the knowledge of readers. The focus is entirely on the different areas of implementation for machine learning and on a single algorithm that can be implemented in different sectors, so that issues related to machine learning can be resolved easily.
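The data quality dimensions named in the Background (completeness, proper format, valid range) can be made concrete with a small rule-based check. The following Python sketch is only an illustration under stated assumptions: the record schema, the field names, the email pattern and the age range are all hypothetical and do not come from the reviewed articles.

```python
import re

# Hypothetical schema: these field names are assumptions for illustration only.
REQUIRED_FIELDS = ("customer_id", "email", "age")
# Deliberately simple pattern; real email validation is more involved.
EMAIL_PATTERN = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def quality_issues(record):
    """Return a list of data-quality problems found in one record."""
    issues = []
    # Completeness: every required field must be present and non-empty.
    for field in REQUIRED_FIELDS:
        if record.get(field) in (None, ""):
            issues.append(f"missing {field}")
    # Format: a non-empty email must match the expected pattern.
    email = record.get("email")
    if email and not EMAIL_PATTERN.match(email):
        issues.append("email has invalid format")
    # Range: an integer age must fall inside an assumed plausible interval.
    age = record.get("age")
    if isinstance(age, int) and not 0 <= age <= 120:
        issues.append("age out of range")
    return issues


def clean_fraction(records):
    """Fraction of records that pass every check (0.0 for an empty list)."""
    if not records:
        return 0.0
    clean = sum(1 for record in records if not quality_issues(record))
    return clean / len(records)


if __name__ == "__main__":
    sample = [
        {"customer_id": 1, "email": "a@example.com", "age": 30},
        {"customer_id": 2, "email": "not-an-email", "age": 150},
        {"customer_id": 3, "email": "", "age": 25},
    ]
    for record in sample:
        print(record["customer_id"], quality_issues(record))
    print("clean fraction:", clean_fraction(sample))
```

Checks of this kind only flag records at the point of entry; a learned model, as discussed in this review, would be needed to catch errors that fixed rules cannot express.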
Research Questions

It is important for any research to answer relevant research questions, as this enhances the value of the research. It also attracts readers, who mainly focus on what they can gain from reading the research. The most relevant research questions that can be answered through this research are listed below:

• What are the main areas in which machine learning can be implemented so that better information access can be achieved?
• What is the importance of data management through machine learning in different sectors?
• Which algorithms can be implemented in different industries such as healthcare, retail, banking, supply chain, education, social media and other sectors so that better information access can be achieved?

METHODS

Data collection method

Since no advanced calculations are needed for the proposed research, the most suitable method for collecting data related to machine learning is a literature review. This means that qualitative data is required for conducting the research. In the literature review, different articles are obtained from journals available online and are reviewed. The sources included for the research are IEEE Xplore, ResearchGate, ScienceDirect and others. All the articles accessed are peer-reviewed (Paradis et al., 2016). Keywords such as machine learning, information access, data quality and data management are used to collect the relevant information. Group discussions are held with the team members so that relevant information can be collected and presented to the readers. Observations are also conducted so that all relevant information related to the research is gathered. Since the research question focuses on a single algorithm for different industries, it is important that articles describing algorithms for various sectors are chosen and that data is collected from these articles. The resources needed for conducting the research include the internet and various journal articles (Aguinis et al., 2019). Google Scholar can also be searched for scholarly articles, and experiences and ideas are explored in detail; an in-depth analysis helps achieve the relevant results (Dahl et al., 2012).

Data analysis

Once the data has been collected from the different articles and noted down, the next step is to analyze it. In this research, three different methods are used for analyzing the data collected. These three methods are described below:

• Content analysis: the algorithms are divided based on sectors. Different meanings of words, phrases and sentences are discussed so that a clear view of the information can be achieved. This supports a better understanding of the data and of machine learning. It enhances the knowledge through which the research questions can be answered effectively (Hase, 2021). An in-depth analysis of the content is done so that better results can be achieved.
• Thematic analysis: the next method used for analyzing the collected data is the coding and examination of the data so that broad themes and patterns can be identified. With this method, the focus is entirely on the areas of machine learning and the algorithms that are common across different industrial sectors. Machine learning algorithms are selected as the themes, and they are analyzed to gain knowledge related to different sectors ("Capturing student learning with thematic analysis", 2017).
• Discourse analysis: the communication and meanings of the content are analyzed so that an effective algorithm can be decided on. Before a particular algorithm is decided on, all other options are explored in detail, and this helps in noting down the results of the research (Muller, 2015). All issues and exposures are explored before making the final decision.

Table of Authors missing

RESULTS

After collecting data from different articles and analyzing them, the results achieved are discussed below.

Areas in which machine learning can be implemented

The main areas in which machine learning for data management proves beneficial and helps in achieving relevant targets are listed below:

• Image Recognition: this is one of the most common applications of machine learning. It recognizes objects, graphics, images and more, and further actions are performed on that basis. For example, automatic friend-tag suggestions are provided by different websites, and this information is based entirely on the images present. Many social media websites use this feature, and Google has used a similar algorithm (Righi et al., 2016). The Google search engine can take an image as input and present all relevant information in one click. This is directly associated with the social media industry.
• Speech Recognition: to enhance communication in business, different companies make use of online or offline applications that provide a "Search by voice" facility, which is only possible through the application of machine learning. The algorithms related to speech recognition need to make sure that they understand the query correctly and respond accordingly (YOU & MA, 2017). Data accuracy plays an important role in this process, as information accessibility increases to a greater extent. Google Assistant, Siri, Cortana and Alexa use speech recognition and have algorithms based on machine learning.
• Traffic prediction: in the security industry or the automobile industry, the correct path and the shortest routes are very important for the execution of operations. For this the
information needs to be obtained from traffic data, which can only be achieved by making use of machine learning algorithms (Rodríguez et al., 2020). Google Maps and sensors are the best applications that can be used for enhanced output of the operations.
• Email Spam and Malware Filtering: in the computing field, spam recognition and malware filtering are very important so that better and more advanced output can be achieved. Data plays an important role here, as filters can be applied only if detailed information related to the users is present. Machine learning plays an important role in this context (Broadhurst & Trivedi, 2018).
• Trading: the risk of stocks moving up and down always exists, but predictions for traders can be made and help to present the relevant information. The information is based entirely on market-trend data, and trading decisions are made accordingly. Therefore, machine learning is very important for advanced predictions related to trading.
• Medical Diagnosis: based on the available data, decisions related to diseases are made and diagnoses are carried out effectively for future diseases. This has enhanced the health of patients, as accurate information related to the diseases is provided to them (Z Zaghloul, 2012). Based on 3D printing and data availability, medical treatment can be provided easily. This enhances the health of the patients.
• Banking sector: the satisfaction level of customers can be enhanced, as user data is available and predictions are made accordingly. If advanced banking services are provided to customers, it is easier to achieve better financial growth of the country.
• Plagiarism checkers: this is beneficial for the education sector, as plagiarism checker platforms such as Turnitin make use of ML at their core to detect plagiarized content (Shafer, 2014). This helps educational departments keep records in a definite manner.

Common Algorithm

It is imperative that industries make use of the right, common algorithms so that they can perform different operations on correct data. The two algorithms that can be applied in all industries are Random Forest and SVM.

• Random forest: this is a flexible machine learning algorithm that produces reliable results, and because of its ease of use and simplicity it is one of the most popular algorithms. It can be used for regression and classification purposes. It builds different decision trees and combines them so that they provide stable and accurate predictions. Thus, this algorithm is beneficial in all industrial sectors, as the predictions made using this algorithm are beneficial in all
terms. The biggest advantage of this algorithm is that it can be used for both classification and regression tasks. The relative importance it assigns to features helps in selecting better input features (Mantas et al., 2018). It provides reliable results, as the predictions it makes are accurate and can easily be relied upon. The main disadvantage of this algorithm is that it requires a large number of decision trees to be constructed. The Random Forest algorithm supports data quality and can be used in banking, e-commerce, medicine and the stock market. Fraud recognition can also be achieved effectively by companies. In the social media industry, it is beneficial for making predictions and generating suggestions for customers based on the different activities performed by users in the past. Random forest is used in the medical field so that the correct medicine components can be detected, medical history can be analyzed and the frequency of disease can be determined (Deldar et al., 2021). This helps to provide better treatment. Stock behaviour can also be determined, and better decisions related to purchases can be made.
• Support vector machine (SVM) algorithm: the other common algorithm that is preferable for all industries is SVM. This supervised machine learning algorithm is applicable to classification as well as regression; the classification of unseen data is its primary aim. Prediction and calculation are simplified, as the algorithm can be represented graphically. The decision-making process becomes an easier task in that case. Through this algorithm, text and hypertext can be categorized. Training on data is important, as it helps to classify documents into different categories (kumar, 2017). These categories are based on the scores that are generated, and the comparison is made on the highest value. SVMs play an important role in the recognition of handwriting, which is helpful in the validation of various documents. Genes can be classified and biological problems in patients detected; for example, the SVM algorithm has been used to detect protein remote homology. This can be recognized as a big achievement. Enhanced search accuracy is achieved, as images are classified based entirely on image recognition. The algorithms identify images with higher accuracy. The potential of students is identified using this algorithm, and decisions related to their placement are then made through the proper implementation of machine learning and data management.

DISCUSSION

A new direction for data analytics and machine learning is representation learning. Modelling tasks are categorized into unsupervised learning and supervised learning. Under unsupervised learning, descriptive models are
used to identify the structure inside the input data and then to monitor the distribution process. By contrast, under supervised learning, regression and classification are performed. A functional mapping from inputs to outputs is built, which has increased the accuracy rate of data. The quality of the crucial variables is very important for the success of industrial processes. This has also increased the value of fast-rate process variables. Domain-specific knowledge is important for building the model, and it is very important for enhancing the performance of the model.

Supervised machine learning algorithms can apply what has been learned in the past to new data, using labelled examples to predict future events. Based on the analysis of a known set of training data, the learning algorithm produces an inferred function that makes it possible to predict output values. After sufficient training, the system can provide targets for each new input. The learning algorithm can also compare its output to the correct expected results and find errors in order to modify the model accordingly (Gupta et al., 2017).

In contrast, unsupervised machine learning algorithms are used when the information used for training is neither classified nor labelled. Unsupervised learning explores how systems can infer a function describing an underlying structure from unlabelled data. The system does not determine a correct output, but examines the data and can draw inferences from datasets to describe hidden structures in unlabelled data.

Semi-supervised machine learning algorithms lie somewhere between supervised and unsupervised learning because they use both labelled and unlabelled data for training, usually a small amount of labelled data and a large amount of unlabelled data. Systems using this method can significantly improve learning accuracy. Semi-supervised learning is typically chosen when acquiring labelled data requires skilled and relevant resources in order to train on it or learn from it. Otherwise, acquiring unlabelled data generally requires no additional resources.

Reinforcement machine learning algorithms are a learning method that interacts with its environment by performing actions and discovering errors or rewards. Trial-and-error search and delayed reward are the most important features of reinforcement learning. This method enables machines and software agents to automatically determine the ideal behaviour in a specific context in order to maximize their performance (de Toledo & Torrisi, 2019). Simple reward feedback is required for the agent to learn which action is best; this is known as the reinforcement signal.

CONCLUSION

Thus, it can be concluded from the above research report that machine learning is very important for the effective growth of industries. Data plays a crucial role in the development of different decisions in companies. The accuracy of data is very important, as all the decisions made by companies depend on the data. Different industries make use of data and implement machine learning so that they can reach better decisions. However, there exists no common algorithm that all companies can work with. There are different areas in which machine learning can be implemented. The common algorithms which can be implemented
for effective growth are SVM and random forest algorithms. The supervised and unsupervised algorithms are categorized so that relevant decisions related to the industries can be taken. The training processes are also implemented for better growth.
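As a rough illustration of the random forest idea recommended in this review (many decision trees trained on resampled data and combined by voting), the following Python sketch builds a small forest of one-split trees. It is a simplified teaching sketch, not the method of any cited article: the function names, the toy data and the choice of 25 trees are assumptions, and each "tree" is reduced to a single threshold on one randomly chosen feature.

```python
import random
from collections import Counter


def train_stump(rows, labels, feature):
    """Fit a one-split decision tree (a stump) on a single feature."""
    best = None
    for threshold in sorted({row[feature] for row in rows}):
        left = [lab for row, lab in zip(rows, labels) if row[feature] <= threshold]
        right = [lab for row, lab in zip(rows, labels) if row[feature] > threshold]
        if not left or not right:
            continue
        left_label = Counter(left).most_common(1)[0][0]
        right_label = Counter(right).most_common(1)[0][0]
        # Count training errors when each side predicts its majority label.
        errors = sum(lab != left_label for lab in left)
        errors += sum(lab != right_label for lab in right)
        if best is None or errors < best[0]:
            best = (errors, threshold, left_label, right_label)
    if best is None:
        # No usable split (all values identical): always predict the majority.
        majority = Counter(labels).most_common(1)[0][0]
        return (feature, float("inf"), majority, majority)
    _, threshold, left_label, right_label = best
    return (feature, threshold, left_label, right_label)


def train_forest(rows, labels, n_trees=25, seed=0):
    """Train stumps on bootstrap samples, each using one random feature."""
    rng = random.Random(seed)
    forest = []
    for _ in range(n_trees):
        # Bootstrap: draw as many rows as the dataset has, with replacement.
        picks = [rng.randrange(len(rows)) for _ in rows]
        feature = rng.randrange(len(rows[0]))
        forest.append(
            train_stump([rows[i] for i in picks], [labels[i] for i in picks], feature)
        )
    return forest


def predict(forest, row):
    """Combine the individual tree predictions by majority vote."""
    votes = Counter(
        left if row[feature] <= threshold else right
        for feature, threshold, left, right in forest
    )
    return votes.most_common(1)[0][0]


# Hypothetical toy data: two numeric features, two well-separated classes.
rows = [(1, 10), (2, 12), (3, 11), (8, 30), (9, 32), (10, 31)]
labels = ["low", "low", "low", "high", "high", "high"]
forest = train_forest(rows, labels)

if __name__ == "__main__":
    print(predict(forest, (1, 10)), predict(forest, (10, 32)))
```

The sketch shows why many trees are needed: a single stump trained on an unlucky bootstrap sample can be wrong, while the majority vote across the forest is far more stable. In practice an established library implementation with full decision trees would normally be used instead.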
REFERENCES
Agaoglu, M. (2016). Predicting Instructor Performance Using Data Mining Techniques in Higher Education. IEEE Access, 4, 2379-2387. https://doi.org/10.1109/access.2016.2568756

Aguinis, H., Hill, N., & Bailey, J. (2019). Best Practices in Data Collection and Preparation: Recommendations for Reviewers, Editors, and Authors. Organizational Research Methods, 109442811983648. https://doi.org/10.1177/1094428119836485

Bansal, G., Zahedi, F., & Gefen, D. (2015). The role of privacy assurance mechanisms in building trust and the moderating role of privacy concern. European Journal Of Information Systems, 24(6), 624-644. https://doi.org/10.1057/ejis.2014.41

Bengio, Y. (2013). Deep Learning of Representations: Looking Forward. Statistical Language And Speech Processing, 1-37. https://doi.org/10.1007/978-3-642-39593-2_1

Broadhurst, R., & Trivedi, H. (2018). Malware in Spam Email: Trends in the 2016 Australian Spam Intelligence Data. SSRN Electronic Journal. https://doi.org/10.2139/ssrn.3413442

Calandra, R., Raiko, T., Deisenroth, M., & Pouzols, F. (2012). Learning Deep Belief Networks from Non-stationary Streams. Artificial Neural Networks And Machine Learning – ICANN 2012, 379-386. https://doi.org/10.1007/978-3-642-33266-1_47

Capturing student learning with thematic analysis. (2017), 2(6). https://doi.org/10.26500/jarssh-02-2017-0601

Cioffi, R., Travaglioni, M., Piscitelli, G., Petrillo, A., & De Felice, F. (2020). Artificial Intelligence and Machine Learning Applications in Smart Production: Progress, Trends, and Directions. Sustainability, 12(2), 492. https://doi.org/10.3390/su12020492

Cusumano, M. (2005). Google. Communications Of The ACM, 48(2), 15-17. https://doi.org/10.1145/1042091.1042107

Dahl, G., Dong Yu, Li Deng, & Acero, A. (2012). Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition. IEEE Transactions On Audio, Speech, And Language Processing, 20(1), 30-42. https://doi.org/10.1109/tasl.2011.2134090

de Toledo, T., & Torrisi, N. (2019). Encrypted DNP3 Traffic Classification Using Supervised Machine Learning Algorithms. Machine Learning And Knowledge Extraction, 1(1), 384-399. https://doi.org/10.3390/make1010022

Deldar, M., Anbiaee, R., & Sayehmiri, K. (2021). Predicting Epithelial Ovarian Cancer First Recurrence with Random Survival Forest: Comparison Parametric, Semi-Parametric, and Random Survival Forest Methods. Journal Of Biostatistics And Epidemiology. https://doi.org/10.18502/jbe.v6i4.5680

Gupta, A., Pathak, S., & Kannadasan, R. (2017). An Evaluation of Supervised Machine Learning Algorithms for Heart Disease Diagnosis. International Research Journal Of Computer Science, 4(8). https://doi.org/10.26562/irjcs.2017.aucs10087

Hase, V. (2021). Sentiment/tone (Automated Content Analysis). DOCA - Database Of Variables For Content Analysis. https://doi.org/10.34778/1d

Hinton, G., & Salakhutdinov, R. (2010). Discovering Binary Codes for Documents by Learning Deep Generative Models. Topics In Cognitive Science, 3(1), 74-91. https://doi.org/10.1111/j.1756-8765.2010.01109.x

kumar, R. (2017). Signature Verification Using Support Vector Machine (SVM). International Journal Of Scientific Research And Management. https://doi.org/10.18535/ijsrm/v5i5.07

Mantas, C., Castellano, J., Moral-García, S., & Abellán, J. (2018). A comparison of random forest based algorithms: random credal random forest versus oblique random forest. Soft Computing, 23(21), 10739-10754. https://doi.org/10.1007/s00500-018-3628-5

Muller, A. (2015). Using Discourse Network Analysis to Measure Discourse Coalitions: Towards a Formal Analysis of Political Discourse, 11(2). https://doi.org/10.1515/wps-2015-0009

Najafabadi, M., Villanustre, F., Khoshgoftaar, T., Seliya, N., Wald, R., & Muharemagic, E. (2015). Deep learning applications and challenges in big data analytics. Journal Of Big Data, 2(1). https://doi.org/10.1186/s40537-014-0007-7

Paradis, E., O'Brien, B., Nimmon, L., Bandiera, G., & Martimianakis, M. (2016). Design: Selection of Data Collection Methods. Journal Of Graduate Medical Education, 8(2), 263-264. https://doi.org/10.4300/jgme-d-16-00098.1

Righi, M., D’Acunto, M., & Salvetti, O. (2016). An image enhancement tool: Pattern Recognition Image Augmented Resolution. Pattern Recognition And Image Analysis, 26(3), 518-523. https://doi.org/10.1134/s1054661816030160

Rodríguez, J., Jattin, J., & Soracipa, Y. (2020). Probabilistic temporal prediction of the deaths caused by traffic in Colombia. Mortality caused by traffic prediction. Accident Analysis & Prevention, 135, 105332. https://doi.org/10.1016/j.aap.2019.105332

Salakhutdinov, R., & Hinton, G. (2009). Semantic hashing. International Journal Of Approximate Reasoning, 50(7), 969-978. https://doi.org/10.1016/j.ijar.2008.11.006

Salton, G., & Buckley, C. (1988). Term-weighting approaches in automatic text retrieval. Information Processing & Management, 24(5), 513-523. https://doi.org/10.1016/0306-4573(88)90021-0

Shafer, S. (2014). Plagiarism Is Plagiarism Is Plagiarism. Anesthesia & Analgesia, 118(1), 1-2. https://doi.org/10.1213/ane.0000000000000031

Wuest, T., Weimer, D., Irgens, C., & Thoben, K. (2016). Machine learning in manufacturing: advantages, challenges, and applications. Production & Manufacturing Research, 4(1), 23-45. https://doi.org/10.1080/21693277.2016.1192517

YOU, C., & MA, B. (2017). Spectral-domain speech enhancement for speech recognition. Speech Communication, 94, 30-41. https://doi.org/10.1016/j.specom.2017.08.007

Z Zaghloul, M. (2012). Helicobacter pylori. Journal Of Medical Microbiology & Diagnosis, 01(02). https://doi.org/10.4172/2161-0703.1000e104
