
Computers & Industrial Engineering 137 (2019) 106099

journal homepage: www.elsevier.com/locate/caie

Understanding Big Data Analytics for Manufacturing Processes: Insights from Literature Review and Multiple Case Studies

Amine Belhadi (a, *), Karim Zkik (b), Anass Cherrafi (c), Sha'ri M. Yusof (d), Said El fezazi (a)

(a) Cadi Ayyad University, Marrakech, Morocco
(b) TICLab, International University of Rabat, Sale, Morocco
(c) ENSAM-Meknes, Moulay Ismail University, Meknes, Morocco
(d) University of Technology of Malaysia, Kuala Lumpur, Malaysia

ARTICLE INFO

Keywords:
Big Data Analytics
Manufacturing process
Big Data Analytics capabilities
Business intelligence
Literature review
Multiple case study

ABSTRACT

Today, we are undoubtedly in the era of data. Big Data Analytics (BDA) is no longer just a prospect for all levels of the organization. This is of special interest for manufacturing processes, given their high capital intensity, their time constraints and the huge amount of data already captured. However, past literature on BDA offers little to develop a better understanding of the capabilities and strategic implications needed to extract value from BDA. In that vein, the central aim of this paper is to develop a novel model that summarizes the main capabilities of BDA in the context of manufacturing processes. This is carried out by relying on the findings of a review of the ongoing research along with multiple case studies within a leading phosphate derivatives manufacturer, in order to point out the capabilities of BDA in manufacturing processes and outline recommendations to advance research in the field. The findings will help companies to understand big data analytics capabilities and their potential implications for their manufacturing processes, and will support those seeking to design a more effective BDA-enabling infrastructure.

1. Introduction

The widespread emergence of digital technologies, advancing computing power and the expansion of the industrial Internet of Things (IoT) have led to a new generation of networked, information-based technologies, data analytics, and predictive modeling (He & Wang, 2018). This new generation provides unparalleled integrated computing capabilities that give manufacturers better means to extract value from an increasingly huge amount of data and gain a powerful competitive advantage (Chiang, Lu, & Castillo, 2017; He & Wang, 2018).

According to the research report published in early 2011 by the McKinsey Global Institute under the heading “Big Data: The next frontier for innovation, competition, and productivity”, big data has permeated every part of life and will become paramount for production in the near future (Manyika et al., 2011). According to Manyika et al. (2011), Big Data involves datasets whose size exceeds the ability of typical database software tools to capture, store, manage, and analyze. Big Data is mainly characterized by the ability to handle data with four qualities: Volume (the size/scale of the data), Variety (the form/format of the data), Velocity (the rate at which the data is produced), and Veracity (the uncertainty/reliability of the data). Big Data is often associated with the concept of Analytics, which refers to the ability to acquire information from data by applying statistics, mathematics, econometrics, simulations, optimizations, or other techniques to support decision-making processes (Arunachalam, Kumar, & Kawalek, 2018; Wang, Gunasekaran, Ngai, & Papadopoulos, 2016).

Particularly for manufacturing processes, the challenge of Big Data Analytics (BDA) is even greater. The extensive use of process operation and control computers and information systems has made existing manufacturing process operation databases massive. Moreover, with the ever-increasing advancement of IoT devices, from conventional process sensors to images, videos and indirect measurement technologies, the data extracted from future smart manufacturing processes is expected to expand drastically (Qin, 2014). Therefore, there seems to be a consensus that only manufacturers able to analyze their manufacturing processes based on this rapidly growing mass of data will survive the next stage of the transformation of advanced manufacturing, in an age where data is a decisive competitive asset. Such manufacturers predict the best process flow to follow and proactively control their processes with this knowledge (Krumeich, Jacobi, Werth, & Loos, 2014).

Owing to the aforementioned considerations, Big Data Analytics in manufacturing processes has received increasing attention because of


Corresponding author at: Route Dar Si-Aïssa BP. 89 Avenue Echahid Mbarek El Mokhtar, Safi, Morocco.
E-mail addresses: belhadi-9@hotmail.fr, Belhadi.amine@outlook.com (A. Belhadi).

https://doi.org/10.1016/j.cie.2019.106099
Received 28 October 2018; Received in revised form 3 July 2019; Accepted 27 September 2019
Available online 28 September 2019
0360-8352/ © 2019 Elsevier Ltd. All rights reserved.

its considerable impact on manufacturing processes. First, the wide use of distributed control systems and the development of typical information and communication technologies (ICTs) have considerably changed the mode of production. Today’s manufacturing processes increasingly operate in an uncertain and complex environment with tricky operations and overcomplicated constraints (Cheng, Chen, Sun, Zhang, & Tao, 2018). Therefore, it becomes more and more difficult to build first-principle models of those complex processes, and many of the processes and practices that suited the traditional mode of production management are no longer appropriate (Cheng et al., 2018; Ge, Song, Ding, & Huang, 2017). Second, the immense need of production managers for real-time, dynamic, self-adaptive and accurate production management has brought new challenges to the traditional methods. It has become necessary to create manufacturing intelligence from real-time data in order to provide precise predictions of product quality, production and processing time. This is done with new, effective techniques that work within shorter computation times to control continuous real-time production systems and to identify faults, defects and other abnormal situations, while supporting accurate and timely decision-making (Cheng et al., 2018; He & Wang, 2018).

Notwithstanding the enthusiasm and growth of interest in Big Data Analytics, little is known about its key capabilities for manufacturing processes. Indeed, organizations willing to adopt Big Data in their manufacturing processes are struggling to better understand the concept and then gain business value from BDA (Wamba, Akter, Edwards, Chopin, & Gnanzou, 2015). Moreover, quite a few scholars highlight that BDA is still at an early stage and that there are still undiscovered directions to explore on BDA in manufacturing processes. To bridge the existing knowledge gap in the literature, the present paper relies on previous studies on BDA in manufacturing processes, along with in-depth multiple case studies within an international company using Big Data Analytics to improve its manufacturing process, to achieve the following research objectives:

1. Clear up the concept of BDA in the context of manufacturing processes,
2. Review, classify and summarize all relevant articles dealing with BDA in manufacturing processes, drawing on a conceptual framework for classifying the literature,
3. Point out future research trends to enhance the capabilities of BDA in the manufacturing process,
4. Analyze in depth the findings of multiple case studies within a leading company to provide recommendations to advance BDA implementation in manufacturing processes.

The remainder of this paper is arranged as follows. Section 2 provides an integrative definition of the concept of Big Data Analytics. The research scope along with the methodology pursued to conduct the study are then introduced in Section 3, followed by Section 4 and Section 5, which present and discuss in depth the findings and results of the literature review and the multiple case studies. Section 6 points out the implications for research and practice along with the contributions of the study. Finally, the paper is concluded by the summary, limitations and suggestions for a future research agenda in Section 7.

2. The concept and definitions of Big Data and Analytics

Big Data Analytics is undoubtedly a major avenue in the next round of information technology transformation in industry (Qin, 2014; Wamba et al., 2015). BDA involves two main concepts: Big Data and Data Analytics. The literature has widely discussed the benefits and outcomes of BDA for the growth and profitability of today’s companies (Gunasekaran, Yusuf, Adeleye, & Papadopoulos, 2018; Krumeich et al., 2014; Qin, 2014). However, the fast-paced evolution of the concept of BDA has raised some confusion regarding its definition (Wamba et al., 2015). In fact, there is no consensus on a clear and integrative definition of this compound concept.

2.1. Defining Big Data

When attempting to define Big Data, size is the immediate attribute that intuitively comes to mind (Gandomi & Haider, 2015). However, there are other important attributes of Big Data, namely data variety and data velocity. The three V’s (Volume, Variety and Velocity) were formerly the most commonly used framework to define Big Data (Chen, Chiang, & Storey, 2012). Quite a few definitions of ‘Big data’ are summarized in Wamba et al. (2015). Moreover, Beyer and Laney (2012, p. 2) propose a largely convergent definition of BD from Gartner using the three V’s as:

“high-volume, -velocity and -variety information assets that demand cost-effective, innovative forms of information processing for enhanced insight and decision making”

Over time, some other attributes have been added to the basic framework, such as Veracity, Variability, Volatility and Value (Tewari & Dwivedi, 2019). The seven V’s can be defined as:

• Volume: refers to the ever-growing magnitude and size of data generated. Big data sizes easily reach multiple terabytes or even petabytes.
• Variety: represents the heterogeneity, diversity and unevenness of data types in a dataset. Advanced ICTs in today’s companies generate structured, semi-structured, and unstructured data of various types such as text, sensor data, audio, video, log files and so on.
• Velocity: is the frequency of data generation and the high speed at which it should be processed, analyzed and acted upon.
• Veracity: introduced by IBM as a defining attribute of big data. Veracity refers to the degree of truthfulness and uncertainty related to most sources of data. Big Data proposes the use of specific tools and analytics to deal with imprecise and unreliable data.
• Variability: pioneered by SAS as an additional attribute of big data. The variability and complexity in the process of data generation imply a high need to connect, match, cleanse and transform data received from different sources.
• Volatility: refers to the capacity of storage and retention of data. With the huge volume and velocity of data, the issue of storage retention along with security of data becomes significant for big data.
• Value: coined by Oracle as the seventh V. The value of data generated is insignificant in its raw form compared to its huge volume. This value can be significantly increased by processing and analyzing large volumes of such data.

In addition, a range of ‘big data’ definitions focusing on different aspects of the concept exists in the literature. While some definitions consider the ability perspective (e.g. Qin, 2014), authors like Arunachalam et al. (2018) and Wamba et al. (2015) proposed more holistic definitions that encompass the technologies for processing, managing and analyzing data. In sum, a universally accepted benchmark definition of BD does not yet exist. From the perspective of process manufacturing, organizations need to think about ‘big data’ as a means to decode complex manufacturing systems by developing advanced skills and competencies that wield advanced IT tools to gather, store, organize and extract data from different ICTs, in order to analyze it and generate useful information that supports the decision-making process towards value creation and optimization.

2.2. Insights on Data Analytics

Big data is not an end in itself. Its potential value cannot be realized unless it is employed as a supportive tool of decision-making


Fig. 1. Classification of BD Analytic problems and techniques.

processes (Gandomi & Haider, 2015). To deal with this issue, quite a few techniques and processes of data mining and statistical analysis under the heading “Data Analytics” have been designed by scholars and practitioners from the artificial intelligence, algorithm, and database communities in order to extract actionable insight from high volumes of scalable and varied data (Chen et al., 2012). When tackling a big data project, a plethora of analytical techniques exists. According to Sivarajah, Kamal, Irani, and Weerakkody (2017), BDA can enhance decision-making and increase organizational output by extracting sense from the data for different types of analytic problems, namely descriptive analytics, predictive analytics and prescriptive analytics, as presented in Fig. 1.

Descriptive analytics: provide hindsight on the current state of a business situation using business intelligence tools through the generation of regular reports, ad hoc reports, and alerts (Joseph & Johnson, 2013; Sivarajah et al., 2017). Descriptive analytics are regarded as backward looking and disclose ‘what happened’ or alert on what is going to happen through a subset of techniques. Banerjee, Bandyopadhyay, and Acharya (2013) highlight, in addition to usual reporting and scoreboards, the dashboard as a form of application when an organization systematically produces numerous indicators or metrics based on data to monitor a process over time. Further tools such as Advanced Data Visualization (ADV), data mining and advanced statistical analysis are pointed out to leverage the descriptive analysis of processes. Other techniques are emphasized to support descriptive analytics, such as text, video and other multimedia analytics (Gandomi & Haider, 2015). These tools are necessary to fill the need of extracting facts from texts, unstructured audio and video streams, connecting them with the relevant decision-making process and finally nurturing a data-driven decision process.

Inquisitive analytics: vet ‘why something happened’. Inquisitive analysis is generally fed by the output of descriptive analysis, or by additional data gathered if needed using descriptive analytics techniques, in order to disclose the root causes of a problem (Banerjee et al., 2013). Generally speaking, inquisitive analytics techniques seek to reveal the potential or hidden rules, characteristics and relationships (such as dependency, similarity, correlations, etc.) that exist in the data, using techniques such as generalization, association, sequence pattern mining, and clustering analytics (Cheng et al., 2018). Other techniques exist, such as Modeling Statistics, Query Tools, Spreadsheets, OLAP Tools and Decision Trees (Chen et al., 2012).

Predictive analytics: aim to provide glimpses and foresight into the future. Based on historical and current data, predictive analytics apply forecasting and statistical modelling to give insight into “what is likely to happen” in the future based on supervised, unsupervised, and semi-supervised learning models (Gandomi & Haider, 2015; Sivarajah et al., 2017). Cheng et al. (2018) describe two categories of predictive analytics techniques. The first category is statistical analytics-oriented techniques, which use mathematical models to induce and analyze existing data as well as infer and predict unknown information. These techniques include multinomial logit models (Sivarajah et al., 2017), regression techniques (Gandomi & Haider, 2015), K-nearest neighbor (KNN) and Bayesian techniques (Cheng et al., 2018). The second category is knowledge discovery (KD)-oriented techniques, which are data-driven and do not require assumptions and problems to be stated in advance. This category mainly includes machine learning techniques such as Neural Networks (NN), Multiple Backpropagation (MBP), Self-Organizing Map (SOM) (Sivarajah et al., 2017), rough set, genetic algorithm (GA), association rule, support vector machine (SVM), generalized sequential pattern (GSP), etc. (Cheng et al., 2018).

Prescriptive analytics: optimize the process models based on the output information of predictive analytic models (Sivarajah et al., 2017). Furthermore, prescriptive analytics is concerned with defining the set of decisions that should be taken, through the interpretation of cause-effect relationships among analytic results and business process policies (Banerjee et al., 2013). In spite of its difficulty, some authors cite a set of techniques like Discrete Choice Modeling, Linear and Non-linear Programming and Value Analysis (Banerjee et al., 2013; Sivarajah et al., 2017). Moreover, “what if” simulators provide insights about the likely options that the business should implement in order to optimize its process (Banerjee et al., 2013).

2.3. Towards an integrative definition of Big Data Analytics

“Big Data Analytics” is the buzzword of the day. BDA is a new trend in operations management, which brings together a set of techniques for handling huge volumes of data so as to identify trends, detect patterns and draw valuable insights. From the viewpoint of process manufacturing management, Fig. 2 depicts our view of the essence of BDA in the manufacturing process context.

According to our point of view, the manufacturing process generates


Fig. 2. Essence of Big Data Analytics in the context of Manufacturing Process.

data by means of advanced ICTs such as sensors, RFID, etc. This data is of high volume, large variety, high velocity, unascertained veracity, wide variability, intense volatility and low value. By using the high real-time processing ability of the theories and methods of big data, data undergoes several valuation steps, i.e. “Acquisition and Recording”, “Cleaning and Annotation” and “Aggregation and Representation”, to provide reliable and complete raw data support for further data analysis. Afterwards, valuable information and knowledge on four levels (descriptive, inquisitive, predictive and prescriptive) can be discovered from the big data using advanced analytics techniques in order to redirect the piloting of the manufacturing process.

3. Research scope and methodological approaches

Papers addressing BDA in the context of manufacturing processes are beginning to appear in the literature (e.g. Ge et al., 2017; Gunasekaran et al., 2018; He & Wang, 2018; Krumeich et al., 2014). However, these papers did not discuss in depth BDA capabilities for manufacturing processes. In addition, studies and articles dealing with BDA in manufacturing process related issues do not provide an overall understanding of its capabilities across facets and dimensions, and limit themselves to a single aspect. This suggests that BDA in the context of manufacturing processes is still evolving and that there are still studies to conduct in this area; hence, an intelligible understanding of the subject, its facets and its classification has yet to be fully established.

The significant advancement of BDA has uncovered not only a lack of academic and theoretical research in the field but also a distinct lack of managerial insights and applications in the real-life context of manufacturing processes. This paucity of literature creates a state of uncertainty among organizations about the capabilities and potential benefits that BDA could provide to their manufacturing processes. Therefore, this paper seeks to address this shortage in the literature by using the two-phase methodological research approach shown in Fig. 3. The first phase conducts an extensive and comprehensive literature review of journal articles to uncover the unexplored research trends, while the second phase conducts a thorough analysis of multiple case studies within a leading chemical company, which is currently implementing BDA for the optimization of its manufacturing processes. The aim is to outline recommendations to fill the gaps in the research trends identified.

3.1. Comprehensive review of literature on BDA in manufacturing process

With a view to gaining insight and providing a more comprehensible understanding of the capabilities of BDA in manufacturing processes, a structured review and classification of the literature using a bibliometric network is conducted. This methodology is pursued owing to its rationality, transparency and reproducibility, while gaining a meaningful understanding through the analysis of extant literature and the comparison of influential work (Sivarajah et al., 2017). Moreover, Sivarajah et al. (2017) argued that this methodology helps to establish current evidence around a technology or a treatment, for example by summarizing the evidence of the benefits and drawbacks of a specific technique (in this case, BDA in manufacturing processes). To that end, this study adopts the literature review methodology proposed by Rowley and Slack (2004). Recent papers likewise followed a similar methodology in extracting insights from past BDA research papers (e.g. Arunachalam et al., 2018; Sivarajah et al., 2017; Wang et al., 2016). This review methodology encompasses, in addition to the step of material collection, three literature analysis steps: descriptive analysis, bibliometric analysis and issues analysis.

3.1.1. Literature search and material collection

Rowley and Slack (2004) proposed several tools to assist in the identification and location of documents. From these tools, we opt for search engines, which are believed to be efficient for locating web pages with simple keyword-based searches. In order to pick keywords for this study and ensure that the topic of the study was fully covered, “Big Data”, “Analytics” and “Manufacturing process” were selected as the main keywords. Other related keywords were also added, such as “Operations Management”, “Process industry”, “Production process” and “predictive analytics”. The keywords were used both separately and in combination (using the Boolean operators “and”/“or”) for a comprehensive search within the following databases: Science Direct, Emerald, Springer, Taylor & Francis, IEEE Xplore, Wiley Library, Inderscience, Scopus (Elsevier) and Web of Sciences. The papers that used the abovementioned keywords and/or their combination in the title, abstract, and full text were identified.

The search started on June 12, 2018 and ended on June 15, 2019. Even though the initial search was not limited by a time restriction, the final list of prescreened articles fell within the period 2004–2019. Prescreening resulted in 304 papers, which were uploaded to Endnote, a reference management software package, for further analysis. Duplicate references were then removed, followed by the elimination of out-of-scope and irrelevant papers through a comprehensive reading of the abstracts of the remaining papers (Arunachalam et al., 2018; Wamba et al., 2015; Wang et al., 2016). Finally, the current review is restricted to papers that clearly describe the application of BDA in manufacturing processes. This constituted a final list of 62 papers covering the period from 2004 to 2019.
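
The search and screening steps described above can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: the helper names (`build_queries`, `deduplicate`, `screen`), the record fields and the three toy records are hypothetical, the exact query strings used in the study are not reported, and the term-matching filter is a crude stand-in for the manual reading of abstracts.

```python
import re
from itertools import product

# Keywords quoted from the search strategy described in the text.
MAIN_KEYWORDS = ["Big Data", "Analytics", "Manufacturing process"]
RELATED_KEYWORDS = ["Operations Management", "Process industry",
                    "Production process", "predictive analytics"]


def quote(keyword):
    """Wrap a keyword in double quotes for an exact-phrase search."""
    return f'"{keyword}"'


def build_queries(main, related):
    """Hypothetical query builder: keywords alone, then AND-combinations."""
    singles = [quote(k) for k in main + related]
    core = " AND ".join(quote(k) for k in main)
    pairs = [f"{quote(m)} AND {quote(r)}" for m, r in product(main, related)]
    return singles + [core] + pairs


def normalize_title(title):
    """Lowercase a title and collapse punctuation so near-identical titles match."""
    return re.sub(r"[^a-z0-9]+", " ", title.lower()).strip()


def deduplicate(records):
    """Drop records whose DOI or normalized title has already been seen."""
    seen, unique = set(), []
    for rec in records:
        keys = {normalize_title(rec["title"])}
        if rec.get("doi"):
            keys.add(rec["doi"].lower())
        if keys & seen:
            continue
        seen |= keys
        unique.append(rec)
    return unique


def screen(records, required_terms, first_year=2004, last_year=2019):
    """Keep in-period records whose title or abstract mentions every term.
    Assumption: this only approximates the manual abstract screening."""
    kept = []
    for rec in records:
        text = (rec["title"] + " " + rec["abstract"]).lower()
        in_period = first_year <= rec["year"] <= last_year
        if in_period and all(t.lower() in text for t in required_terms):
            kept.append(rec)
    return kept


# Toy records, invented purely for illustration.
records = [
    {"title": "Big Data Analytics in Process Control", "doi": "10.0000/example.1",
     "year": 2017, "abstract": "We apply big data analytics to a manufacturing process."},
    {"title": "Big data analytics in process control.", "doi": None,
     "year": 2017, "abstract": "We apply big data analytics to a manufacturing process."},
    {"title": "Big Data in Retail Marketing", "doi": "10.0000/example.2",
     "year": 2016, "abstract": "Big data for retail customers."},
]

queries = build_queries(MAIN_KEYWORDS, RELATED_KEYWORDS)
unique = deduplicate(records)
screened = screen(unique, ["big data", "manufacturing"])
print(len(queries), len(unique), len(screened))
```

In this sketch the second toy record is dropped as a duplicate of the first (same normalized title), and the third is dropped at screening because it never mentions manufacturing. A real pipeline would operate on records exported from the reference manager and would still need a human decision on scope.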


Fig. 3. Methodological approaches of the study.

3.1.2. Literature analysis

The list of papers collected is analyzed in two steps. In the first step, descriptive and bibliometric analysis is carried out. The content of the selected papers is reviewed and classified based on categories such as the distribution of publication year and research methodology, among others (Belhadi, Sha’ri, Touriki, & Fezazi, 2018). The descriptive analysis process is performed using bibliometric analysis in order to synthesize existing research, and thematic analysis for further conceptualization of the content of the literature. To this end, BibExcel software is used due to its flexibility and capability to manage big volumes of data along with its compatibility with applications such as Excel, Pajek and Gephi. Afterwards, an issues analysis is used to identify research trends and construct a theoretical framework of BDA capabilities in manufacturing processes. The purpose is to identify themes, topics and conceptual aspects of BDA capabilities relevant to the manufacturing process context. The findings from the descriptive analysis and issues analysis are given in Section 4.

3.2. Multiple case studies: BDA implementation in North African Phosphates Company

In order to get practical insights into BDA in manufacturing processes in a real-life context, in-depth data were retrieved from a number of context-specific studies in the North African Phosphates Company (NAPC) (for reasons of confidentiality, the name of the company cannot be disclosed; NAPC is a pseudonym). The aim was to conduct three case studies and collect more in-depth and practical data from the company to draw lessons for the effective use of BDA and the associated capabilities and competitive outcomes for the manufacturing process. In our case, the case study as a research strategy is strongly recommended because of its suitability for the building of theories during their formative stage (Belhadi, Touriki, & fezazi, 2016; Gunasekaran et al., 2018). Moreover, the multiple case studies approach allows studying different facets of a phenomenon in a real environment, particularly when the phenomenon and its conditions cannot be easily explained (Ketokivi & Choi, 2014). Therefore, the multiple case studies approach was selected.

3.2.1. General situation of the study area

NAPC is a leading manufacturer of phosphates and their derivatives located in North Africa. The 95% state-owned firm, charged with managing the North African country’s vast reserves of phosphate, produces a number of derivative products such as phosphoric acid, feeds and fertilizers. In 2017, NAPC was at the midpoint of a $20 billion industrial transformation program with the aim of enhancing its industrial capacity, upgrading cost effectiveness, and stimulating business competitiveness. The program hinged on moving beyond the traditional activities of mining and exporting raw phosphate rock towards greater production of phosphoric acid and finished fertilizer products. In the next phase of the program, the company planned to ramp up its focus on fertilizer production, especially for markets in Africa, where fertilizer has historically been underused. Consequently, a huge opportunity arises to meet and take advantage of the growing demand for fertilizer by proposing products suited to the needs of African farmers. In order to cope with that increase, the digital revolution and digitalization at all levels emerge as a new challenge. Therefore, the different entities of the company took advantage of the industrial program to integrate the digitization component into the different manufacturing processes (Fig. 4). The company handles a variety of manufacturing processes and technologies over two production sites. In the mining site, various manufacturing processes extract and prepare the phosphate rock, which is the main input of the chemical site responsible for the manufacturing of phosphoric acid and various qualities of feeds (MCP/DCP) along with a wide variety of NPK-based fertilizers. NAPC aims for a qualitative leap with digital and big data analytics to enable industrial teams to develop or even duplicate models of turnkey factories in Africa.

3.2.2. Strategy of data collection

As the author is directly involved in the project of digitalization of the manufacturing process of the NAPC, the case study follows the strategy of an engaged research project to collect and analyze data. In engaged or action research, the researcher and the company’s staff work together to resolve or improve a given issue in the company, and to contribute to the body of knowledge (Hutter, Haeussler, & Missbauer, 2018). Therefore, the strategy of data collection was based on direct, on-site observation along with interviews of a number of the key members of the project. Throughout the visits, the author started with observation, collection of available materials and semi-structured interviews along with a guided tour around the various entities implementing the BDA project. The main interviewer was one of the key engineers directly involved in the project.

4. Findings and results from literature review

4.1. Bibliometric analysis

As mentioned earlier, the software BibExcel is used to exploit data in RIS format, which includes all the required bibliographic information for the papers. The focus is on information related to publication year, authors, title, journal, keywords and affiliations.

4.1.1. Distribution of papers per year of publication

As shown in Fig. 5, the distribution of papers per year indicates a clear increasing trend regarding the number of publications dealing


Fig. 4. Flowchart of the overall manufacturing process of the company.

with BDA in manufacturing processes. Starting from 2014, a steady rise in the number of papers can be noticed, from seven articles in 2014 to 22 articles in 2018. Moreover, most of the publications appeared in the last four years (85% of total publications). The trend highlights a significant increase in the interest among researchers in investigating the topic of BDA in manufacturing processes.

4.1.2. Affiliation statistics

Using BibExcel, the affiliations of all first authors were extracted from the RIS data file. Afterwards, the country in which the institution of the first author is located was extracted for further analysis, plotted in Fig. 6. Although the distribution of papers by first authors’ affiliations shows that researchers worldwide (26 countries from the five continents) are interested in research on the topic of BDA in manufacturing processes, institutions in the USA and China dominate the top five. This list also contains Germany, Taiwan and South Korea.

4.1.3. Keywords statistics

In a similar vein, BibExcel was used to extract the keywords used in the articles, and their frequency of usage was calculated for the further analysis given in Table 1. The most used keywords are “Big Data”, “Big Data analytics”, “Data Mining”, “Manufacturing Processes” and “Process control”. These keywords are the effective search keywords of the study.

4.1.4. Contributing authors and journals

Table 2 lists the top ten contributing authors along with their number of publications. Four authors, i.e. Zhong RY, Werth D, Krumeich J and Loos P, tied for first place with three publications each, followed by six other authors with two publications each.

Finally, Table 3 depicts the contribution of journals according to the

Fig. 5. Distribution of papers per year of publication.


Fig. 6. Distribution of papers by first authors’ affiliations.

number of published articles. It is noticeable that eight journals shared about one-third of the reviewed articles. These journals are Computers & Industrial Engineering (CAIE), Computers and Chemical Engineering (CCE), International Journal of Production Research (IJPR), The International Journal of Advanced Manufacturing Technology (TIJAMT), Industrial Management & Data Systems (IMDS), Journal of Process Control (JPC), Journal of Cleaner Production (JCLP) and Journal of Failure Analysis and Prevention (JFAP). The other two thirds of the published articles are shared between 22 other journals and 18 international conferences.

Table 1
List of the most frequently used keywords.
Keywords                   Frequency    Keywords                         Frequency
Big Data                   26           Manufacturing Data Processing    4
Big Data Analytics         12           Manufacturing                    4
Data Mining                8            Process Industry                 4
Manufacturing Processes    8            Data Analytics                   3
Process Control            5            Industries                       3
Internet of Things         5            Operations Management            3
Data Analysis              5            Optimization                     3
Machine Learning           5            Predictive Analytics             3
Energy Efficiency          4            Quality Control                  3

Table 2
Top ten contributing authors.
Authors           Number of publications
Zhong RY          3
Werth D           3
Krumeich J        3
Loos P            3
Akter S           2
Chien CF          2
Chongwatpol J     2
Gunasekaran A     2
Hammer M          2
Liu Y             2

Table 3
Distribution of the reviewed paper in various journals and conferences.
Journal                                                             Number of published articles    %
Computers & Industrial Engineering                                  7                               10.30%
Computers and Chemical Engineering                                  5                               7.35%
International Journal of Production Research                        5                               7.35%
The International Journal of Advanced Manufacturing Technology      3                               4.41%
Industrial Management & Data Systems                                2                               2.94%
Journal of Process Control                                          2                               2.94%
Journal of Cleaner Production                                       2                               2.94%
Journal of Failure Analysis and Prevention                          2                               2.94%
Others (one reference of each journal)                              22                              32.36%
International conferences                                           18                              26.47%
Total                                                               68                              100.00%

4.2. Issues analysis

Having carried out the descriptive analysis of selected papers through bibliometric analysis, the analysis of issues is used to browse BDA issues in manufacturing processes in order to conceptualize BDA capabilities. This is a philosophical conceptualization, which is considered as a first step towards theory building (Seuring & Müller, 2008). Following the guidance of several authors (e.g. Arunachalam et al., 2018; Seuring & Müller, 2008), a bottom-up approach is used here, relying on repeated reading of the articles. Close reading and re-reading of the papers allowed us to develop a conceptual framework that circumscribes BDA capabilities in manufacturing processes (Fig. 7).
There are three layers toward conceptualizing BDA capabilities for manufacturing process. The first layer of BDA capabilities is the manufacturing process challenges, which refer to the components of the manufacturing process with the greatest potential value to capture. These components include, in addition to Quality and Process Control (Q&PC), other high-impact components such as Energy & Environment Efficiency (E&EE), Proactive diagnosis and Maintenance (PD&M) and Safety and Risk Analysis (S&RA). All these components generate a huge stream of data and thrive on a number of BDA faculties, the second layer, which allow data to be transformed into actions. These faculties include Data Warehousing (DW), Data Aggregation and Integration (DA&I), Data Analytics and Modeling (DA&M) and Data-Driven Culture (DDC). The third layer concerns the values that BDA could provide to manufacturing process. Indeed, BDA embodies quite a few values such as enhancing transparency, improving performance, supporting decision making and enhancing knowledge. Further description of the different layers of the framework is given in the following sections.

4.2.1. Manufacturing process challenges
Table 4 provides the distribution of papers by the challenges of manufacturing process addressed by BDA. Above all, it is noticeable that numerous papers dealt with more than one manufacturing process challenge addressed by BDA. Afterwards, the overwhelming majority of


Fig. 7. Overall framework of BDA capabilities in manufacturing process.

the papers are about Q&PC (51 papers). This is followed by E&EE, PD&M and S&RA with respectively 18, 9 and 9 papers.
Quality and Process Control (Q&PC): In the near future, organizations that are able to monitor their operations through the fast-growing amount of data, forecast their quality faults and proactively control their processes by means of advanced analytics will be ahead of their competitors (Krumeich, Werth, & Loos, 2016). In this regard, He and Wang (2018) discussed a new generation of Statistical Process Control (SPC) dealing with complex and multivariable data for which MSPM methods may fail or lead to misleading results (multimodal distribution, dynamics, nonlinear relationships between variables, non-Gaussianity, time-varying characteristics, and other characteristics such as outliers, gross errors and/or failed sensors). This new generation of SPC implies increasing use of Model Predictive Control (MPC) (Chongwatpol, 2015; Isaksson, Harjunkoski, & Sand, 2018; Krumeich et al., 2016; Li & Kashiwagi, 2005; Zhuchkov, 2015) through the integration of advanced BDA techniques such as Control Chart Pattern Recognition (CCPR), Regression-Based Methods, Neural Networks and Support Vector Methods (Lee, Yoon, & Kim, 2017; Tewari & Dwivedi, 2019; Weese, Martinez, Megahed, & Jones-Farmer, 2016). Another aspect of Q&PC is alarm management, mentioned by Hu, Shah, and Chen (2018), which relies on a set of BDA techniques such as Run Length Distribution & Delay Timer Analysis (RLD&DTA), Chattering Index (CI), Oscillating Alarm Analysis (OAA), Alarm Flood Analysis (AFA), Causality Inference for Alarms (CIA) and Mode-Dependent Alarm Analysis (MDAA). The aim is to predict the occurrence of abnormal situations and then prevent them from propagating along the interconnected pathways and causing significant and catastrophic disruptions in the process. On the other hand, when either little is known about the process or information is unavailable as to what constitutes an out-of-control event, machine learning techniques such as Big Data Approximating Control (BDAC), PCA/PLS, k-means clustering, Self-organizing map (SOM) and manifold learning methods are used (Ge et al., 2017; Stanley, 2018; Weese et al., 2016). The purpose is to spot correlations and causal relations between process variables in order to capture material and information flow paths in the process (Hu et al., 2018; Lee, Shin, & Realff, 2018).
Energy and Environment Efficiency (E&EE): Under the overwhelming pressure of limited natural resources and increasingly serious environmental issues (Belhadi, Touriki, & fezazi, 2018), manufacturing processes are placing energy saving and emission reduction as two important challenges that must be addressed by BDA (Zhang, Ma, Yang, Lv, & Liu, 2018). For instance, advanced analytical tools can be applied to optimize the factors that are believed to have the biggest impact on environmental performance. Zhang et al. (2018) demonstrated this by proposing a big data driven analytical framework based on two technologies, i.e., energy big data acquisition and energy big data mining, which are utilized to reduce the energy consumption and emission of energy-intensive manufacturing industries. Advanced technologies implemented along with sophisticated operating conditions make energy consumption behavior very hard to understand, since very complicated, nonlinear dynamic variables are involved (Chongwatpol, 2016). Therefore, BDA is vital to better understand and control the operational parameters, optimize energy management, and reduce environmental impact (Chiang et al., 2017). To this end, BDA has been widely applied in manufacturing processes, for example in predictive energy consumption models (Eckstein et al., 2016; Grossmann, 2014; Shao, Li, Gu, Zhang, & Luo, 2018; Shin, Woo, & Rachuri, 2014), smart grid management (Chiang et al., 2017) and building energy management (Favoino, Fiorito, Cannavale, Ranzi, & Overend, 2016; Hammer, 2018).
Proactive diagnosis and Maintenance (PD&M): Besides its use in manufacturing process for quality and environment issues, BDA is also being incorporated in another critical challenge facing the manufacturing process, namely proactive diagnosis and maintenance of equipment. One of the early known industry applications of PD&M is Condition-based Maintenance (CBM) (Krumeich et al., 2016), which


Table 4
Challenges of BDA related to manufacturing process.

Challenge: Quality and Process Control (Q&PC)
Key elements:
• Statistical Process control
• Alarm management
• Connectivity and causality analysis
• Unsupervised Learning Approaches to Process Monitoring
References: Li and Kashiwagi (2005), M’Sahli and Matlaya (2005), López-Escobar, González-Palma, Almorza, Mayorga, and Carnero (2012), Krumeich et al. (2014), Grossmann (2014), Qin (2014), Yang, Park, Cho, Song, and Kim (2014), Chongwatpol (2015), MacGregor, Bruwer, Miletic, Cardin, and Liu (2015), O’Donovan, Leahy, Bruton, and O’Sullivan (2015), Stojanovic, Dinic, and Stojanovic (2015), Severson, Chaiwatanodom, and Braatz (2015), Zhuchkov (2015), Eckstein et al. (2016), Krumeich et al. (2016), Kumar et al. (2016), Li (2016), Smirnov and Nasonov (2016), Qiu et al. (2016), Wang et al. (2016), Weese et al. (2016), Chen et al. (2016), Beneventi et al. (2017), Chen (2017), Chiang et al. (2017), Chien, Liu, and Chuang (2017), Choi et al. (2017), Ge et al. (2017), Hammer, Somers, Karre, and Ramsauer (2017), Ji and Wanga (2017), Lee et al. (2017), Lindström, Larsson, Jonsson, and Lejon (2017), Wamba et al. (2017), Zhang, Ren, et al. (2017), Zhang, Li, et al. (2017), Zhong, Xu, Chen, and Huang (2017), Amini and Chang (2018), Gunasekaran et al. (2018), Hammer (2018), He and Wang (2018), Hu et al. (2018), Isaksson et al. (2018), Khakifirooz, Chien, and Chen (2018), Kho, Lee, and Zhong (2018), Kozjek et al. (2018), Lee et al. (2018), Mehta, Butkewitsch-Choze, and Seaman (2018), Ning and You (2018), Sadati et al. (2018), Stanley (2018), Wamba, Akter, and Bourmont (2018), Wang et al. (2018), Tewari and Dwivedi (2019)
Number: 52
%: 59.09%

Challenge: Energy and Environment Efficiency (E&EE)
Key elements:
• Energy big data acquisition and mining
• Energy consumption patterns
• Smart Grids
References: Grossmann (2014), Shin et al. (2014), Shrouf, Ordieres, and Miragliotta (2014), O’Donovan et al. (2015), Chongwatpol (2016), Favoino et al. (2016), Li (2016), Papacharalampopoulos et al. (2016), Paul, Zhong, and Bose (2016), Beneventi et al. (2017), Chiang et al. (2017), Ge et al. (2017), Zhang, Ren, et al. (2017), Zhang, Li, et al. (2017), Hammer (2018), Hidalgo et al. (2018), Lee et al. (2018), Shao et al. (2018), Zhang et al. (2018), Shukla and Mattar (2019)
Number: 18
%: 20.45%

Challenge: Proactive diagnosis and Maintenance (PD&M)
Key elements:
• Condition based Maintenance
• Online/Real Time predictive maintenance
• Tele-Maintenance
References: Qin (2014), O’Donovan et al. (2015), Krumeich et al. (2016), Beneventi et al. (2017), Lindström et al. (2017), Sun, Jämsä-Jounela, Todorov, Olivier, and Craig (2017), Zhang, Ren, et al. (2017), Zhang, Li, et al. (2017), Hammer (2018), Isaksson et al. (2018)
Number: 9
%: 10.23%

Challenge: Safety and Risk Analysis (S&RA)
Key elements:
• Bayesian based HAZOP
• Advanced inherently safer design
• Safety instrumented systems
References: O’Donovan et al. (2015), Li (2016), Chiremsel et al. (2016), Khakzad and Reniers (2016), Choi et al. (2017), Zerrouki and Smadi (2017), Hammer (2018), Hu et al. (2018), Wang, Cheng, et al. (2018), Wang, Liu, et al. (2018)
Number: 9
%: 10.23%

Total
Number: 88
%: 100%

Note: Some articles are included more than once since they discuss more than one manufacturing process challenge.
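
The percentage column of Table 4 can be reproduced from the counts alone. The short Python sketch below is only an illustration of that arithmetic: the dictionary and function names are hypothetical, and the counts are copied from Table 4. Because one paper may address several challenges, the denominator is the number of paper-challenge assignments (88), not the number of reviewed papers.

```python
# Counts copied from Table 4; a paper may be counted under several challenges.
challenge_counts = {
    "Q&PC": 52,
    "E&EE": 18,
    "PD&M": 9,
    "S&RA": 9,
}


def shares(counts):
    """Return each challenge's percentage of all paper-challenge assignments."""
    total = sum(counts.values())
    return {name: round(100 * n / total, 2) for name, n in counts.items()}


total_assignments = sum(challenge_counts.values())
challenge_shares = shares(challenge_counts)

for name, pct in challenge_shares.items():
    print(f"{name}: {pct:.2f}%")
```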

Table 5
Classification of Data Mining and Analytics techniques.

Category: Descriptive analytics techniques
Offline techniques:
• Fine-Kinney
• continuous quality control
• Data Visualisation
• Big data perception & acquisition
• data mining
• In-memory management & connectivity
• API Management
Online techniques:
• Visualization & Data Modeling
• Aggregation & Contextualization
• Event Bus
References: Wang, Cheng, et al. (2018), Wang, Liu, et al. (2018), Mehta et al. (2018), Lindström et al. (2017), Kozjek et al. (2018), Kho et al. (2018), Zhang et al. (2018), Chongwatpol (2016), Krumeich et al. (2014), Yang et al. (2014), Tewari and Dwivedi (2019)

Category: Inquisitive analytics techniques
Offline techniques:
• Random Forest (RF)
• Excursions diagnosis
• Fault Tree Analysis
• heuristic algorithms for scheduling tasks: First-in-first-out (FIFO), Earliest-planned-time (EPT)
• forecasting
• K-means Clustering
• Gradient Descent Optimization
• Process Monitoring
• Statistical process monitoring
• Sampling
• Hadoop MapReduce programming
• data-driven modeling
• meta-heuristic optimization
• Characterization & Classification
Online techniques:
• Multivariate latent variable (PCA/PLS)
• self-adaptive stream processing
• Data classification
References: Chien et al. (2017), Chiremsel et al. (2016), Kozjek et al. (2018), Kho et al. (2018), MacGregor et al. (2015), Hidalgo et al. (2018), Amini and Chang (2018), He and Wang (2018), Tewari and Dwivedi (2019), Hu et al. (2018), Kumar et al. (2016), Sadati et al. (2018), Shukla and Mattar (2019)

Category: Predictive analytics techniques
Offline techniques:
• Bayesian Network
• linear and a nonlinear Predictive Control
• control chart and defect prediction
• Process Analytics
• Process Prediction
• Predictive control
• Statistical Learning Methods
• Regression analysis
• Cluster analysis
• neural network
• clustering-based on prediction scores
• Kernel Smoothing Methods
• principal component analysis
• Quantitative modeling
Online techniques:
• Classifiers for pattern recognitions
• Support Vector Machine (SVM)
• online predictive maintenance
• fault prediction
• Streaming Analytics
• pattern matching and approximation
References: Chiremsel et al. (2016), Chen et al. (2016), Eckstein et al. (2016), Lindström et al. (2017), Khakzad and Reniers (2016), Chongwatpol (2016), Ji and Wanga (2017), Krumeich et al. (2014), Tewari and Dwivedi (2019), Kumar et al. (2016), M’Sahli and Matlaya (2005), Shin et al. (2014), Zhuchkov (2015), Weese et al. (2016), Khakifirooz et al. (2018), Ning and You (2018), Stanley (2018)

Category: Prescriptive Analytics techniques
Offline techniques:
• prognostic analysis
• Proactive Alerts
• Bayesian belief network
• time-series model
Online techniques: none listed
References: Wang, Cheng, et al. (2018), Wang, Liu, et al. (2018), Chongwatpol (2016), Krumeich et al. (2014)
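
To make one entry of Table 5 concrete, the following minimal Python sketch illustrates a basic control chart of the kind used in statistical process control. It is an assumption-laden illustration rather than a method taken from any of the reviewed papers: the data values and function names are hypothetical, and the limits are set at three sample standard deviations around the baseline mean, which is a simplification of the moving-range estimate often used for individuals charts.

```python
from statistics import mean, stdev


def control_limits(baseline, k=3.0):
    """Estimate the center line and k-sigma limits from in-control baseline data."""
    center = mean(baseline)
    sigma = stdev(baseline)  # sample standard deviation (simplifying assumption)
    return center - k * sigma, center, center + k * sigma


def out_of_control(readings, lower, upper):
    """Return the indices of readings that fall outside the control limits."""
    return [i for i, x in enumerate(readings) if x < lower or x > upper]


# Hypothetical in-control measurements used to set the limits.
baseline = [10.1, 9.9, 10.0, 10.2, 9.8, 10.0, 10.1, 9.9]
lower, center, upper = control_limits(baseline)

# Hypothetical new measurements checked against those limits.
new_readings = [10.0, 10.3, 11.5, 9.9, 8.2]
flagged = out_of_control(new_readings, lower, upper)
print(flagged)
```

The same structure applies whether the readings come from a stored batch or arrive one at a time; only the way the limits are refreshed would differ.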

integrates present process states and events to estimate when equipment requires maintenance, so as to minimize unplanned shutdown based on the usage, age, and performance of the equipment (Chiang et al., 2017). According to Krumeich et al. (2016), big data acquisition and sensor technology make it possible to locate and cluster equipment defects in order to easily identify, diagnose and solve equipment problems before a failure actually occurs. Other aspects of PD&M have been widely discussed in the literature. For instance, Beneventi, Bartolini, Cavazzoni, and Benini (2017), Zhang, Ren, Liu, and Si (2017), and Zhang, Li, Li, Yang, and Zou (2017) discussed Online/Real Time Predictive Maintenance, which integrates equipment behavior patterns by finding thresholds and relations between parameters that can be used to indicate potential problems (i.e., diagnostics) and maintenance requirements (i.e., prognostics). Finally, BDA is also applied for remote or tele-maintenance of equipment along with some other maintenance-related functions such as management of spares inventories and consumption (Hammer, 2018).
Safety and Risk Analysis (S&RA): Owing to the advance and the sophistication of current manufacturing processes, risk and safety analysis is today more challenging and time-consuming (Zerrouki & Smadi, 2017). Undoubtedly, Safety and Risk Analysis in manufacturing process would benefit from proper application of BDA (Choi, Wallace, & Wang, 2017). As stated by Hammer (2018), application of advanced Analytics, along with rendering and acting on the given insights, leads to setting up strong safety requirements and thus protecting the physical safety of the workers, users and the environment. Zerrouki and Smadi (2017) demonstrated the use of Bayesian Networks (BN) in HAZOP analysis. In the same vein, Khakzad and Reniers (2016) applied Advanced Analytics to risk-based design and decision making in chemical plants to employ the principles of Inherently Safer Design (ISD) and Land-Use Planning (LUP). In addition, Chiremsel, Said, and Chiremsel (2016) used Advanced Analytics to diagnose the Safety Instrumented Systems (SISs) in order to prevent the occurrence of hazardous events and to mitigate their consequences for workers, equipment, and the environment. Hu et al. (2018) and Li (2016) underlined, in addition to HAZOP-based analytics, the diagnosis of abnormal working conditions and decisions on emergency treatment of major accidents.

4.2.2. Big data analytics faculties
Undoubtedly, the application of BDA holds enormous potential for improvement in the different challenges that manufacturing processes are facing. The driving force behind this huge potential is represented by quite a few faculties of BDA that can benefit the manufacturing process.
Data Warehousing and Management (DW&M): In the context of a manufacturing process, data is widely dispersed and can be extracted from extremely varied sources. For that, BDA offers the faculty of collecting, integrating, transforming and storing data from disparate data sources where traditional database systems are ineffective (Hu et al., 2018). DW&M is a prerequisite for ensuring successful data mining. This would lead to improved data quality enabling the enhancement of the accuracy and performance of the subsequent


mining process (Cheng et al., 2018). Hidalgo, Rosas, Vasquez, and Wladdimiro (2018) discussed self-adaptive stream processing systems dealing with high frequency data streams whose time-varying characteristics challenge the capacities of traditional stream processing systems. Krumeich et al. (2016) outlined the ability to preprocess real-time data via the in-memory data management platform so that structured information can be extracted from unstructured multimedia data. Zhang, Ren, et al. (2017), and Zhang, Li, et al. (2017) highlighted the efficiency of the business-to-manufacturing markup language (B2MML) and extensible markup language (XML).
Data Mining and Analytics (DM&A): Data mining algorithms or methods are applied to extract useful insights and knowledge from huge amounts of data (Cheng et al., 2018). Embedding big data mining results into the Process Monitoring System (PMS) and Decision Support System (DSS) creates a closed-loop system of feedback gathering and timely adjustment in order to achieve process optimization on all manufacturing process challenges (Zhang, Ren, et al., 2017). Actually, DM&A enables decision-making systems to thrive on self-learning, cognitive faculties to address real-time data and complex interrelationships. Hammer (2018) and Qiu, He, and Ji (2016) considered DM&A as a vital nexus between operational technology and information technology to form advanced process control systems (APC) with advanced analytics. Chiang et al. (2017) reported the successful use of Enterprise Manufacturing Intelligence (EMI) in the case of the chemical sector.
Techniques of DM&A can be divided according to the time until decision into two categories, viz. real-time techniques and offline techniques. Table 5 and Fig. 8 depict the classification of offline/online DM&A techniques according to the four stages of BDA, namely descriptive, inquisitive, predictive and prescriptive analytics.
As shown in Fig. 8a, apart from a few studies dealing with prescriptive analytics techniques (7% of studies), most research works are primarily concerned with the study and implementation of descriptive, inquisitive and predictive analytics techniques (93% of studies). On the other hand, according to Fig. 8b, offline techniques are used much more than online techniques (38 offline techniques versus 12 online techniques), especially in papers offering the analysis of some real case studies such as Kozjek, Vrabič, Rihtaršič, and PeterButala (2018), Shukla and Mattar (2019), and Tewari and Dwivedi (2019).
Data-Driven Culture (DDC): Data-driven culture is an indispensable faculty that BDA brings to the monitoring and optimization of manufacturing process. DDC is a thought pattern that summarizes a set of convictions, mindsets, attitudes, and ways towards process optimization. In traditional processes, the vast troves of data collected are typically used only for monitoring purposes and not as a basis for improving operations (Sadati, Chinnam, & Nezhad, 2018). However, the application of BDA participates effectively in culture change towards fact-based decision making through embedded analytics. Hammer (2018) argued that changes in decision-making culture gained by BDA could meaningfully improve process performance.

4.2.3. Big data analytics values
When applying BDA in manufacturing process, huge amounts of collected data can be transformed into several values and benefits for the different components of the manufacturing process. The objective is to leverage knowledge and thereby increase the value of information (Lee et al., 2018).
Enhancing transparency: The first key value realized from BDA is the improvement of intra- and inter-organizational transparency and accountability. As stated by Chongwatpol (2015), processes that tap big data will receive real-time information from sensors, RFID and other devices in full transparency and without external interference.
Improving performance: It is unanimously acknowledged that the application of BDA leads to a significant improvement in operational performance in all the challenges of the process (quality, energy, emission, safety …). Case studies indicate a meaningful improvement in performance (e.g. Chiang et al., 2017; Chongwatpol, 2016, 2015; Isaksson et al., 2018).
Supporting decision-making: According to Isaksson et al. (2018), BDA generates new insights and knowledge-based data available for consultation by the manufacturing process in order to support more decentralized decision-making. Zhang, Ren, et al. (2017) discussed the ability of BDA techniques such as Bayesian methods and decision trees to enable decision makers to make flexible decisions in the presence of attributes such as flexibility, quality, innovativeness, pro-activity and cost. Lee et al. (2018) explored the use of recent advances in deep learning and reinforcement learning to develop decision policies.
Enhancing Knowledge: Besides technical values, BDA is confirmed to cover organizational intelligence, including work and staff skills (Li, 2016). As shown by a case study presented by Li (2016), BDA makes it possible to deliver job skills training and learning via technological applications such as virtual reality technology, simulation platforms, and 3D interaction and demonstration. Choi et al. (2017) argued that BDA also helps to understand phenomena that cannot be explained theoretically.

4.3. Critical analysis and research trends

The findings of our literature review highlighted how manufacturing processes could use BDA capabilities to extract value from huge data sets through a three-layer framework. Consequently, BDA capabilities have proved essential in giving process engineers and managers drastically more information than before about the different challenges in their manufacturing process. However, a set of trends can be identified from the literature review conducted. In reality, these trends constitute research gaps that are still not fully addressed and thus need more attention.

4.3.1. Research trend 1: BDA-enabler architecture
In manufacturing process, data are streamed from multiple,

Fig. 8. Analysis of used Data Mining and Analytics techniques. a. distribution of DM&A techniques by categories. b. online vs offline techniques numbers.
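
The offline/online distinction counted in Fig. 8b can be illustrated with a small Python sketch. It is not drawn from the reviewed papers: the sensor values and the class name are hypothetical. The offline path stores the whole batch before analysing it, whereas the online path uses Welford's algorithm to refresh the mean and variance as each reading arrives, without keeping the history.

```python
from statistics import mean, variance


class RunningStats:
    """Welford's algorithm: update mean and sample variance one reading at a time."""

    def __init__(self):
        self.n = 0
        self.mean = 0.0
        self._m2 = 0.0  # running sum of squared deviations from the mean

    def update(self, x):
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self._m2 += delta * (x - self.mean)

    @property
    def variance(self):
        return self._m2 / (self.n - 1) if self.n > 1 else 0.0


# Hypothetical sensor readings.
readings = [71.2, 70.8, 71.5, 72.1, 70.9]

# Offline: the complete batch is available before any statistic is computed.
offline_mean, offline_var = mean(readings), variance(readings)

# Online: statistics are refreshed as each reading streams in.
stats = RunningStats()
for x in readings:
    stats.update(x)

print(offline_mean, stats.mean)
print(offline_var, stats.variance)
```

Both paths yield the same statistics up to floating-point rounding; the online version trades stored history for constant memory, which is the property that real-time settings rely on.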


heterogeneous and dispersed sources (structured and unstructured). Therefore, the implementation of data mining and BDA requires a smart architecture based on data management and storage strategies, governance and risk management (Chen, 2017) in order to deal with large volumes of data from different sources. Indeed, volume, variety, velocity and the other seven V’s that characterize big data mean that using BDA needs high-performance resources such as storage space and processing modules (Chiang et al., 2017; Isaksson et al., 2018). According to our literature review, authors address this issue through two major approaches. First, some proposed the establishment of internal data warehousing, which makes it possible to store and manage collected data (Beneventi et al., 2017; Choi et al., 2017; Hutter et al., 2018). This solution allows companies to have full control over their data and provides more security, but it requires huge investment costs in terms of implementation, engineering and maintenance. Second, some other works proposed the use of solutions based on outsourcing data such as Cloud Manufacturing (CM) (Kumar, Shankar, Choudhary, & Thakur, 2016; Papacharalampopoulos, Stavridis, Stavropoulos, & Chryssolouris, 2016; Qiu et al., 2016). This solution reduces costs but poses a significant risk to data security.
Although there is an increasing trend towards the implementation of BDA-enabler architecture, this is currently an underexplored area owing to several issues that arise, such as data security and privacy (Hammer, 2018), crisis and risk management (Zerrouki & Smadi, 2017), implementation costs (Shukla & Mattar, 2019), etc.

4.3.2. Research trend 2: Real-time data mining approaches
The maximum potential from BDA can be derived for optimized decision making while controlling the manufacturing process ahead of time (Hammer, 2018; Hu et al., 2018). One of the most important issues believed to be faced by the manufacturing process while using BDA is the time interval until the decision (Chiang et al., 2017; He & Wang, 2018) because of the inevitable loss of process performance between the treatment of offline data and the proactive decision (Choi et al., 2017). The time scale of this interval may vary depending on the computational environment, e.g., real-time vs. offline, and the application domain. Thus, the ability to gather and process online data instantaneously is of paramount importance.
Our literature review shows an apparent trend towards BDA models able to be run in a real-time, streaming computational environment in the context of manufacturing process (Ji & Wanga, 2017; Kumar et al., 2016; Lee et al., 2018; Li, 2016). In recent years, real-time approaches have been investigated extensively for descriptive analytics (Hidalgo et al., 2018; Tewari & Dwivedi, 2019) and inquisitive analytics (He & Wang, 2018; Ji & Wanga, 2017). On the other hand, and unlike descriptive and inquisitive analytics, which have progressed far enough along these trends to be suitable for data-intensive applications, predictive and prescriptive analytics are still in their infancy. Apart from some applications of online and real-time Predictive Maintenance (e.g. Beneventi et al., 2017; Zhang, Ren, et al., 2017), studies on predictive and prescriptive analytics are still conceptual (Cheng et al., 2018; Chiang et al., 2017; Hammer, 2018).
When investigating the reasons behind this lag, multiple issues arise, such as the need to enhance the speed of request processing and online model updating (He & Wang, 2018). For instance, Amini and Chang (2018) admitted that the limitations of their computational environment would prevent real-time monitoring from taking place. Chiang et al. (2017) argued that in spite of the significant efforts to ramp up storage capacities, there is an ongoing need to swiftly store and process data in order to achieve optimal decisions in real time. On the other hand, several authors have mentioned the cultural issue within process industries. Chiang et al. (2017) argued that the process industry is by nature slow to respond to real-time customer feedback. Therefore, real-time applications are difficult to embed in the mindset of process managers.
Overall, the emergence of IoT and sensor-driven data in the process industry has led to an increasing demand for real-time systems, which need more focus in order to contribute to further advancements of BDA research in manufacturing process.

4.3.3. Research trend 3: Integrated human-data intelligence
Human capital has been considered as a ‘must have’ for developing BDA capabilities in manufacturing process. In a company, human capital encompasses all the experience, knowledge, judgment, risk-taking propensity, and wisdom of people related to the manufacturing process (Hammer, 2018). Building upon our literature review, several studies argue that human intelligence alongside computational intelligence will experience significant evolution in the near future, reaching an advanced stage (Chiang et al., 2017; Li, 2016). On the other hand, an underlying issue has also arisen concerning the role of the human operator in the era of BDA and of increasingly automated manufacturing processes (Isaksson et al., 2018). The machine still cannot replace human intelligence and the operator is still needed in the manufacturing process. Therefore, it is of utmost importance to consider how the operator interacts with model-based control and optimization in order not to progressively weaken the operators’ skills (Isaksson et al., 2018; Shukla & Mattar, 2019). Authors such as Li (2016) discussed automated learning systems, which are important for combining human and machine intelligence and must be effectively integrated with the human learning and decision environment.
It is clearly observed that the drive towards a data-driven system, with humans and technical components working synergistically together in a socio-technical unit and providing creative added value, is a trend in the literature (Hammer, 2018; Shukla & Mattar, 2019).

4.3.4. Research trend 4: Prescriptive analytics within the manufacturing process
Prescriptive analytics is the most advanced stage of data analytics and can provide the greatest intelligence and value to the manufacturing process (Hammer, 2018). The application of prescriptive analytics has been increasingly attracting research interest within the manufacturing process in recent times (Wang, Cheng, & Deng, 2018; Wang, Liu, & Qin, 2018). For several researchers, prescriptive analytics, also called operational analytics, is increasingly sought because it constitutes the next step towards enhancing data analytics maturity and triggers optimized decision making without human intervention ahead of time (Chongwatpol, 2016; Hammer, 2018; Wang, Cheng, et al., 2018). In fact, with the temporal nature of the manufacturing process, gaining business value from the large amount of data generated requires acting expeditiously on real-time events before the value fades. This requires more than a prediction; it requires determining accurately what to do and when to do it. Therefore, prescriptive analytics is expected to become much more common and pervasive among a large array of practitioners.
In spite of this, our literature review clearly shows that prescriptive analytics within the manufacturing process is still at an early stage compared to descriptive, inquisitive or even predictive analytics. With the exception of a few initiatives (e.g. Chongwatpol, 2016; Krumeich et al., 2014; Wang, Cheng, et al., 2018), the potential of prescriptive analytics is far from being well exploited. Therefore, further research is required towards combining the outcomes of predictive analytics with big data and advanced algorithms in order to advance the next generation of manufacturing processes based on analytics systems. This will make it possible not only to define risks and potential abnormalities while addressing the manufacturing process challenges, but also to suggest actionable levers, effectively providing accurate, reliable, real-time decision support to process managers.

5. Insights and learnings from multiple-case studies

The goal of the case studies was to further explore the


interdependencies of core BDA capabilities in a real-life manufacturing process context and to uncover emerging themes.

5.1. Description of the case studies

We discuss three BDA implementation projects at different plants that are subsidiaries of NAPC. The projects are part of a larger digital transformation of the company’s operations, to which top management has clearly shown its commitment. We selected these cases because they are well documented, they involved BDA capabilities, they had implications for our research trends and they are judged to be successful projects.

5.1.1. Case 1: Implementing BDA in a fertilizers plant
In April 2018, a fertilizers plant was chosen as the pilot area of the BDA implementation project. To that end, the management hired a highly qualified staff of IT engineers and data scientists to support the process and maintenance engineers during the implementation program. The first prerequisite was the implementation of a real-time system through the creation of a Data Hub that integrates multiple sources of data (structured and unstructured) from different stages of the manufacturing process and across various departments (production, maintenance, laboratories, …). The aim was to combine these data with historical information for better process monitoring. In doing so, the manufacturing process of the company was endowed with a range of IT capabilities. For instance, a Hadoop Data Lake, which is also tied into HDFS (Hadoop Distributed File System), was acquired in order to ensure the DW&M faculty. Indeed, the Hadoop Data Lake constitutes a solid infrastructure for real-time data management, along with several faculties such as data gathering, historicizing, recovering, analyzing, presenting, and visualizing. Several difficulties motivated the use of the Hadoop Data Lake, namely: (1) the OPC server is deployed on a Windows machine, which presents a security risk, (2) due to the lack of a LoRa network, the context data of smart sensors are not exploited, and (3) the storage capacity for DCS historical data is limited to one year. The Hadoop Data Lake brings together a wide variety of data from different platforms linked to the manufacturing process, such as:

• PI system: used by the process engineer as a self-service capability for monitoring plant data,
• OPM Treatment: used as a performance monitoring tool,
• MyOPS platform: already deployed as a mandatory tool for the HSE processes and the maintenance reliability processes,
• Oracle EAM: addresses the comprehensive and routine asset maintenance requirements of asset-intensive organizations,
• LIMS: allows effective management of the flow of samples and associated data to improve lab efficiency by standardizing workflows, tests and procedures, while providing accurate controls of the process,
• Connect and AlMaarifa: two platforms that, by applying the principle of the Internet of People (IoP), constitute an information space where people are interlinked and their knowledge and expertise are shared.

Besides the Data Hub, an Intelligent Monitoring System (IMS) was implemented based on two dimensions: (1) a vast campaign of data informatization using online sensors for process and environmental parameters (concentration, emission of toxic gas, etc.), and (2) Complex Event Processing (CEP) using specific algorithms. The aim was to track and process streams of data about events that have occurred, integrating multiple sources in order to detect situations or patterns that carry a particular meaning for the system, such as opportunities or threats, and to react to them.

5.1.2. Case 2: BDA implementation in a phosphoric acid plant
In early 2019, a BDA implementation project was conducted in a phosphoric acid plant. The project was based on three maturity levels, viz. people, organization and infrastructure.
People: A key activity at this level was to involve the workforce in developing the vision. Accordingly, a number of workshops were organized to ensure that all employees had the same level of understanding of the overall project and of BDA capabilities. They were then invited to propose the issues to be covered by the BDA implementation. The change agent technique was used to ensure broad communication and the involvement of all employees. Afterwards, the staff underwent multiple training sessions on BDA modules and digital interfaces in order to develop their ability to exploit connected data systems. Another key at this level was reinforcing the use of insight analysis and data interpretation to streamline operational processes, by creating routines for using historical and real-time data analytics in the work process.
Organization: On the organization side, the focus was on introducing an efficient and flexible structure for the implementation stage by integrating a Stage-Gate project model with agile principles. A multidisciplinary team was formed, including people with an understanding of both the manufacturing processes and the digital architecture. This bridged programming and manufacturing and enabled the organization to access the full potential of the data stream. In parallel, the roles and responsibilities inherent in data management were defined by “Data governance guidelines”. Bringing in the right people and defining their roles ensured the reliability of the data management system and amplified the organization’s digitalization potential.
Infrastructure: In order to create an enabling architecture for BDA implementation, the first keystone was connecting existing applications across the data flow to create a common platform. Afterwards, a real-time performance analysis system was implemented, based on cloud-based connected services provided by the PI system. The system relied on a real-time data infrastructure that collects sensor-based data from many different sources and stores them in a secure, central location to facilitate their real-time visualization, analysis and sharing. The system enabled automated analysis of operational information and gave warning signals to the environment, production and maintenance departments. Alerts such as emissions, up/downtime, MTBF, MTTR and failure rates of equipment and production tools were shared online to be analyzed, and proactive maintenance was boosted and operated accordingly, to attain a plant that is more sustainable in both economic and environmental terms as equipment lifetime is prolonged and durability enhanced.

5.1.3. Case 3: Implementation of an intelligent and self-controlled production unit
The project concerned the implementation of an intelligent and self-controlled unit for the concentration of phosphoric acid. In that vein, the unit needed to have four characteristics: (1) connectedness, (2) context awareness, (3) intelligence and (4) metered services. Therefore, the overall architecture of the system was based on cloud-based manufacturing equipment using CPS and BDA.
The first step was to upgrade the manufacturing equipment to cloud-based manufacturing equipment. For that, several modules were integrated into the manufacturing devices in order to improve the cognitive nature of the system in understanding its status and communicating this information to the external environment.

• Intelligent adaptive control module, which covers the principal control options such as interpolation calculation, input/output management, motion control, etc. using fuzzy logic and neural network control,
• Equipment monitoring module, which monitors all the data attributes (working progress of production tasks, real-time process parameters, etc.) of cloud-based manufacturing equipment. The aim is to generate production history in the cloud for subsequent data analytics in the cyberspace.
• Data processing module functioning with the numerical control kernel to pre-process all the unit condition data that is collected


from different sensors before the data goes to the cloud for further analysis.

The data attributes were then transferred to the cloud via an OPC system. The objective is to create a permanent link with the cyberspace to maintain the digital twin of the cloud-based equipment. This stream is bidirectional, since it transfers orders back from the cyberspace to the smart equipment, which executes the actions based on the MQTT protocol.
The self-control of the phosphoric acid concentration unit increased the quality of the produced phosphoric acid and brought several environmental benefits, such as more efficient water use during washing cycles and a reduction in HF emissions.
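The bidirectional stream described for case 3 can be illustrated with a minimal sketch. It is not the case company's implementation: two in-memory queues stand in for the OPC uplink and the MQTT downlink, and the class name, the `adjust_steam` order, the 54.0 set point and the 1.0 tolerance are all hypothetical values chosen for illustration.

```python
from collections import deque
from dataclasses import dataclass, field
from statistics import mean
from typing import List, Optional


@dataclass
class DigitalTwin:
    """Cloud-side mirror of one piece of equipment (hypothetical structure)."""
    history: list = field(default_factory=list)
    target: float = 54.0     # assumed concentration set point (illustrative)
    tolerance: float = 1.0   # assumed acceptable drift (illustrative)

    def update(self, reading: dict) -> Optional[dict]:
        """Store the reading and return an order if it drifts from the target."""
        self.history.append(reading)
        drift = reading["concentration"] - self.target
        if abs(drift) > self.tolerance:
            return {"action": "adjust_steam", "delta": round(-drift, 2)}
        return None


def preprocess(raw_samples: List[Optional[float]]) -> dict:
    """Equipment-side step: drop invalid samples and average before upload."""
    valid = [s for s in raw_samples if s is not None and s > 0]
    return {"concentration": round(mean(valid), 2), "n_samples": len(valid)}


uplink = deque()    # stands in for the OPC link (equipment -> cloud)
downlink = deque()  # stands in for the MQTT channel (cloud -> equipment)

twin = DigitalTwin()
uplink.append(preprocess([55.9, None, 56.1, 56.0]))

# Cloud side: consume pre-processed readings and queue orders for the equipment.
while uplink:
    order = twin.update(uplink.popleft())
    if order is not None:
        downlink.append(order)

orders = list(downlink)
```

With the sample readings above, the averaged concentration is 56.0, so the twin queues a single corrective order with a delta of -2.0. In a real deployment the two queues would be replaced by actual OPC and MQTT clients.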

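5.2. Recommendations drawn from the case studies

When analyzing the experiences of BDA implementation within the manufacturing process in the case studies, several lessons can be learned. From these, a number of recommendations can be drawn to address the research trends that emerge from the literature. Table 6 summarizes the proposed recommendations.

Table 6
The summary of the proposed recommendations.
Research trend | Recommendations
BDA-enabler architecture | Rec1: The implementation of internal sourcing (Data Hub) for the storage and processing of big data can resolve the issue of data security and privacy. However, the issue of implementation costs arises
BDA-enabler architecture | Rec2: External sourcing such as cloud computing can offer several opportunities for data management at the expense of privacy
BDA-enabler architecture | Rec3: The definition of roles and responsibilities inherent in data management through “Data governance guidelines” may reduce data security risks
Real-time data mining approaches | Rec4: Intelligent Monitoring System (IMS) using sensor-driven information and Complex Event Processing (CEP) can promote the implementation of real-time approaches
Real-time data mining approaches | Rec5: Connection between the different process elements and applications is crucial to promote real-time monitoring
Real-time data mining approaches | Rec6: Cloud manufacturing (CM) can provide solutions to promote real-time techniques application
Integrated Human-Data intelligence | Rec7: Broad communication using IoP along with early involvement of all employees ensures the integration of humans in BDA
Integrated Human-Data intelligence | Rec8: Hiring skilled employees alongside advanced training sessions on data science can promote the tie of human-data intelligence connection
Prescriptive analytics within the manufacturing process | Rec9: Generic BDA methods and algorithms utilizing artificial intelligence and machine learning can serve as a basis while developing prescriptive approaches
Prescriptive analytics within the manufacturing process | Rec10: Generalization of automation and sensor-driven information is of utmost importance in the next generation of prescriptive analytics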

5.2.1. BDA-enabler architecture

In case 1, a Data Hub relying on internal sourcing was implemented, providing great benefits in terms of the large storage capacity offered by the Hadoop Data Lake. Indeed, the huge data warehousing capacity made it possible to keep a history of process parameters beyond the one year available in the DCS, and then to model trends more accurately based on advanced analytics. Although this solution is not widespread in the literature due to the required investment and the high technical complexity, it is shown to be beneficial, especially for resolving the problem of privacy and data security.
Recommendation 1: The implementation of internal sourcing (Data Hub) for the storage and processing of big data can resolve the issue of data security and privacy. However, the issue of implementation costs arises.
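As a rough illustration of what a Data Hub does when it integrates several sources into one history, the sketch below merges time-stamped records into a single timeline. The source names (PI, LIMS, EAM) come from case 1; the record layout, tag names and sample values are invented for the example, and an in-memory list stands in for the Hadoop Data Lake.

```python
import heapq
from datetime import datetime

# Hypothetical records: (timestamp, source, tag, value). All values are made up.
pi_system = [
    (datetime(2018, 4, 2, 8, 0), "PI", "reactor_temp_C", 82.5),
    (datetime(2018, 4, 2, 8, 10), "PI", "reactor_temp_C", 83.1),
]
lims = [
    (datetime(2018, 4, 2, 8, 5), "LIMS", "p2o5_pct", 46.2),
]
eam = [
    (datetime(2018, 4, 2, 8, 7), "EAM", "work_order", "WO-1"),
]


def build_hub(*sources):
    """Merge source streams (each already sorted by time) into one timeline."""
    return list(heapq.merge(*sources, key=lambda record: record[0]))


def history(hub, tag):
    """Return the (timestamp, value) history of one tag across all sources."""
    return [(ts, value) for ts, _source, t, value in hub if t == tag]


hub = build_hub(pi_system, lims, eam)
temperature_history = history(hub, "reactor_temp_C")
```

Keeping every source on one time axis is what allows process, laboratory and maintenance data to be analyzed together, and the retained history is no longer bounded by the limits of any single source system.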
Another solution for implementing a BDA-enabler architecture is to rely on outsourced resources provided by cloud manufacturing, as explored in case 2 and case 3. This is the most popular solution in the literature (Kumar et al., 2016; Papacharalampopoulos et al., 2016; Qiu et al., 2016), as it gives access to various functionalities for storing and processing data.
Recommendation 2: External sourcing such as cloud computing can offer several opportunities for data management at the expense of privacy.
As shown in case 2, the definition of roles and responsibilities related to the data management strategy is of utmost importance in order to mitigate privacy and data security risks.
Recommendation 3: The definition of roles and responsibilities inherent in data management through “Data governance guidelines” may reduce data security risks.


5.2.2. Real-time data mining approaches

Intelligent monitoring is a well-established system of technology management that combines real-time sensing with project-specific data processing such as complex event processing (CEP), predictive analytics, and collaborative tools for data interpretation and decision-making (Chen, Fan, & Chang, 2016; Hu et al., 2018; Krumeich et al., 2014).
Recommendation 4: Intelligent Monitoring System (IMS) using sensor-driven information and Complex Event Processing (CEP) can promote the implementation of real-time approaches.
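To make the idea of Complex Event Processing more concrete, the following is a minimal sketch of one CEP-style rule over a stream of sensor events. The paper does not specify the algorithms used in the cases, so the rule (several consecutive readings above a threshold), the event fields, the sensor names and the numeric values are assumptions made purely for illustration.

```python
from collections import deque


def detect_sustained_exceedance(events, threshold, window):
    """Yield an alert when `window` consecutive readings of one sensor exceed `threshold`.

    For simplicity a single threshold is applied to every sensor.
    """
    recent = {}  # sensor name -> its last `window` values
    for event in events:
        buffer = recent.setdefault(event["sensor"], deque(maxlen=window))
        buffer.append(event["value"])
        if len(buffer) == window and all(v > threshold for v in buffer):
            yield {
                "sensor": event["sensor"],
                "t": event["t"],
                "pattern": "sustained_exceedance",
            }
            buffer.clear()  # do not alert again on the same run of readings


# Hypothetical event stream mixing two sensors.
stream = [
    {"t": 0, "sensor": "HF", "value": 1.2},
    {"t": 1, "sensor": "HF", "value": 2.6},
    {"t": 2, "sensor": "SO2", "value": 9.0},
    {"t": 3, "sensor": "HF", "value": 2.8},
    {"t": 4, "sensor": "HF", "value": 3.1},
    {"t": 5, "sensor": "HF", "value": 1.0},
]

alerts = list(detect_sustained_exceedance(stream, threshold=2.5, window=3))
```

Here a single alert is raised for the HF sensor at t = 4, the first moment at which three consecutive HF readings exceed the threshold. The isolated SO2 reading does not trigger anything because the pattern requires a sustained run, which is the kind of distinction between a raw reading and a meaningful situation that CEP is meant to capture.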


In all case studies, establishing communication between the different elements and applications in the manufacturing process, either by creating a common platform (case 1) or by using a virtual cyberspace (case 2 and 3), proved very beneficial for developing effective real-time monitoring.
Recommendation 5: Connection between the different process elements and applications is crucial to promote real-time monitoring.
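Among the indicators that case 2 shared online were MTBF and MTTR. As a hedged illustration of how such indicators might be derived once downtime events are available on a common platform, the sketch below uses the usual definitions (MTBF as uptime divided by the number of failures, MTTR as downtime divided by the number of failures). The function name, the log format and the sample figures are hypothetical.

```python
def reliability_kpis(failures, period_hours):
    """Compute MTBF, MTTR and availability over one observation period.

    failures: list of (start_hour, end_hour) downtime intervals inside the period.
    """
    count = len(failures)
    downtime = sum(end - start for start, end in failures)
    uptime = period_hours - downtime
    if count == 0:
        return {"mtbf_h": None, "mttr_h": None, "availability": 1.0}
    return {
        "mtbf_h": uptime / count,      # mean time between failures
        "mttr_h": downtime / count,    # mean time to repair
        "availability": uptime / period_hours,
    }


# Hypothetical downtime log for a 30-day (720 h) period.
log = [(100, 104), (350, 352), (600, 606)]
kpis = reliability_kpis(log, period_hours=720)
```

For this sample log the downtime totals 12 hours over three failures, giving an MTBF of 236 hours, an MTTR of 4 hours and an availability of about 98.3%.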


One of the most remarkable fields of interest in addressing the real-time challenge of BDA is cloud manufacturing (CM), which represents a loosely connected network of manufacturing services that can be swiftly adopted in the process (Isaksson et al., 2018). In agreement with several studies in the literature (e.g. Papacharalampopoulos et al., 2016; Kumar et al., 2016; Qiu et al., 2016), the results of case 2 and case 3 confirm the usefulness of CM in advancing real-time control of the process.
Recommendation 6: Cloud manufacturing (CM) can provide solutions to promote real-time techniques application.

5.2.3. Integrated Human-Data intelligence
The implementation of IoP along with knowledge sharing platforms in case 1 (e.g. AlMaarifa for Knowledge Management, and Connect) is believed to ensure an early involvement of humans in BDA implementation. These platforms are a real tool for enhancing the knowledge of operators, who become more familiar with new technologies.
Recommendation 7: Broad communication using IoP along with early involvement of all employees ensures the integration of humans in BDA.
The key to leveraging the outcomes of BDA is to equip managers and operators with solid professional proficiency, as depicted in case 2 and case 3. Hence, it is vital that companies provide analytical training courses in fields like basic statistics, data mining and business intelligence to the process operators, who will assume an important support role in the new information-rich work context.
Recommendation 8: Hiring skilled employees alongside advanced training sessions on data science can promote the tie of human-data intelligence connection.

5.2.4. Prescriptive analytics within the manufacturing process
A starting point for switching to the new generation of prescriptive analytics is predictive BDA methods and algorithms. As described in case 3, fuzzy logic and neural networks were utilized. The same conclusion can be drawn from the extant literature (e.g. Chongwatpol, 2016; Krumeich et al., 2014; Wang, Cheng, et al., 2018; Wang, Liu, et al., 2018).
Recommendation 9: Generic BDA methods and algorithms utilizing artificial intelligence and machine learning can serve as a basis while developing prescriptive approaches.
It is of paramount importance to computerize the collection of data all over the manufacturing process. To do so, sensors and intelligent devices must be generalized to enhance the ability of the process to exchange data and actions with the cyberspace.
Recommendation 10: Generalization of automation and sensor-driven information is of utmost importance in the next generation of prescriptive analytics.

6. Discussion and implications

Using BDA in an industrial context offers several opportunities for manufacturing processes. However, only a few companies fully benefit from this proven potential, especially for their manufacturing processes (Choi et al., 2017; Isaksson et al., 2018). In fact, despite the various outcomes of BDA in industrial fields, its deployment is still very limited since there is no clear documentation that groups together big data capabilities and classifies them according to each context and each use case. On the other hand, the extant research work discussing the transformative potential of BDA actually deals with different aspects of the use of BDA in the manufacturing context rather than focusing on manufacturing processes. Other works focus primarily on the practical side and propose technical models for the use of BDA, and they generally deal with one capability at a time. Nevertheless, most of these studies neither discuss the theoretical aspects nor propose any model or reference framework that represents the different BDA insights and capabilities.
The present study aimed to provide an overview of the different opportunities offered by using BDA techniques; it thereby offers a repository for researchers and companies interested in the enormous potential of BDA. To do so, we have proceeded along three different axes:

• Firstly, we conducted a systematic literature review on BDA in the context of manufacturing process, where we selected and considered 68 papers from the main scientific databases. A critical analysis of the literature was made based on a bibliometric and issues analysis to classify the selected papers according to BDA aspects, trends and categories.
• Secondly, we proposed an overall framework of BDA capabilities in manufacturing process, which presents a summary of manufacturing process challenges, big data analytics faculties and capabilities, and big data analytics values. This architecture was proposed based on the analysis of relevant works in the literature.
• Finally, we conducted multiple case studies on a leading company that has already implemented BDA in its manufacturing process. The choice of the case studies was not arbitrary. In fact, we conducted case studies on companies that use BDA to deal with the different research trends identified in our critical analysis. The purpose of these case studies is to compare the results obtained from implementing BDA in these companies with the results, affirmations and predictions found in the literature. We also aimed to analyze the cases and suggest new lines of research.

It is noteworthy that our paper is the first to combine a systematic literature review with multiple case studies to offer a holistic overview of implementing BDA in manufacturing process.

6.1. Theoretical implications

The present paper yields some interesting insights for theoretical implications of using BDA in manufacturing process and contributes to its methodological literature. Thus, this study provides a broader understanding of BDA implications within the manufacturing process by conceptualizing various concepts related to BDA in manufacturing process in a holistic and data-driven manner.
The findings from the systematic literature review can help academic researchers to tackle new empirical research in this field and to clarify BDA concepts in the context of manufacturing processes, a field which is still in its initial stage. To do so, a review, a classification and a summary of all relevant papers dealing with BDA were used to complement a descriptive and bibliometric analysis. This analysis shows an exponential growth in the number of publications dealing with BDA in manufacturing process. Considering research trends, our analysis showed that research projects in the field of big data, and in industry as well, focus more and more on the topic of BDA, especially during the last four years, which represent 85% of the publications found and selected.
In general, the present manuscript fosters the heated and growing debate on BDA in manufacturing process and contributes to a deeper understanding of this relevant technology through the systematic literature review and the conceptualization of its main capabilities. Furthermore, this paper groups the main BDA challenges, faculties and values related to manufacturing process in a comprehensive way, based on the results of our systematic review and on the trends analysis conducted, which will allow researchers to better understand BDA capabilities and opportunities.
In this paper, we made a critical analysis of selected research projects and classified them according to each Data Mining and Analytics category (descriptive, inquisitive, predictive and prescriptive analytics). Then we identified the different techniques used in each paper and divided them into two categories (offline techniques and real-time online techniques). The results of our critical analysis led us to identify the main research trends (BDA-enabler architecture, Real-time data mining approaches, Integrated Human-Data intelligence and Prescriptive analytics within the manufacturing process). Afterward we


analyzed each identified trend in order to respond to the main BDA research questions with regard to performance management, production control and maintenance in manufacturing processes.
The conducted analysis and conceptualization led us to a theoretical framework of BDA capabilities, which might contribute to redirecting future research in this field. The proposed framework can be a good basis for all researchers who aim to understand the opportunities of using BDA and its limitations. The framework is composed of three related layers. The first layer groups BDA challenges into four distinct categories: quality and process control, energy and environment efficiency, predictive diagnosis and maintenance, and safety and risk analysis. The second layer presents the BDA faculties in a comprehensive way. The main goal of building this layer is to demonstrate that just a few faculties can represent the enormous potential of BDA. This approach will allow researchers to focus on these faculties instead of being dispersed by analyzing all the capabilities described in the literature. The third layer concerns BDA values and groups the main BDA opportunities into four categories: enhancing transparency, improving performance, supporting decision-making and enhancing knowledge. The objective is to offer an overall view of BDA opportunities based on different aspects and objectives of the manufacturing industry.

6.2. Practical implications

The present paper yields some interesting insights for practical implications of using BDA in manufacturing process and offers some interesting results obtained by conducting and analyzing some meaningful case studies. By combining the findings of the systematic literature review and the analyzed case studies, this paper has attempted to provide researchers and managers with meaningful knowledge on the formulation and implementation of a BDA-enabler approach in the manufacturing process environment:

• This study gives a detailed overview of some successful case studies that may help manufacturing companies to transform into agile and fully digitalized smart factories while using BDA as an important pillar of Industry 4.0 for the manufacturing process.
• In addition, this study may help manufacturing companies to implement a fully IT-enabled BDA architecture to increase their performance efficiency, improve production quality, improve maintenance management, foster employee engagement and empowerment, and strengthen safety culture and execution in the long term.

Furthermore, the case studies carried out are in perfect accord with the identified BDA research trends. The analysis of these case studies made it possible to derive the main recommendations for using BDA according to each trend.
On the other hand, the case studies analysis led to the conclusion that, in order to successfully implement BDA in a manufacturing context, we must rely on some key transformation levers such as process and operations automation and the use of digital services and advanced analytics techniques. This finding represents a major contribution since it provides essential recommendations to consider while building full BDA capabilities. In addition, companies operating in conditions similar to those of the case company can benefit from the findings of the case studies to design an infrastructure that enables the use of BDA for their manufacturing process.

7. Conclusion and future works

In the contemporary era of Big Data, the utilization of data for manufacturing process intelligence takes on increased importance on the path towards operational excellence. On the other hand, manufacturing processes and production tools have experienced tremendous evolution over time, boosting swift technological advancement all over the industry. Therefore, researchers on the topic have argued that most companies acknowledge the importance of their data for monitoring their manufacturing process and have supported the use of enhanced analytics and business intelligence as a top priority in the near future. In this respect, the present paper has provided a better understanding of how these companies can harness the potential capabilities of Big Data Analytics (BDA) in their manufacturing processes as a way of boosting the digital transformation to gain business value. Accordingly, the present study begins with the identification of the most widely agreed capabilities of BDA in manufacturing process through the findings of a systematic literature review, in order to develop a framework of BDA capabilities in manufacturing process. Afterwards, the research uses multiple case studies to corroborate this framework in order to provide a practical way for managers to unlock the potential of BDA along the interdisciplinary capabilities identified in the study.
Like any other study, ours has a number of limitations that must be recognized. The first limitation of this study is the selection of papers to be reviewed. The restrictions imposed during the search process (e.g. use of specific keywords, specific databases…) may have left out some high-quality articles on BDA in manufacturing process. Furthermore, the qualitative analysis used to conduct the multiple case studies is another limitation of this study. Indeed, the insights gained from the case studies are qualitative, which makes it difficult to go deeper and draw objective conclusions about the capabilities of BDA in manufacturing process.
Future research may be performed on the following aspects. First, the impact of BDA capabilities on manufacturing performance must be evaluated with quantitative analysis methods based on primary data, using more in-depth empirical studies. Moreover, BDA-enabler infrastructure must be explored in specific contexts such as SMEs and service companies. It is worth noting that the research on BDA in manufacturing process, and the corresponding theoretical study, has just started; as a result, there is a lot of work to be done, driven by application requirements and related technologies, before BDA is successfully built into the manufacturing process.

References

Amini, M., & Chang, S. I. (2018). MLCPM: A process monitoring framework for 3D metal printing in industrial. Computers & Industrial Engineering, 124, 322–330.
Arunachalam, D., Kumar, N., & Kawalek, J. P. (2018). Understanding big data analytics capabilities in supply chain. Transportation Research Part E: Logistics and Transportation Review, 114(June), 416–436.
Banerjee, A., Bandyopadhyay, T., & Acharya, P. (2013). Data analytics: Hyped up aspirations or true potential? Vikalpa, 4, 1–12.
Belhadi, A., Sha’ri, Y. B. M., Touriki, F. E., & Fezazi, S. E. (2018). Lean production in SMEs: Literature review and reflection on future challenges. Journal of Industrial and Production Engineering, 35(6), 368–382.
Belhadi, A., Touriki, F. E., & fezazi, S. E. (2016). A framework for effective implementation of lean production in Small and Medium-sized Enterprises. Journal of Industrial Engineering and Management, 9(3), 786–810.
Belhadi, A., Touriki, F. E., & fezazi, S. E. (2018). Benefits of adopting lean production on green performance of SMEs: A case study. Production Planning & Control, 29(11), 873–894.
Beneventi, F., Bartolini, A., Cavazzoni, C., & Benini, L. (2017). Continuous learning of HPC infrastructure models using big data analytics and in-memory processing tools. Design, automation & test in europe conference & exhibition (DATE) (pp. 1038–1043).
Beyer, M. A., & Laney, D. (2012). The importance of ‘big data’: A definition. s.l.:Stamford, CT: Gartner.
Chen, Y. (2017). Integrated and intelligent manufacturing: Perspectives and enablers. Engineering, 3, 588–595.
Chen, H., Chiang, R. H. L., & Storey, V. C. (2012). Business intelligence and analytics: From big data to big impact. MIS Quarterly, 36(4), 1165–1188.
Chen, Y.-J., Fan, C.-Y., & Chang, K.-H. (2016). Manufacturing intelligence for reducing false alarm of defect classification by integrating similarity matching approach in CMOS image sensor manufacturing. Computers & Industrial Engineering, 99, 465–473.
Cheng, Y., Chen, K., Sun, H., Zhang, Y., & Tao, F. (2018). Data and knowledge mining with big data towards smart production. Journal of Industrial Information Integration, 9, 1–13.
Chiang, L., Lu, B., & Castillo, I. (2017). Big Data analytics in chemical engineering. Annual review of chemical and biomolecular engineering, 8, 63–85.
Chien, C.-F., Liu, C.-W., & Chuang, S.-C. (2017). Analysing semiconductor manufacturing big data for root cause detection of excursion for yield enhancement. International


Journal of Production Research, 55(17), 5095–5107.
Chiremsel, Z., Said, R. N., & Chiremsel, R. (2016). Probabilistic fault diagnosis of safety instrumented systems based on fault tree analysis and Bayesian network. Journal of Failure Analysis and Prevention, 16(5), 747–760.
Choi, T., Wallace, S. W., & Wang, Y. (2017). Big data analytics in operations management. Production and Operations Management, 27(10), 1868–1883.
Chongwatpol, J. (2015). Prognostic analysis of defects in manufacturing. Industrial Management & Data Systems, 115(1), 64–87.
Chongwatpol, J. (2016). Managing big data in coal-fired power plants: A business intelligence framework. Industrial Management & Data Systems, 116(8), 1779–1799.
Eckstein, J., Peitz, S., Schäfer, K., Friedel, P., Köhler, U., Hessel-von Molo, M., ... Dellnitz, M. (2016). A comparison of two predictive approaches to control the longitudinal dynamics of electric vehicles. Procedia Technology, 26, 465–472.
Favoino, F., Fiorito, F., Cannavale, A., Ranzi, G., & Overend, M. (2016). Optimal control and performance of photovoltachromic switchable glazing for building integration in temperate climates. Applied Energy, 178, 943–961.
Gandomi, A., & Haider, M. (2015). Beyond the hype: Big data concepts, methods, and analytics. International Journal of Information Management, 35(2), 137–144.
Ge, Z., Song, Z., Ding, S. X., & Huang, B. (2017). Data mining and analytics in the process industry: The role of machine learning. IEEE Access, 5, 20590–20616.
(2012). Statistical quality control through process self-induced vibration spectrum analysis. The International Journal of Advanced Manufacturing Technology, 58(9–12), 1243–1259.
M’Sahli, F., & Matlaya, R. (2005). A neural network model based predictive control approach: Application to a semi-batch reactor. The International Journal of Advanced Manufacturing Technology, 26(1–2), 161–168.
MacGregor, J. F., Bruwer, M. J., Miletic, I., Cardin, M., & Liu, Z. (2015). Latent variable models and big data in the process industries. IFAC-Papers OnLine, 48(8), 520–524.
Manyika, J., Bughin, J., Chui, M., Dobbs, R., Brown, B., & Roxburgh, C. (2011). Big data: The next frontier for innovation, competition, and productivity. s.l.: McKinsey & Company.
Mehta, P., Butkewitsch-Choze, S., & Seaman, C. (2018). Smart manufacturing analytics application for semi-continuous manufacturing process–a use case. Procedia Manufacturing, 26, 1041–1052.
Ning, C., & You, F. (2018). Data-driven decision making under uncertainty integrating robust optimization with principal component analysis and kernel smoothing methods. Computers & Chemical Engineering, 112, 190–210.
O’Donovan, P., Leahy, K., Bruton, K., & O’Sullivan, D. T. J. (2015). Big data in manufacturing: A systematic mapping study. Journal of Big Data, 2(20), 1–22.
Papacharalampopoulos, A., Stavridis, J., Stavropoulos, P., & Chryssolouris, G. (2016).
Grossmann, I. E. (2014). Challenges in the application of mathematical programming in Cloud-based control of thermal based manufacturing processes. Procedia CIRP, 55,
the enterprise-wide optimization of process industries. Theoretical Foundations of 254–259.
Chemical Engineering, 48(5), 555–573. Paul, D., Zhong, W.-D., & Bose, S. K. (2016). Energy efficient cloud service pricing: A two-
Gunasekaran, A., Yusuf, Y. Y., Adeleye, E. O., & Papadopoulos, T. (2018). Agile manu- timescale optimization approach. Journal of Network and Computer Applications, 64,
facturing practices: The role of big data and business analytics with multiple case 98–112.
studies. International Journal of Production Research, 56(1–2), 385–397. Qin, S. J. (2014). Process data analytics in the era of big data. AIChE Journal, 60(9),
Hammer, M. (2018). Digitization perspective: Impact of digital technologies in manu- 3092–3100.
facturing. Management Approach for Resource-Productive Operations, 27–68. Qiu, X., He, G., & Ji, X. (2016). Cloud manufacturing model in polymer material industry.
Hammer, M., Somers, K., Karre, H., & Ramsauer, C. (2017). Profit per hour as a target The International Journal of Advanced Manufacturing Technology, 84(1–4), 239–248.
process control parameter for manufacturing systems enabled by big data analytics Rowley, J., & Slack, F. (2004). Conducting a literature review. Management Research News,
and industry 4.0 infrastructure. Procedia CIRP, 63, 715–720. 27(6), 31–39.
He, Q. P., & Wang, J. (2018). Statistical process monitoring as a big data analytics tool for Sadati, N., Chinnam, R. B., & Nezhad, M. Z. (2018). Observational data-driven modeling
smart manufacturing. Journal of Process Control, 67, 35–43. and optimization of manufacturing processes. Expert Systems with Applications, 93,
Hidalgo, N., Rosas, E., Vasquez, C., & Wladdimiro, D. (2018). Measuring stream proces- 456–464.
sing systems adaptability under dynamic. Future Generation Computer Systems, 88, Seuring, S., & Müller, M. (2008). From a literature review to a conceptual framework for
413–423. sustainable supply chain management. Journal of Cleaner Production, 16(15),
Hu, W., Shah, S. L., & Chen, T. (2018). Framework for a smart data analytics platform 1699–1710.
towards process monitoring and alarm management. Computers & Chemical Severson, K., Chaiwatanodom, P., & Braatz, R. D. (2015). Perspectives on process mon-
Engineering, 114, 225–244. itoring of industrial systems. IFAC-PapersOnLine, 48(21), 931–939.
Hutter, T., Haeussler, S., & Missbauer, H. (2018). Successful implementation of an order Shao, Y., Li, C., Gu, J., Zhang, J., & Luo, Y. (2018). Efficient jobs scheduling approach for
release mechanism based on workload control: A case study of a make-to-stock big data applications. Computers & Industrial Engineering, 117, 249–261.
manufacturer. International Journal of Production Research, 56(4), 1565–1580. Shin, S.-J., Woo, J., & Rachuri, S. (2014). Predictive analytics model for power con-
Isaksson, A. J., Harjunkoski, I., & Sand, G. (2018). The impact of digitalization on the sumption in manufacturing. Procedia CIRP, 15, 153–158.
future of control and operations. Computers and Chemical Engineering, 114, 122–129. Shrouf, F., Ordieres, J., & Miragliotta, G. (2014). Smart factories in Industry 4.0: A review
Ji, W., & Wanga, L. (2017). Big data analytics based fault prediction for shop floor of the concept and of energy management approached in production based on the
scheduling. Journal of Manufacturing Systems, 43, 187–194. Internet of Things paradigm. 2014 IEEE international conference on industrial en-
Joseph, R. C., & Johnson, N. A. (2013). Big data and transformational government. IT gineering and engineering management (IEEM) (pp. 697–701). .
Professional, 15(6), 43–48. Shukla, M., & Mattar, L. (2019). Next generation smart sustainable auditing systems using
Ketokivi, M., & Choi, T. (2014). Renaissance of case research as a scientific method. Big Data Analytics: Understanding the interaction of critical barriers. Computers &
Journal of Operations Management, 32(5), 232–240. Industrial Engineering, 128, 1015–1026.
Khakifirooz, M., Chien, C. F., & Chen, Y.-J. (2018). Bayesian inference for mining semi- Sivarajah, U., Kamal, M. M., Irani, Z., & Weerakkody, V. (2017). Critical analysis of Big
conductor manufacturing big data for yield enhancement and smart production to Data challenges and analytical methods. Journal of Business Research, 70, 263–286.
empower industry 4.0. Applied Soft Computing, 68, 990–999. Smirnov, P. A., & Nasonov, D. (2016). Quality-based workload scaling for real-time
Khakzad, N. R., & Reniers, G. (2016). Application of Bayesian network and multi-criteria streaming systems. Procedia Computer Science, 101, 323–332.
decision analysis to risk-based design of chemical plants. Chemical Engineering Stanley, G. (2018). Big Data Approximating Control (BDAC)—A new model-free esti-
Transactions, 48, 223–228. mation and control paradigm based on pattern matching and approximation. Journal
Kho, D. D., Lee, S., & Zhong, R. Y. (2018). Big Data Analytics for processing time analysis of Process Control, 67, 141–159.
in an IoT-enabled manufacturing shop floor. Procedia Manufacturing, 26, 1411–1420. Stojanovic, N., Dinic, M., & Stojanovic, L. (2015). Big Data process analytics for con-
Kozjek, D., Vrabič, R., Rihtaršič, B., & PeterButala (2018). Big data analytics for opera- tinuous process improvement in manufacturing. IEEE International Conference on Big
tions management in engineer-to-order manufacturing. Procedia CIRP, 72, 209–214. Data (Big Data), 1398–1407.
Krumeich, J., Jacobi, S., Werth, D., & Loos, P. (2014). Big Data Analytics for predictive Sun, B., Jämsä-Jounela, S. L., Todorov, Y., Olivier, L. E., & Craig, I. K. (2017). Perspective
manufacturing control - A case study from process industry. IEEE international con- for equipment automation in process industries. IFAC-PapersOnLine, 50(2), 65–70.
gress on big data (pp. 530–537). . Tewari, S., & Dwivedi, U. (2019). Ensemble-based big data analytics of lithofacies for
Krumeich, J., Werth, D., & Loos, P. (2016). Prescriptive control of business processes. automatic development of petroleum reservoirs. Computers & Industrial Engineering,
Business & Information Systems Engineering, 58(4), 261–280. 128, 937–947.
Kumar, A., Shankar, R., Choudhary, A., & Thakur, L. S. (2016). A big data MapReduce Wamba, S. F., Akter, S., & Bourmont, M. D. (2018). Quality dominant logic in big data
framework for fault diagnosis in cloud-based manufacturing. International Journal of analytics and firm performance. Business Process Management Journal, 25(3),
Production Research, 54(23), 7060–7073. 512–532.
Lee, J. H., Shin, J., & Realff, M. J. (2018). Machine learning: Overview of the recent Wamba, S. F., Akter, S., Edwards, A., Chopin, G., & Gnanzou, D. (2015). How ‘big data’
progresses and implications for the process systems engineering field. Computers & can make big impact: Findings from a systematic review and a longitudinal case
Chemical Engineering, 114, 111–121. study. International Journal of Production Economics, 165, 234–246.
Lee, J. Y., Yoon, J. S., & Kim, B.-H. (2017). A big data analytics platform for smart fac- Wamba, S. F., Gunasekaran, A., Akter, S., Ren, S. J. F., Dubey, R., & Childe, S. J. (2017).
tories in small and medium-sized manufacturing enterprises: An empirical case study Big data analytics and firm performance: Effects of dynamic capabilities. Journal of
of a die casting factory. International Journal of Precision Engineering and Business Research, 70, 356–365.
Manufacturing, 18(10), 1353–1361. Wang, C.-H., Cheng, H.-Y., & Deng, Y.-T. (2018). Using Bayesian belief network and time-
Li, D. (2016). Perspective for smart factory in petrochemical industry. Computers & series model to conduct prescriptive and predictive analytics for computer industries.
Chemical Engineering, 91, 136–148. Computers & Industrial Engineering, 115, 486–494.
Li, Y., & Kashiwagi, H. (2005). High-order Volterra Model Predictive Control and its Wang, G., Gunasekaran, A., Ngai, E. W., & Papadopoulos, T. (2016). Big data analytics in
application to a nonlinear polymerisation process. International Journal of Automation logistics and supply chain management: Certain investigations for research and ap-
and Computing, 2(2), 208–214. plications. International Journal of Production Economics, 176, 98–110.
Lindström, J., Larsson, H., Jonsson, M., & Lejon, E. (2017). Towards intelligent and Wang, W., Liu, X., & Qin, Y. (2018). A fuzzy Fine-Kinney-based risk evaluation approach
sustainable production: Combining and integrating online predictive maintenance with extended MULTIMOORA method based on Choquet integral. Computers &
and continuous quality control. Procedia CIRP, 63, 443–448. https://linkinghub. Industrial Engineering, 125, 111–123.
elsevier.com/retrieve/pii/S2212827117302457https://doi.org/10.1016/j.procir. Weese, M., Martinez, W., Megahed, F. M., & Jones-Farmer, L. A. (2016). Statistical
2017.03.099. learning methods applied to process monitoring: An overview and perspective.
López-Escobar, C., González-Palma, R., Almorza, D., Mayorga, P., & Carnero, M. C. Journal of Quality Technology, 48(1), 4–24.

17
A. Belhadi, et al. Computers & Industrial Engineering 137 (2019) 106099

Yang, H., Park, M., Cho, M., Song, M., & Kim, S. (2014). A system architecture for framework for energy-intensive manufacturing industries. Journal of Cleaner
manufacturing process analysis based on big data and process mining techniques. Production, 197, 57–72.
IEEE International Conference on Big Data (Big Data), 1024–1029. Zhang, Y., Ren, S., Liu, Y., & Si, S. (2017). A big data analytics architecture for cleaner
Zerrouki, H., & Smadi, H. (2017). Bayesian belief network used in the chemical and manufacturing and maintenance processes of complex products. Journal of Cleaner
process industry: A review and application. Journal of Failure Analysis and Prevention, Production, 142(Part 2), 626–641.
17(1), 159–165. Zhong, R. Y., Xu, C., Chen, C., & Huang, G. Q. (2017). Big Data Analytics for Physical
Zhang, H., Li, T., Li, J., Yang, S., & Zou, Z. (2017). Progress in aluminum electrolysis Internet-based intelligent manufacturing shop floors. International Journal of
control and future direction for smart aluminum electrolysis plant. JOM, 69(2), Production Research, 55(9), 2610–2621.
292–300. Zhuchkov, R. N. (2015). Application of predictive control approach in stabilizing control
Zhang, Y., Ma, S., Yang, H., Lv, J., & Liu, Y. (2018). A big data driven analytical design of networked plants. Automation and Remote Control, 76(9), 1704–1712.
