Driving Innovation With Ai: Getting Ahead With Dataops and Mlops
Driving Innovation With Ai: Getting Ahead With Dataops and Mlops
TOPICAL SURVEY
Foreword...................................................................................................................................................................................... 2
Management summary.............................................................................................................................................................. 3
DataOps and MLOps are promising approaches for greater efficiency in ML.................................................................... 9
Companies are getting ahead with DataOps and MLOps, but should be careful to bring everyone on board...........18
Demographics........................................................................................................................................................................... 20
BARC profile.............................................................................................................................................................................. 22
Sponsor profiles...................................................................................................................................................................... 23
Authors...................................................................................................................................................................................... 26
Artificial intelligence (AI), machine learning (ML) and data science are among regarding all interdependencies across involved systems along an end-to-
a large group of buzzwords associated with major breakthroughs in the end data pipeline. It is a concept that fosters collaboration between ex-
way business is done and better outcomes are achieved. In this study, we perts and makes the process of developing data products more agile and
will examine the mechanisms behind those success stories and the meth- efficient. It is relevant for all kinds of data products whose success depends
ods that come into play to achieve them. In short, we will focus on the on being up to date.
concepts of DataOps and MLOps and their impact on the application of ML.
We will evaluate the perception and adoption of these concepts in detail This also applies without exception to machine learning (ML) models, which
as well as their contribution to the success of AI, or more specifically, ML are a particular kind of data product. As such, the development and de-
applications in enterprises. Therefore, we will be addressing the functional ployment of ML models are accompanied by special requirements such as
scope of DataOps extended by MLOps without differentiating between the retraining, testing, tracking of relevant performance metrics and versioning
two. This is an exploratory study that provides an overview and uncovers of different model configurations as well as the general management of
interrelationships. code. The maintenance of efficient and automated processes for the fulfill-
ment of these special requirements is addressed by MLOps.
Because there are conceptual challenges linked to DataOps and MLOps, we Therefore, we regard MLOps as a functional extension to DataOps. Both
would like to share our understanding of these two concepts first. This will concepts derive from DevOps principles and could be regarded as an ex-
illustrate why we do not distinguish between them throughout this study. tension to it with a special focus on the quality, validity and reliability of
data as a carrier of logic.
DataOps and MLOps: Only a gradual transition from one to the other
Because of the large functional overlap and various interpretations, mark- We view DataOps, MLOps and DevOps as concepts for the design of pro-
ing borders between DataOps and MLOps is arbitrary and never produces cesses which ensure the seamless operation of software and data prod-
satisfactory results. Therefore, we will consider these concepts as one for ucts. Tools and solutions in this area generally provide functions and tech-
the purposes of this study. nical frameworks for the support of these processes.
DataOps focuses on the realization of a manageable, maintainable and au- Alexander Rode and Timm Grosser
tomated flow of quality-assured data. The goal is to achieve transparency Würzburg, May 2022
MANAGEMENT SUMMARY
Most companies are only just beginning to tap the benefits of ML for themselves. 55 percent of the compa- Developing ML models is the easy part.
nies represented in this survey have not deployed a ML model yet and only 10 percent consider themselves Get informed early about the challenges of
advanced in this area. deployment by learning about DataOps and
MLOps. This will help to prepare you as well
As soon as ML models are ready to be transferred into production, things really start to get complex. Deploy- as avoid setbacks and unpleasant surprises.
ment is a difficult hurdle for many companies to overcome.
Please describe your company´s level of advancement in the area of machine learning (n=248)
Our survey results are surprising and also More than half of our survey participants (55
somewhat deceptive. Despite the hype and percent) have not yet operationalized any ML
numerous success stories over quite a period of models, and those who have not even started
time, the majority of companies seems to be stuck developing ML models make up the largest group
in an early phase of ML advancement. (37 percent). This result reflects a reality where
DataOps/MLOps unaware
29%
ML beginners 8% 51% 41%
Developing and deploying ML successfully DataOps/MLOps adopters DataOps/MLOps aspirants DataOps/MLOps unaware
is challenging and its complexity is often
underestimated, as indicated by 68 percent of Adoption of DataOps/MLOps by ML practitioners and beginners (n=228)
our respondents. It is widely believed that the
implementation of DataOps and MLOps enables This impression becomes even more apparent But why should anyone care about DataOps and
companies to cope with these complex challenges when we differentiate between ML practitioners MLOps before they have operationalized any ML
and to increase efficiency. and beginners (see chart). ML practitioners have models? It is certainly advisable to address the
mostly started to adopt DataOps and MLOps objectives of these concepts early on in order to
As the majority of companies represented in this (50 percent) or are evaluating steps to do so (37 become aware of the many problems that may
survey has not even deployed operationalized ML percent). There seems to be no way around Data/ arise during the deployment of ML. Starting to
models yet, it is not surprising that the proportion MLOps to cope with the complexities arising from deploy and rely on ML models that are not backed
of those already adopting Data/MLOps is not very the deployment of ML. by adequate worst-case precautions can lead to
large at 26 percent (see chart). disappointment and even disaster.
Greater transparency
34%
Better reliability
33%
No improvements achieved
3%
Which significant improvements has your company achieved with the introduction of MLOps/DataOps? (n=61)
Asked whether their initial expecations regarding Adopters of these concepts are less likely to be Over 70% of the DataOps/MLOps
the positive impact of ML were met, 53 percent of surprised by unforeseen complexities: 41 percent aspirants and unaware underestimated the
adopters of Data/MLOps, 45 percent of aspirants described the level of complexity “as expected”. complexity of machine learning adoption.
and only 31 percent of those unaware of Data/
MLOps agreed. Indeed, the ‘unaware’ showed There is a greater difference in perceived
higher rates of both exceeded expectations complexity between those companies that have
and disappointment. These results suggest that already adopted DataOps/MLOps and those
companies familiar with topics around DataOps that are just considering doing so. Since around
and MLOps have more realistic expectations about three-quarters of the latter group (76 percent)
what they can achieve with machine learning. As say they underestimated the complexity of using
we will see, this finding will be confirmed when we ML, it is reasonable to assume that they only
come to evaluate expected complexity.
Our results prove that adopters of DataOps and not show much variation in terms of Data/MLOps
MLOps achieve faster time-to-market, higher adoption, faster deployments lasting just “weeks”
productivity, better scalability and higher levels of or “days” are common among adopters but are
automation. These are all measures of improved unheard of among the unaware.
efficiency and speed in delivering ML results. An
examination of the time taken from development With regard to beginners, companies that have
to deployment seems to confirm these findings not yet put ML models into production but are
(see chart). at least considering implementing DataOps/
MLOps are generally less likely to expect very long
Adopters of Data/MLOps are significantly less implementation times lasting years.
likely to spend “years” on this process (8 percent
versus 26 percent/46 percent). While deployments DataOps and MLOps not only facilitate shorter
taking “months” seem to be the standard and do implementation times but also forge the
In turn, we would also expect that adopters of The most significant effect can be identified with The higher frequency of problems relating to
Data/MLOps face fewer problems when deploying collaboration. Adopters of MLOps generally seem poor CI/CD and testing capabilities most likely
ML models. Indeed, this is the case, but the advan- to be better placed to overcome the barriers indicates a keener awareness of these challenges
tage in adopting Data/MLOps is not as significant that hinder effective cooperation between stake- on the one hand, and also that they seem to be
as we imagined. In selecting an average of 3.4 of holders. The need to overcome this problem is downstream challenges that may well be harder
the 16 options presented in our survey, adopters apparently one of the main drivers of the adoption to solve.
of Data/MLOps do not differ much from those of Data/MLOps. This is underlined by a compar-
who deploy ML without putting these concepts ison of the roles involved in the development However, handling restricted data access and
into use (3.8 out of 16). Apart this marginal quan- and deployment of ML. The average number of data silo issues seems to be outside the scope of
titative difference, significant variations regarding different roles involved for adopters is 4.2, which most Data/MLOps initiatives, as this problem is
response behavior can be observed (see chart). is significantly higher than the 3.0 for ML practi- even more widespread among adopters.
tioners not using Data/MLOps.
1 in 3
before purchasing expensive licences. Again,
There are no major differences between ML
For almost companies from our perspective, studying the principles of
practitioners and beginners regarding the general
without a platform, ML projects are far DataOps and MLOps and learning how to apply
usage of platforms. Surprisingly, 43 percent of
more complex than expected. them in practice can facilitate the identification
the companies that have not yet deployed a ML
1 in 10
of important requirements and help to avoid
model (beginners) state that their ML deployment
This is only true for common mistakes and setbacks on the road to
requirements are already covered by some form
companies with a platform. successful ML deployment.
of platform.
2 in 3
Non-adopters of Data/MLOps Adopters of Data/MLOps
Almost adopters consider 20% 37%
the strategic integration of DataOps/ Use of special Data/MLOps
technologies and tools
MLOps to be highly conducive to success 37% 53%
with ML. Hiring of experts
10% 24%
Use of end-to-end platforms
33% 43%
We have already outlined that DataOps and Data & analytics infrastructure
modernization
MLOps are valuable concepts for successful and
Internal training and 29% 37%
efficient ML initiatives. Indeed, survey data on
skills development
the success of 12 individual measures supports
this finding. 59 percent of Data/MLOps adopters Comparison of effective measures taken by ML practitioners, by DataOps/MLOps adoption status (n=102)
consider the integration of these concepts to be
highly conducive to success with ML. However, Adopters of DataOps and MLOps also seem to However, we can state that they have a greater
there are other promising measures as well. On be better able to attract capable experts on the ability to profit from the use of new technology
average, adopters were 52 percent more likely to labor market who can generate added value. This and modernized infrastructures, as they are
state that a given measure had been instrumental is no surprise, as they can usually better ensure probably more capable of selecting suitable tools
in increasing success. Looking at the biggest a professional working atmosphere because they and profiting from their range of functions. This
differences between adopters and non-adopters have already put effort into making collaboration assumption is supported by our finding that
of Data/MLOps in the chart, it is reasonable to work and establishing management frameworks adopters are significantly less likely to see a lack
assume that the implementation of Data/MLOps for ML. Nevertheless, the lack of personnel, of orientation as a challenge (see chart on the
also has a positive impact on the success of other especially in the area of advanced analytics, also next page).
measures. remains a challenge for adopters of Data/MLOps.
Industry Europe
23% 66%
Services North America
19% 27%
Public sector Asia and Pacific
18% 6%
IT South America
14% 1%
Banking and finance Africa
0%
13%
Retail/Wholesale/Trade In which region are you located? (n=163)
10%
Other
2%
BARC (Business Application Research Center) is Our independent research brings market
one of Europe’s leading analyst firms for business developments into clear focus, puts software and
software, focusing on the areas of data, business vendors through their paces and gives users a
intelligence (BI) and analytics, enterprise content place to express their opinions. Germany
management (ECM), customer relationship BARC GmbH
Events
management (CRM) and enterprise resource Berliner Platz 7
Decision-makers and IT industry leaders come D-97080 Würzburg
planning (ERP).
together at BARC events. BARC seminars in small +49 931 880 6510
Our passion is to help organizations become digital www.barc.de
groups, online webinars and conferences with
companies of tomorrow. We do this by using
more than 1,000 participants annually all offer
technology to rethink the world, trusting data- Austria
inspiration and interactivity. Through exchange
based decisions and optimizing and digitalizing BARC GmbH
with peers and an overview of current trends Hirschstettner Straße 19 / I / IS314
processes. It‘s about finding the right tools and
and market developments, you will receive new A-1200 Wien
using them in a way that gives your company the
impetus to drive your business forward. +43 660 6366870
best possible advantage.
This unique blend of knowledge, exchange of Consulting Switzerland
information and independence distinguishes In confidential expert workshops, coaching and BARC Schweiz GmbH
our services in the areas of research, events and in-house consultations, we transform the needs Täfernstraße 22a
CH-5405 Baden-Dättwil
consulting. of your company into future-proof decisions. We
+41 56 470 94 34
provide you with successful, holistic concepts that
Research
enable you to use the right information correctly. Rest of the World
Our BARC studies are based on internal market Our project support covers all stages of the +44 1536 772 451
research, software tests and analyst comments, successful use of software. www.barc-research.com
giving you the security to make the right decisions.
ABOUT DATAROBOT
DataRobot AI Cloud is the next generation of AI. DataRobot AI Cloud is built for every organization
DataRobot’s AI Cloud vision is to bring together all across any industry. From government,
data types, all users, and all environments to deliver commercial, to non-profit organizations around
critical business insights for every organization. the world, and has helped organizations across
DataRobot
DataRobot is trusted by global customers across industries harness the transformational power
Contact:
industries, including a third of the Fortune 50, of AI. From restoring supply chain resiliency to Lauren Sanborn
delivering over a trillion predictions for leading accelerating the treatment and prevention of press@datarobot.com
companies around the world. countless diseases to combating climate change, +1-617-765-4500
all while jumpstarting innovation to power
DataRobot also offers DataRobot AI Cloud for economies.
www.datarobot.com
Industries, an extension of our AI Cloud platform
Learn more at datarobot.com.
with unique capabilities and expertise tailored to
address growth, risk and scale needs across major
industries. DataRobot has delivered more than one
million active customer projects for organizations
like Mars, McLaren Racing, and Lenovo, resulting
in a vast library of best practices for applying AI to
enhance patient care, supply chain management,
fraud detection and more. DataRobot AI Cloud for
Industries extends this expertise to accelerate the
impact of AI for all banking, retail, manufacturing
and healthcare organizations.
ABOUT ONE LOGIC which analyze and link data sources via AI to meet
the biggest challenges faced by companies today.
Founded in 2013, ONE LOGIC is a leading company
in the use of artificial intelligence (AI) with turnkey As an agile provider of AI-based data products,
data products (end-to-end applications). With ONE LOGIC draws on the latest machine learning,
extensive experience in data science projects, deep learning, and AI technologies and algorithms.
ONE LOGIC Passau
ONE LOGIC is an expert in tackling industry- It also fosters partnerships with companies and Kapuzinerstraße 2c
specific challenges with targeted solutions. The universities to pursue the intensive research 94032 Passau
Germany
company’s aim is to enable its customers to and development of innovative data products.
+49 851 225 906 0
monetize their data quickly and sustainably, Our goal? To help companies rapidly and
digitize their business models, and independently comprehensively digitalize. Sustainability is always ONE LOGIC Munich
Prinzregentenstraße 50
generate added value. It does so through a unique a top priority. At ONE LOGIC, we draw on our vast
80538 Munich
combination of turnkey data products that turn expertise in data science to create efficient data Germany
data into long-term corporate assets and expert products that add tangible value and make a +49 89 954 592 70
support from experienced data science and AI pervasive impact in business and society.
ONE LOGIC Frankfurt
specialists. This paves the way for users at any Eschenheimer Anlage 1
stage of their business journey to develop high- An owner-managed company, ONE LOGIC’s 60316 Frankfurt/Main
executive board consists of company founder Germany
quality, productive and scalable data products and +49 69 299 5703 10
innovative business models that create value–and Dr. Andreas Böhm, Dr. Stefan Roskos, Christian
to do so at a much more rapid pace. Aumüller and Prof. Dr. Andreas Peifer (ppa.). As ONE LOGIC Zurich
of April 2022, the workforce is almost 300 strong Europaallee 41
8004 Zurich
Our enabling technology for this is ONE DATA and still growing, with teams based at ONE LOGIC Switzerland
Cartography. This automates processes and offices in Passau, Munich, Frankfurt, and Zurich. +41 44 214 6295
consistently creates the perfect conditions for Our sights are primarily set on the manufacturing,
Contact:
using data by merging heterogenous data sources automotive, retail, and pharma and biotech contact@onelogic.de
via AI. ONE LOGIC also offers turnkey data products industries. www.onelogic.ai