Professional Documents
Culture Documents
Whilst it may not appear to be the case, a form of data ecosystem has been
around for thousands of years.
From early accounts of using data-driven insights to record and forecast crops
and herds up to more refined accounting principles in the 1600s, which laid the
basis of big data today, data ecosystems have played a fundamental part in many
of the professional areas we experience in our everyday lives, and it is not new.
Something to consider as you progress through this chapter is that the data
ecosystem is an ever-changing data-led collection of component parts (see
Fig. 2.1) that are expected to develop and evolve over time.
It’s for this reason this text refers to the data ecosystem as opposed to the data
environment.
Analytics Infrastructure
U
I s
n e
s f
i u
Applications Data Intelligent
g Ecosystem Algorithms l
h n
t e
s s
Machine s
Learning / Data
Artificial Sources/Types
Intelligence
Data
Ecosystem
business like Office, Word, Excel, PowerPoint and others. The application is any
program or suite of programs designed for the end user to use the data effectively.
Intelligent Algorithms – Using the definition provided in Chapter 1, Intelligent
Algorithms, also referred to as artificial intelligent algorithms, relate to computer
systems that are able to replicate the logic, approach, expertise and reasoning
behind processing information for problem-solving.
Machine Learning (ML) – ML is the field of Artificial Intelligence (AI) which
uses statistical driven processes, functions and techniques to facilitate the ability
for a computer (machine) to learn from new and changing data sets provided. It is
the automated computerised (or ‘self’) learning without the prerequisite for
explicit coding or computer programming that is the foundation of ML.
Artificial Intelligence (AI) – The demonstration by computers (machines) to
fulfil the role and the actual completion of tasks which are usually classified as
human actions and needing human intelligence.
Data Sources/Types – Data sources are fairly self-explanatory terminologies to
describe the originating location or primary source of where the data came from.
Understanding the Data Ecosystem 13
This data source can range from Google documents and Excel spreadsheets to
databases and forms. The data type is the characterisation of the data and nor-
mally defined by the values it holds, coding/programming languages used or the
actions that can be used with the data types.
Machine Learning
At the time of writing this section of the guide, ML delivers over 6.76 billion
results with a top-level Google UK search query, reinforcing the active interest in
this field, as well as the confusion that often surrounds it.
• The branch of AI that deals with systems and the ability for computers to learn
from data and identify patterns without human intervention
• Practical application of AI, providing systems to automate actions and learn
from changing data sets, improving the insights derived without any manual
coding and programming involved
• The most basic practice of working with intelligent algorithms to process data,
learn from the data, and make a judgement or prediction based on the data,
providing new meaning and insights depending on data changes
• The science of enabling computers to act using data and make informed
decisions attributed to the changing data they are exposed to
Feature
Learning Type ML Algorithm
Extraction
Classifying / Forecasting/
•Semi- Regression
Training Set
supervised •Logistic
Predictions / Other)
Regression
•Decision Tree
•Random
Forest
•Gradient
Boosting
•K Means
•Other
New Data
Annotated Data
Unsupervised learning – Involves letting the computer figure things out on its
own. For example, an objective with unsupervised learning can be for the
machine to tell you something you didn’t know about the data or to share pat-
terns and opportunity that sit within the data.
Semi-supervised learning – Sitting between supervised and unsupervised
learning, the computer is provided with partial information and training signals
with the expectation that it will figure the remainder out for itself.
There are additional learning environments within the ML field including
active learning and reinforced learning, but the above will give you enough of an
understanding and awareness on the topic for the rationales of this guide.
If you are eager to dig deeper into the world of ML and the algorithm types
involved, there are seven extra (commonly applied) algorithm-based models
plus some top-level overviews to provide added awareness at this stage. These
follow now.
Naı̈ve Bayes (Classifier algorithm) – This is often deployed for grouping (or
classifying) similar items together. Its uses include classifying documents, emails
(think spam filtering) and wider applications including disease predictions and
assigning sentiment to messages on platforms like Facebook.
Understanding the Data Ecosystem 15
Some of the more widely cited business uses of ML comprise everything from
face detection and spam filtering in emails, all the way through to recommending
songs, artists, films and television shows, plus the more B2B application of items
such as speech recognition.
The sections that follow now cover practical ML applications applicable for
many companies today.
Artificial Intelligence
AI relates to a machine’s (computers) capability to show intelligence as opposed
to the natural intelligence demonstrated by human beings and other animals. As a
facet of computer science, AI places emphasis on the creation of machines that
can demonstrate intelligence and react/respond like humans.
Intelligence is related to intelligent agents and refers to any device that is able
to take actions that increase the likelihood of achieving goals and objectives.
Many of the same approaches for teaching AI are attributable to the teaching of
children.
AI needs to be trained in structured and unstructured way (supervised, unsu-
pervised, semi-supervised learning) and has to be exposed to different types of
data sets (environments) and uses the lessons from these data sets to learn, adapt
and improve (in the same way that people do).
Understanding the Data Ecosystem 17
This is where ‘AI for good’ comes to the fore. This (AI for good) is a United
Nations (UN) platform and growing movement where institutions are using AI to
solve global issues like homelessness – in this example, AI is used to identify areas
across the world where it predicts have the highest levels of homelessness to
support added focus and support.
In this example, AI practically works by analysing satellite images and
matching these to expected characteristics.
Qualitative Data
These are data that do not have a measurement for a specific attribute. As such
qualitative data use approximations, interpretations and characterisations to
describe something.
In this way you can describe qualitative data as data that are observed, cat-
egorised and recorded in a nonnumerical fashion.
Qualitative data are often gathered through focus groups, case studies, testi-
monials, interviews and other similar data collection approaches, giving
researchers access to the meanings that people place onto things.
Examples of qualitative data in action include:
Quantitative Data
In direct contrast to qualitative data, quantitative data looks at the measurable
(information about quantities) that can be categorised based on numerical values/
numbers alone – it does not look at the characteristics of the data, just the metric/
measurement.
Anything that has a number value assignable to it can be seen as quantitative
data. Examples would include height, weight, clothing size, income amount,
traffic volumes, number of sales and lots more.
20 Data-driven Marketing Content
Quantitative data examples include the following (note: these all have limited
(closed) response types):
Chapter Summary
This chapter introduced the notion of the data ecosystem, how this differs from a
data environment, plus the building blocks and visualisation of what a typical
data ecosystem looks like (refer Fig. 2.1 for more information).
In this section, many of the key components that are active within data eco-
systems were covered such as ML, AI, qualitative data and quantitative data, plus
definitions spanning other related topics too.
This part of the business guide supplied insights into some of the practical
applications of the broader theoretical areas including ML and AI, so that as a
company these topics can start to be viewed with a more feasible mindset. This
will help with furthering the thought process of how these areas can give your
business a competitive advantage (something that will be targeted in a much
greater extent throughout this guide).
As you move into the next chapter we take a look at extra real-world appli-
cations of big data for marketing and broader content creation. The guide con-
siders how to collect data your business needs and ways to effectively manage
your data, and highlights a few of the recent developments when it comes to data
privacy/protection and particularly the role of GDPR.
Definitions
• Data ecosystems: these are expected to change, develop and evolve over time.
A data ecosystem is a group of applications, analytics, technology and infra-
structure used to collect, process and support big data.
• B2B: B2B, standing for business to business, is the acronym of the term used to
describe companies that have a primary focus on selling to other businesses
rather than directly to the consumer (B2C).
• Paid advertising: this relates to all types of business promotion and advertising
which involves a direct payment. With paid advertising, companies are effec-
tively paying for positions and advertising space rather than earning them (as
you would with organic search). Paying for placement traditionally occurs
within a bidding (or auction) environment.
• Automated bid adjustments: these are a form of paid advertising bidding
strategy, often deployed on larger scale accounts, to enable more effective bulk
bidding. Often these are based on setting parameters for bids tied to intended
outcomes (for example, a targeted cost per acquisition).
Understanding the Data Ecosystem 21
• Search engine results pages (SERPs): these are the pages of information
(organic and paid advertising along with integrated features like Google My
Business and Google Answers) displayed in the search engines like Google,
Bing, Duck-Duck-Go and others. SERPs display relevant results in a fast time
(usually milliseconds) tied to any given search query.
• Google RankBrain: this is the AI and ML algorithm and name the search
engine provider Google use for understanding new and niche search queries.