Artificial Intelligence and Data Analytics (AIDA) Guidebook

5 Machine Learning Methodology


This section provides an overview of how developers build, evaluate, and manage analytic
and learning systems through a machine-learning pipeline. A machine-learning pipeline
codifies and automates the workflow required to produce a machine-learning model.
Such pipelines consist of multiple sequential steps, ranging from data extraction and
preprocessing to model training and deployment. Figure 6 provides a high-level example
of a machine-learning pipeline; each step is described in more detail below.
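
The idea of codifying a workflow as sequential, automated steps can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the guidebook's prescribed design: the step names (`extract`, `preprocess`, `train`, `deploy`), the toy data, and the `run_pipeline` helper are all hypothetical, and the "model" is a simple least-squares line fit chosen only to keep the example self-contained.

```python
from typing import Callable, Dict, List, Tuple

# A step takes the shared pipeline state and returns the updated state.
Step = Tuple[str, Callable[[Dict], Dict]]


def extract(state: Dict) -> Dict:
    # Stand-in for data extraction: hard-coded (x, y) pairs (toy data).
    state["raw"] = [(1.0, 2.1), (2.0, 3.9), (3.0, 6.2), (4.0, 7.8)]
    return state


def preprocess(state: Dict) -> Dict:
    # Center the input feature so training works on zero-mean x values.
    xs = [x for x, _ in state["raw"]]
    mean_x = sum(xs) / len(xs)
    state["mean_x"] = mean_x
    state["data"] = [(x - mean_x, y) for x, y in state["raw"]]
    return state


def train(state: Dict) -> Dict:
    # Ordinary least-squares fit of y = slope * x_centered + intercept.
    data = state["data"]
    mean_y = sum(y for _, y in data) / len(data)
    sxx = sum(x * x for x, _ in data)
    sxy = sum(x * (y - mean_y) for x, y in data)
    state["model"] = {"slope": sxy / sxx, "intercept": mean_y}
    return state


def deploy(state: Dict) -> Dict:
    # "Deployment" here just exposes a prediction function on raw inputs.
    model, mean_x = state["model"], state["mean_x"]
    state["predict"] = lambda x: model["slope"] * (x - mean_x) + model["intercept"]
    return state


PIPELINE: List[Step] = [
    ("extract", extract),
    ("preprocess", preprocess),
    ("train", train),
    ("deploy", deploy),
]


def run_pipeline(steps: List[Step]) -> Dict:
    # Run each step in order and record which steps completed.
    state: Dict = {"log": []}
    for name, step in steps:
        state = step(state)
        state["log"].append(name)
    return state
```

Calling `run_pipeline(PIPELINE)` returns a state dictionary holding the fitted model and a `predict` function. Because the steps live in an ordered list, the whole workflow can be rerun unchanged when the data changes, which is the sense in which a pipeline automates model production.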

Figure 6: Machine Learning Pipeline

Pipeline steps:

1. Establish AI system goal – traditional goals of AI research include reasoning, knowledge
representation, planning, learning, natural language processing, perception, and the
ability to move and manipulate objects.
2. Establish requirements – consider desired performance, usability, integration, and
statistical behavior.
3. Identify AI solution design – identify the algorithms and programming language to be
used.
4. Identify use constraints – constraints enumerate the possible values a set of variables
may take in a given world.
5. Identify required data sets – the more complex your model becomes, the more data you
will need to determine its parameters.
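
Steps 4 and 5 can be made concrete with a small Python sketch. It assumes hypothetical design variables (`latency_ms`, `runs_on_device`, `model_size`) and hypothetical constraints, and it uses the parameter count of a fully connected network as one rough proxy for model complexity. None of these names or values come from the guidebook.

```python
from itertools import product
from typing import Callable, Dict, List, Sequence


def feasible_assignments(
    domains: Dict[str, Sequence],
    constraints: List[Callable[[Dict], bool]],
) -> List[Dict]:
    # Enumerate every combination of values and keep those that
    # satisfy all constraints (brute force; fine for small domains).
    names = list(domains)
    feasible = []
    for values in product(*(domains[name] for name in names)):
        assignment = dict(zip(names, values))
        if all(check(assignment) for check in constraints):
            feasible.append(assignment)
    return feasible


def dense_parameter_count(layer_sizes: Sequence[int]) -> int:
    # Weights plus biases between each pair of consecutive layers
    # in a fully connected network.
    return sum(
        n_in * n_out + n_out
        for n_in, n_out in zip(layer_sizes, layer_sizes[1:])
    )


# Hypothetical design variables and the values each may take.
DOMAINS = {
    "latency_ms": [50, 200],
    "runs_on_device": [True, False],
    "model_size": ["small", "large"],
}

# Hypothetical use constraints expressed as predicates.
CONSTRAINTS = [
    # A large model is assumed not to fit on the device.
    lambda a: not (a["runs_on_device"] and a["model_size"] == "large"),
    # A large model is assumed unable to meet the 50 ms latency target.
    lambda a: not (a["model_size"] == "large" and a["latency_ms"] == 50),
]

options = feasible_assignments(DOMAINS, CONSTRAINTS)
small_net = dense_parameter_count([4, 8, 1])
large_net = dense_parameter_count([4, 64, 64, 1])
```

Here `options` holds the design choices that remain once the constraints are applied, and comparing `small_net` with `large_net` shows how quickly the number of parameters to be estimated grows as layers are widened or added. Parameter count is only one indicator of how much data a model needs; the actual requirement also depends on the task and on data quality.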
