You are on page 1of 2

For the purpose of gathering publications related to bias in AI/ML applications, integrated

bottom-up and top-down research methodologies were exercised. Each co-author


gathered pertinent material and added it to a shared repository. The keywords for search
are optimized in each search with expectation to shortlist papers containing exact
information. The three core domains of artificial intelligence, machine learning, and
software engineering were the focus of the search as shown in figure 1. And & OR string
operators were used along with double quotation marks to further narrow down to search.

Fairness Management
Bias Potentially biased Attributes
Sensitive Attributes
Fairness Metrics
Protected Attributes
Taxonomy of Bias in AI
Best AI Practices Artificial
Intelligence Bias Algorithm

Decision Making
Explainability
MLLC
Bias Mitigation Strategies
Bias Minimization
IEEE, ACM, ScienceDirect, arXiv, and GoogleMachine
Scholar established online repositories were
Definition of Biases Learning research.
used to conduct Mostly
CRISP-DM
cited
Transparency
Fair Learning survey on
Bias Data Bias and Fairness in AI
SDLC
/ ML
Best ML Practices Software Discrimination
were used to explore the taxonym,
Engineering
detection, identification,
mitigation /adoption of bias and Unfairness
Fairness

Bias Management Bias Detection


Figure 1.
Bias Identification
Search
AI Ethics
words across three predefined domains- AI, ML & SE

fairness in AI/ML. Interesting fact was that each survey introduce some new types of bias or
fairness definitions. Only those papers that go into great detail about any of these issues,
such as BIAS IN DATA, ALGORITHMS, Assessment Tools, fairness management approaches,
bias detection, identification, and mitigation strategy, fairness metrices, development
cycles of AI/ML/SE, datasets characteristics, AI ethics and principals, etc. were shortlisted
during the inclusion criteria. We are further guided to our goal by sections from academic
books, keynote addresses by illustrious speakers, and various advisories published from
time to time. Non-Tabular data, domain/language dependent technology, the absence of
experimental results, etc. are among the exclusion factors that reduce the number of
papers we can find from 254 to 72. Our claim in this research is further supported by more
than 15 years of professional experience in the IT field of all authors.

"

You might also like