You are on page 1of 71

Artificial Intelligence

What is Artificial Intelligence ?


⚫ making computers that think?
⚫ the automation of activities we associate with human thinking,
like decision making, learning ... ?
⚫ the art of creating machines that perform functions that require
intelligence when performed by people ?
⚫ the study of mental faculties through the use of computational
models ?
What is Artificial Intelligence ?
⚫ the study of computations that make it possible to perceive,
reason and act ?
⚫ a field of study that seeks to explain and emulate intelligent
behaviour in terms of computational processes ?
⚫ a branch of computer science that is concerned with the
automation of intelligent behaviour ?
⚫ anything in Computing Science that we don't yet know how
to do properly ? (!)
Artificial Intelligence in the Movies
Artificial Intelligence in Real Life
A young science (≈ 50 years old)
⚫ Exciting and dynamic field, lots of uncharted territory left
⚫ Impressive success stories
⚫ “Intelligent” in specialized domains
⚫ Many application areas

Face detection Formal verification


What is Artificial Intelligence ?

THOUGHT Systems that think Systems that


like humans think
rationally

Systems that act Systems that act


BEHAVIOUR like humans rationally

HUMAN RATIONAL
Systems that act like humans:
Turing Test

⚫ “The art of creating machines that perform functions


that require intelligence when performed by people.”
(Kurzweil)
⚫ “The study of how to make computers do things at
which, at the moment, people are better.” (Rich and
Knight)
Systems that act like humans

?
⚫ You enter a room which has a computer terminal.
You have a fixed period of time to type what you
want into the terminal, and study the replies. At
the other end of the line is either a human being
or a computer system.
⚫ If it is a computer system, and at the end of the
period you cannot reliably determine whether it
is a system or a human, then the system is
deemed to be intelligent.
Systems that act like humans

⚫ The Turing Test approach


⚫ a human questioner cannot tell if
⚫ there is a computer or a human answering his question, via
teletype (remote communication)
⚫ The computer must behave intelligently
⚫ Intelligent behavior
⚫ to achieve human-level performance in all cognitive
tasks
Systems that act like humans
⚫ These cognitive tasks include:
⚫ Natural language processing
⚫ for communication with human
⚫ Knowledge representation
⚫ to store information effectively & efficiently
⚫ Automated reasoning
⚫ to retrieve & answer questions using the stored information
⚫ Machine learning
⚫ to adapt to new circumstances
The total Turing Test
⚫ Includes two more issues:
⚫ Computer vision
⚫ to perceive objects (seeing)
⚫ Robotics
⚫ to move objects (acting)
What is Artificial Intelligence ?

THOUGHT Systems that think Systems that


like humans think
rationally

Systems that act Systems that act


BEHAVIOUR like humans rationally

HUMAN RATIONAL
Systems that think like humans:
cognitive modeling
⚫ Humans as observed from ‘inside’
⚫ How do we know how humans think?
⚫ Introspection vs. psychological experiments
⚫ Cognitive Science
⚫ “The exciting new effort to make computers think
… machines with minds in the full and literal
sense” (Haugeland)
⚫ “[The automation of] activities that we associate
with human thinking, activities such as decision-
making, problem solving, learning …” (Bellman)
What is Artificial Intelligence ?

THOUGHT Systems that think Systems that


like humans think
rationally

Systems that act Systems that act


BEHAVIOUR like humans rationally

HUMAN RATIONAL
Systems that think ‘rationally’
"laws of thought"

⚫ Humans are not always ‘rational’


⚫ Rational - defined in terms of logic?
⚫ Logic can’t express everything (e.g. uncertainty)
⚫ Logical approach is often not feasible in terms of
computation time (needs ‘guidance’)
⚫ “The study of mental facilities through the use of
computational models” (Charniak and McDermott)
⚫ “The study of the computations that make it
possible to perceive, reason, and act” (Winston)
What is Artificial Intelligence ?

THOUGHT Systems that think Systems that


like humans think
rationally

Systems that act Systems that act


BEHAVIOUR like humans rationally

HUMAN RATIONAL
Systems that act rationally:
“Rational agent”

⚫ Rational behavior: doing the right thing


⚫ The right thing: that which is expected to maximize
goal achievement, given the available information
⚫ Giving answers to questions is ‘acting’.
⚫ I don't care whether a system:
⚫replicates human thought processes
⚫makes the same decisions as humans
⚫uses purely logical reasoning
Systems that act rationally
⚫ Logic 🡪 only part of a rational agent, not all of
rationality
⚫ Sometimes logic cannot reason a correct conclusion
⚫ At that time, some specific (in domain) human
knowledge or information is used
⚫ Thus, it covers more generally different situations
of problems
⚫ Compensate the incorrectly reasoned conclusion
Systems that act rationally
⚫ Study AI as rational agent –
2 advantages:
⚫ It is more general than using logic only
⚫ Because: LOGIC + Domain knowledge
⚫ It allows extension of the approach with more scientific
methodologies
Rational agents
⚫ An agent is an entity that perceives and acts

⚫ This course is about designing rational agents

⚫ Abstractly, an agent is a function from percept histories to actions:

[f: P* 🡪 A]

⚫ For any given class of environments and tasks, we seek the agent
(or class of agents) with the best performance

⚫ Caveat: computational limitations make perfect rationality


unachievable
⚫ 🡪 design best program for given machine resources
⚫ Artificial
⚫ Produced by human art or effort, rather than originating
naturally.
⚫ Intelligence
⚫ is the ability to acquire knowledge and use it"
[Pigford and Baur]
⚫ So AI was defined as:
⚫ AI is the study of ideas that enable computers to be
intelligent.
⚫ AI is the part of computer science concerned with design
of computer systems that exhibit human
intelligence(From the Concise Oxford Dictionary)
From the above two definitions, we can see that AI has
two major roles:
⚫ Study the intelligent part concerned with humans.
⚫ Represent those actions using computers.
Goals of AI
⚫ To make computers more useful by letting them take
over dangerous or tedious tasks from human
⚫ Understand principles of human intelligence
The Foundation of AI
⚫ Philosophy
⚫ At that time, the study of human intelligence began with
no formal expression
⚫ Initiate the idea of mind as a machine and its internal
operations
The Foundation of AI
⚫ Mathematics formalizes the three main area of AI:
computation, logic, and probability
⚫ Computation leads to analysis of the problems that can
be computed
⚫ complexity theory
⚫ Probability contributes the “degree of belief” to handle
uncertainty in AI
⚫ Decision theory combines probability theory and utility
theory (bias)
The Foundation of AI
⚫ Psychology
⚫ How do humans think and act?
⚫ The study of human reasoning and acting
⚫ Provides reasoning models for AI
⚫ Strengthen the ideas
⚫ humans and other animals can be considered as information
processing machines
The Foundation of AI
⚫ Computer Engineering
⚫ How to build an efficient computer?
⚫ Provides the artifact that makes AI application possible
⚫ The power of computer makes computation of large and
difficult problems more easily
⚫ AI has also contributed its own work to computer
science, including: time-sharing, the linked list data type,
OOP, etc.
The Foundation of AI
⚫ Control theory and Cybernetics
⚫ How can artifacts operate under their own control?
⚫ The artifacts adjust their actions
⚫ To do better for the environment over time
⚫ Based on an objective function and feedback from the
environment
⚫ Not limited only to linear systems but also other
problems
⚫ as language, vision, and planning, etc.
The Foundation of AI
⚫ Linguistics
⚫ For understanding natural languages
⚫ different approaches has been adopted from the linguistic work
⚫ Formal languages
⚫ Syntactic and semantic analysis
⚫ Knowledge representation
The main topics in AI
Artificial intelligence can be considered under a number of
headings:
⚫ Search (includes Game Playing).
⚫ Representing Knowledge and Reasoning with it.
⚫ Planning.
⚫ Learning.
⚫ Natural language processing.
⚫ Expert Systems.
⚫ Interacting with the Environment
(e.g. Vision, Speech recognition, Robotics)
We won’t have time in this course to consider all of these.
Some Advantages of Artificial
Intelligence

⚫ more powerful and more useful computers


⚫ new and improved interfaces
⚫ solving new problems
⚫ better handling of information
⚫ relieves information overload
⚫ conversion of information into knowledge
The Disadvantages

⚫ increased costs
⚫ difficulty with software development - slow and expensive
⚫ few experienced programmers
⚫ few practical products have reached the market as yet.
Search
⚫ Search is the fundamental technique of AI.
⚫ Possible answers, decisions or courses of action are structured into an
abstract space, which we then search.
⚫ Search is either "blind" or “uninformed":
⚫ blind
⚫ we move through the space without worrying about what is coming
next, but recognising the answer if we see it
⚫ informed
⚫ we guess what is ahead, and use that information to decide where to
look next.
⚫ We may want to search for the first answer that satisfies our goal, or we
may want to keep searching until we find the best answer .
Knowledge Representation & Reasoning
⚫ The second most important concept in AI
⚫ If we are going to act rationally in our environment, then we must have
some way of describing that environment and drawing inferences from that
representation.
⚫ how do we describe what we know about the world ?
⚫ how do we describe it concisely ?
⚫ how do we describe it so that we can get hold of the right piece of
knowledge when we need it ?
⚫ how do we generate new pieces of knowledge ?
⚫ how do we deal with uncertain knowledge ?
Knowledge

Declarative Procedural

• Declarative knowledge deals with factoid questions


(what is the capital of India? Etc.)
• Procedural knowledge deals with “How”
• Procedural knowledge can be embedded in
declarative knowledge
Planning
Given a set of goals, construct a sequence of actions that achieves
those goals:
⚫ often very large search space
⚫ but most parts of the world are independent of most other parts
⚫ often start with goals and connect them to actions
⚫ no necessary connection between order of planning and order of
execution
⚫ what happens if the world changes as we execute the plan and/or
our actions don’t produce the expected results?
Learning
⚫ If a system is going to act truly appropriately, then it must
be able to change its actions in the light of experience:
⚫how do we generate new facts from old ?
⚫how do we generate new concepts ?
⚫how do we learn to distinguish different
situations in new environments ?
Interacting with the Environment
⚫ In order to enable intelligent behaviour, we will
have to interact with our environment.
⚫ Properly intelligent systems may be expected to:
⚫ accept sensory input
⚫vision, sound, …
⚫ interact with humans
⚫understand language, recognise speech,
generate text, speech and graphics, …
⚫ modify the environment
⚫robotics
AI Applications
⚫ Autonomous Planning
& Scheduling:
⚫ Autonomous rovers.
AI Applications
⚫ Autonomous Planning & Scheduling:
⚫Telescope scheduling
AI Applications
⚫ Autonomous Planning & Scheduling:
⚫Analysis of data:
AI Applications
⚫ Medicine:
⚫Image guided surgery
AI Applications
⚫ Medicine:
⚫Image analysis and enhancement
AI Applications
⚫ Transportation:
⚫ Autonomous
vehicle control:
AI Applications
⚫ Transportation:
⚫Pedestrian detection:
AI Applications
Other application areas:
⚫ Bioinformatics:
⚫ Gene expression data analysis
⚫ Prediction of protein structure
⚫ Text classification, document sorting:
⚫ Web pages, e-mails
⚫ Articles in the news
⚫ Video, image classification
⚫ Music composition, picture drawing
⚫ Natural Language Processing
CRISP-DM

CRoss-Industry Standard Process


for Data Mining
Why Should There be a Standard Process?
The data mining process must be reliable and
repeatable by people with little data mining
background.
Why Should There be a Standard Process?
⚫ Framework for recording experience
⚫ Allows projects to be replicated
⚫ Aid to project planning and management
⚫ “Comfort factor” for new adopters
⚫ Demonstrates maturity of Data Mining
⚫ Reduces dependency on “stars”
Process Standardization
⚫ Initiative launched in late 1996 by three “veterans” of data mining market.
Daimler Chrysler (then Daimler-Benz), SPSS (then ISL) , NCR

⚫ Developed and refined through series of workshops (from 1997-1999)

⚫ Over 300 organization contributed to the process model

⚫ Published CRISP-DM 1.0 (1999)

⚫ Over 200 members of the CRISP-DM SIG worldwide


- DM Vendors - SPSS, NCR, IBM, SAS, SGI, Data Distilleries, Syllogic, etc.
- System Suppliers / consultants - Cap Gemini, ICL Retail, Deloitte & Touche, etc.
- End Users - BT, ABB, Lloyds Bank, AirTouch, Experian, etc.
CRISP-DM
⚫ Non-proprietary
⚫ Application/Industry neutral
⚫ Tool neutral
⚫ Focus on business issues
⚫ As well as technical analysis
⚫ Framework for guidance
⚫ Experience base
⚫ Templates for Analysis
CRISP-DM: Overview

⚫ Data Mining methodology


⚫ Process Model
⚫ For anyone
⚫ Provides a complete blueprint
⚫ Life cycle: 6 phases
CRISP-DM: Phases
⚫ Business Understanding
Project objectives and requirements understanding, Data mining problem definition
⚫ Data Understanding
Initial data collection and familiarization, Data quality problems identification
⚫ Data Preparation
Table, record and attribute selection, Data transformation and cleaning
⚫ Modeling
Modeling techniques selection and application, Parameters calibration
⚫ Evaluation
Business objectives & issues achievement evaluation
⚫ Deployment
Result model deployment, Repeatable data mining process implementation
Phases and Tasks
Data Data
Business
Evaluatio Deploym
Understanding Understa Preparati Modeling
n ent
nding on
Determine Collect Select
Select Evaluate Plan
Business Initial Modeling
Data Results Deployment
Objectives Data Technique

Plan Monitering
Assess Describe Clean Generate Review
&
Situation Data Data Test Design Process
Maintenance

Determine Produce
Explore Construct Build Determine
Data Mining Final
Data Data Model Next Steps
Goals Report

Verify
Produce Integrate Assess Review
Data
Project Plan Data Model Project
Quality

Format
Data
Phase 1. Business Understanding

⚫ Statement of Business Objective


⚫ Statement of Data Mining Objective
⚫ Statement of Success Criteria

Focuses on understanding the project objectives and requirements from a


business perspective, then converting this knowledge into a data mining
problem definition and a preliminary plan designed to achieve the
objectives
Phase 1. Business Understanding
⚫ Determine business objectives
- thoroughly understand, from a business perspective, what the client really
wants to accomplish
- uncover important factors, at the beginning, that can influence the outcome of
the project
- neglecting this step is to expend a great deal of effort producing the right
answers to the wrong questions

⚫ Assess situation
- more detailed fact-finding about all of the resources, constraints, assumptions
and other factors that should be considered
- flesh out the details
Phase 1. Business Understanding
⚫ Determine data mining goals
- a business goal states objectives in business terminology
- a data mining goal states project objectives in technical terms
ex) the business goal: “Increase catalog sales to existing customers.”
a data mining goal: “Predict how many widgets a customer will buy,
given their purchases over the past three years,
demographic information (age, salary, city) and
the price of the item.”
⚫ Produce project plan
- describe the intended plan for achieving the data mining goals and the
business goals
- the plan should specify the anticipated set of steps to be performed during the
rest of the project including an initial selection of tools and techniques
Phase 2. Data Understanding
• Explore the Data
• Verify the Quality
• Find Outliers

Starts with an initial data collection and proceeds with activities in


order to get familiar with the data, to identify data quality problems,
to discover first insights into the data or to detect interesting
subsets to form hypotheses for hidden information.
Phase 2. Data Understanding
⚫ Collect initial data
- acquire within the project the data listed in the project resources
- includes data loading if necessary for data understanding
- possibly leads to initial data preparation steps
- if acquiring multiple data sources, integration is an additional issue, either
here or in the later data preparation phase

⚫ Describe data
- examine the “gross” or “surface” properties of the acquired data
- report on the results
Phase 2. Data Understanding
⚫ Explore data
- tackles the data mining questions, which can be addressed using querying,
visualization and reporting including:
distribution of key attributes, results of simple aggregations
relations between pairs or small numbers of attributes
properties of significant sub-populations, simple statistical analyses
- may address directly the data mining goals
- may contribute to or refine the data description and quality reports
- may feed into the transformation and other data preparation needed
⚫ Verify data quality
- examine the quality of the data, addressing questions such as:
“Is the data complete?”, Are there missing values in the data?”
Phase 3. Data Preparation
⚫Takes usually over 90% of the time
- Collection
- Assessment
- Consolidation and Cleaning
- Data selection
- Transformations

Covers all activities to construct the final dataset from the initial raw data. Data
preparation tasks are likely to be performed multiple times and not in any
prescribed order. Tasks include table, record and attribute selection as well as
transformation and cleaning of data for modeling tools.
Phase 3. Data Preparation
⚫ Select data
- decide on the data to be used for analysis
- criteria include relevance to the data mining goals, quality and technical
constraints such as limits on data volume or data types
- covers selection of attributes as well as selection of records in a table

⚫ Clean data
- raise the data quality to the level required by the selected analysis techniques
- may involve selection of clean subsets of the data, the insertion of suitable
defaults or more ambitious techniques such as the estimation of missing data
by modeling
Phase 3. Data Preparation
⚫ Construct data
- constructive data preparation operations such as the production of derived
attributes, entire new records or transformed values for existing attributes

⚫ Integrate data
- methods whereby information is combined from multiple tables or records to
create new records or values

⚫ Format data
- formatting transformations refer to primarily syntactic modifications made to
the data that do not change its meaning, but might be required by the
modeling tool
Phase 4. Modeling
⚫ Select the modeling technique
(based upon the data mining objective)
⚫ Build model
(Parameter settings)
⚫ Assess model (rank the models)
Various modeling techniques are selected and applied and their parameters are
calibrated to optimal values. Some techniques have specific requirements on the
form of data. Therefore, stepping back to the data preparation phase is often
necessary.
Phase 4. Modeling
⚫ Select modeling technique
- select the actual modeling technique that is to be used
ex) decision tree, neural network
- if multiple techniques are applied, perform this task for each techniques
separately

⚫ Generate test design


- before actually building a model, generate a procedure or mechanism to test
the model’s quality and validity
ex) In classification, it is common to use error rates as quality measures for
data mining models. Therefore, typically separate the dataset into train and
test set, build the model on the train set and estimate its quality on the
separate test set
Phase 4. Modeling
⚫ Build model
- run the modeling tool on the prepared dataset to create one or more models

⚫ Assess model
- interprets the models according to his domain knowledge, the data mining
success criteria and the desired test design
- judges the success of the application of modeling and discovery techniques
more technically
- contacts business analysts and domain experts later in order to discuss the
data mining results in the business context
- only consider models whereas the evaluation phase also takes into account all
other results that were produced in the course of the project
Phase 5. Evaluation
⚫ Evaluation of model
- how well it performed on test data
⚫ Methods and criteria
- depend on model type
⚫ Interpretation of model
- important or not, easy or hard depends on algorithm
Thoroughly evaluate the model and review the steps executed to construct the
model to be certain it properly achieves the business objectives. A key objective is
to determine if there is some important business issue that has not been
sufficiently considered. At the end of this phase, a decision on the use of the data
mining results should be reached
Phase 5. Evaluation
⚫ Evaluate results
- assesses the degree to which the model meets the business objectives
- seeks to determine if there is some business reason why this model is
deficient
- test the model(s) on test applications in the real application if time and
budget constraints permit
- also assesses other data mining results generated
- unveil additional challenges, information or hints for future directions
Phase 5. Evaluation
⚫ Review process
- do a more thorough review of the data mining engagement in order to
determine if there is any important factor or task that has somehow been
overlooked
- review the quality assurance issues
ex) “Did we correctly build the model?”

⚫ Determine next steps


- decides how to proceed at this stage
- decides whether to finish the project and move on to deployment if
appropriate or whether to initiate further iterations or set up new data mining
projects
- include analyses of remaining resources and budget that influences the
decisions
Phase 6. Deployment
⚫ Determine how the results need to be utilized
⚫ Who needs to use them?
⚫ How often do they need to be used

⚫ Deploy Data Mining results by


Scoring a database, utilizing results as business rules,
interactive scoring on-line

The knowledge gained will need to be organized and presented in a way that the
customer can use it. However, depending on the requirements, the deployment
phase can be as simple as generating a report or as complex as implementing a
repeatable data mining process across the enterprise.
Phase 6. Deployment
⚫ Plan deployment
- in order to deploy the data mining result(s) into the business, takes the
evaluation results and concludes a strategy for deployment
- document the procedure for later deployment

⚫ Plan monitoring and maintenance


- important if the data mining results become part of the day-to-day business
and it environment
- helps to avoid unnecessarily long periods of incorrect usage of data mining
results
- needs a detailed on monitoring process
- takes into account the specific type of deployment

You might also like