Machine Learning-Unit-V-Notes

ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING
UNIT – V
Introduction Of AI: Definition of AI - AI Problems – Underlying
Assumption – AI technique – Level of the Model - Criteria for Success.
Problems, Problem Spaces, Search: Defining the Problem as State Space
Search - Production Systems - problem Characteristics – Production System
Characteristics
INTRODUCTION OF AI
What is Artificial intelligence (Al)?
• Artificial intelligence (Al) is the study of how to make computers do things which, at
present, people do better.
• Father of Artificial Intelligence, John McCarthy, defines AI as “The science and
engineering of making intelligent machines, especially intelligent computer programs”.
• Artificial Intelligence is a way of making a computer, a computer-controlled robot,
or a software think intelligently, in the similar manner the intelligent humans think.
AI is achieved through the study of, how human brain thinks and how humans learn, decide,
and work while trying to solve a problem, and use the outcomes of this study to develop
intelligent software and systems.
AI can identify patterns in large dataset (big-data) more efficiently than humans, enabling
businesses to gain more insight out of their data. This quick and accurate process helps
businesses to extract invaluable insights from their data.
AI uses a set of very powerful tools, and methodologies for solving business problems.
From a programming perspective, AI includes the study of symbolic programming, problem
solving, and search.
THE Al PROBLEMS
• Earlier stage of AI focused on formal tasks, such as game playing and theorem proving.
Samuel wrote a checkers-playing program which played games with opponents and
used its experience to improve its later performance.
• AI focused on the Chess game also.
• Logic Theorist is a computer program written in 1956 by Allen Newell, Herbert A.
Simon, and Cliff Shaw. It was the first program to perform automated reasoning. It is
"the first artificial intelligence program". It was able to prove several theorems even
from Whitehead and Russell's Principia Mathematica.
• Gelernter's theorem prover explored on geometry, game playing and theorem
proving.
• Al focused on commonsense reasoning - is a human-like ability. It includes reasoning
about physical objects and their relationships to each other.
• Newell. Shaw, and Simon built the General Problem Solver (GPS), a universal
problem solver machine. In contrast to the former Logic Theorist project, the GPS
works with means–ends analysis
• Perception – vision and speech, natural language understanding, and problems
solving in specialized domains such as medical diagnosis and chemical analysis and
are referred as
• Mundane tasks. Perceptual tasks involve analog signals rather than digital signals. The
problem of understanding spoken language is a perceptual problem and is hard to solve.
If the problem is restricted to written language, the problem is referred to as natural
language understanding,
• AI was also used for specialized tasks such as engineering design, scientific discovery,
medical diagnosis, and financial planning.
Figure 1.1 lists some of the tasks that are the targets of work in AL
First, perceptual, linguistic, and commonsense skills are learned. Later expert skills such as
engineering, medicine, or finance are acquired
• Expert Systems Al focused on the domains that require only specialized expertise
without commonsense knowledge, known as expert systems. An expert system is a
computer program that uses artificial intelligence (AI) technologies to simulate the
judgment and behaviour of a human or an organization that has expertise and
experience in a particular field.
• DENDRAL is a chemical-analysis expert system.
• MYCIN is an expert system for treating blood infections.
• CYC is a large experiment in symbolic AI.
The following four questions need to be answered:
1. What are our underlying assumptions about intelligence?
2. What kinds of techniques will be useful for solving Al problems?
3. At what level of detail, if at all, are we trying to model human intelligence?
4. How will we know when we have succeeded in building an intelligent program?
THE UNDERLYING ASSUMPTION

• Intelligence can be simulated: The assumption that intelligence can be replicated in
machines, and that machines can learn and improve their performance over time.
• Data can be processed: The assumption that large amounts of data can be analysed,
processed, and used to train machine learning models that can make predictions or
decisions.
• Algorithms can be developed: The assumption that algorithms can be developed to solve
complex problems and make decisions, based on input data and rules.
• Human-like cognition: The assumption that AI can replicate some aspects of human
cognition, such as perception, reasoning, decision-making, and natural language
processing.
• Autonomous decision-making: The assumption that machines can make decisions without
human intervention, based on programmed rules or learned behaviors.
• Generalizability: The assumption that machine learning models can generalize from
specific examples to new, unseen cases, and that they can be adapted to different contexts.
• Ethical and social implications: The assumption that the development and use of AI have
ethical and social implications that need to be considered, including issues such as privacy,
bias, transparency, and accountability.
These assumptions provide the foundation for the development of AI and guide its application
in various fields.
Newell and Simon define a physical symbol system as follows. A physical symbol system
consists of a set of entities, called symbols, which are physical patterns that can occur as
components of another type of entity called an expression (or symbol structure).
A physical symbol system is a machine that produces through time an evolving collection of
symbol structures. Such a system exists in a world of objects wider than just these symbolic
expressions themselves.
The Physical Symbol System Hypothesis. A physical symbol system has the necessary and
sufficient means for general intelligent action.
The hypothesis cannot be proved or validated. It can only be experimented. Computers can be
programmed to simulate any physical symbol system we like.
Charles Babbage proposed Analytical Engine which was observed by Lady Ada Lovelace.
Sub-symbolic models such as neural networks are beginning to challenge symbolic ones at
such low-level tasks.
Understanding joke is a process of symbol manipulation provided in the book of
Mathematics and Humour.
The physical symbol system hypothesis is important in two ways:
1. It is a significant theory of human intelligence and also to psychologists.
2. It is the basis for the belief that it is possible to build programs that can perform
intelligent tasks now performed by people.
We concentrate on the second part.
WHAT IS AN Al TECHNIQUE?
Artificial intelligence problems span a very broad spectrum. The problems have very little in
common. The problems are hard in nature. Let us find out some techniques for the solution to
these variety of problems.
Intelligence requires knowledge. And knowledge possesses the following properties:
• It is voluminous.
• It is hard to characterize accurately.
• It is constantly changing.
• It differs from data by being organized in a way that corresponds to the ways it will be
used.
Al techniques
AI technique is a method that exploits knowledge that should be represented in such a way
that:
• Knowledge captures generalisation. It means that, it is not necessary to represent
separately each individual situation. Instead, situations that share important properties
are grouped together. If knowledge does not have this property, excessive amounts
of memory and updating will be required. It can be easily modified to correct errors
and to reflect changes in the world.
• It can be understood by people who must provide it. Although for many programs,
the bulk of the data can be acquired automatically, for example by taking readings
from a variety of instruments.
• It can easily be modified to correct errors and to reflect changes in the world and in
our world view.
• It can be used in many situations even if it is not totally accurate or complete.
• It can be used to reduce its own complete volume by narrowing the range of
possibilities.
• Although AI techniques must be designed in keeping with these constraints imposed

by AI problems, there is some degree of independence between problems and problem-
solving techniques. It is possible to solve Al problems without using AI techniques and
it is possible to apply AI techniques to the solution of non-Al problems.
There are three important AI techniques:
1. Search — Provides a way of solving problems for which no direct approach is

available. It also provides a framework into which any direct techniques that are
available can be embedded.
2. Use of knowledge — Provides a way of solving complex problems by exploiting the
structure of the objects that are involved.
3. Abstraction — Provides a way of separating important features and variations from
many unimportant ones that would otherwise overwhelm any process.
Let's look at two very different problems and a series of approaches for solving each of them.
1. Tic-Tac-Toe
2. Question Answering
i)Tic-Tac-Toe Problem Here, presented a series of three programs to play tic-tac-toe. The
programs in this series increase in:
The clarity of their knowledge
Their complexity
Their use of generalizations
The extensibility of their approach. Thus, they move toward being
representations of what we call AI techniques.
But it has several disadvantages:

• It takes a lot of space to store the table that specifies the correct move to make from each
board position
• Someone will have to do a lot of work specifying all the entries in the move table
• It is very unlikely that all the required move table entries can be determined and entered
without any errors.
• If we want to extend the game. say to three dimensions. we would have to start from scratch,
and in fact this technique would no longer work at all, since 377 board positions would
have to be stored. Thus, overwhelming present computer memories.
So, this technique is not a good AI technique.
Program 3 is an example of the use of an Al technique. For very small problems, it is less
efficient than a variety of more direct methods. However, it can be used in situations where
those methods would fail.
Comments
This program will require much more time because it must search a tree representing all
possible move sequences before making each move.
This program can be extended to handle games more complicated than tic-tac-toe.
It can also be augmented by a variety of specific kinds of knowledge about games and how to
play them.
ii)Question Answering Problem

Here, the programs that read in English text and then answer questions, which are also stated
in English, about that text. It is more difficult to state the problem formally and precisely and
the correct solution to it.
Example: A simple sentence given as input
Russia massed troops on the Czech border.
Then either of the following question-answering dialogues might occur with the POLITICS
program.
Dialogue 1
Q: Why did Russia do this?
A: Because Russia thought that it could take political control of Czechoslovakia by
sending troops.
Q: What should the United States do?
A: The United States should intervene militarily.
Dialogue 2
Q: Why did Russia do this?
A: Because Russia wanted to increase its political influence over Czechoslovakia.
Q: What should the United States do?
A: The United States should denounce the Russian action in the United Nations.
In the POLITICS program, answers were constructed by considering both the input text and a
separate model of the beliefs and actions of various political entities, including Russia.
When the model is changed, the system’s answers also change. To produce a correct answer to
a question may be very hard.
THE LEVEL OF THE MODEL

Artificial Intelligence Model
What is our goal in trying to produce programs that do the intelligent things that people do?’
Are we trying to produce programs that do the tasks the same way that people do? OR Are
we trying to produce programs that simply do the tasks the easiest way that is possible? Or,
are we attempting to produce programs that simply do the tasks in whatever way appears
easiest? There have been Al projects motivated by each of these goals.
Efforts to build programs that perform tasks the way people do can be divided into two classes.
1. Programs in the first class attempt to solve problems that do not really fit our definition
of an Al task. They are problems that a computer could easily solve, Example of this
class of program is the EPAM - Elementary Perceiver and Memorizer which
memorized associated pairs of nonsense syllables. Memorizing pairs of nonsense
syllables is easy for a computer.
2. The second class of programs that attempt to model human performance. Programs thar
are useful tools for psychologists who want to test theories of human performance.
There are several reasons one might want to model human performance at these sorts
of tasks:
• To test psychological theories of human performance: An example of a

program that was written for this reason is PARRY [Colby, 1975], which
exploited a model of human paranoid behavior to simulate the conversational
behavior of a paranoid (Suspicious) person. The model was good enough that
when several psychologists were given the opportunity to converse with the
program via a terminal, they diagnosed its behavior as paranoid.
• To enable computers to understand human reasoning: For example, for a
computer to be able to read a newspaper story and then answer a question, such
as “why did the terrorists kill the hostages?” Its program must be able to
simulate the reasoning processes of people.
• To enable people to understand computer reasoning: In many
circumstances, people are reluctant to rely on the output of a computer unless
they can understand how the machine arrived at its result. If the computer’s
reasoning process is similar to that of people, then producing an acceptable
explanation is much easier.
• To exploit what knowledge, we can gather from people: Since people are the
best- known performers of most of the tasks with which we are dealing, it makes
a lot of sense to look to them for clues as to how to produce.
Disciplines that contribute to AI

The following are the disciplines that contributed ideas, new points, and techniques to AI:
• Philosophy: (428BC- Present)
1. Can formal rules be used to draw valid conclusions?
2. How does the mental mind arise from a physical brain?
3. Where does knowledge come from?
4. How does knowledge lead to action?
• Mathematics
5. What are the formal rules to draw valid conclusions?
6. What can be computed?
7. How do we reason with uncertain information?
• Economics
8. How should we make decisions so as to maximize pay off?
9. How should we do this when others may not go along?
10. How should we do this when the pay off may be far in the feature?
• Neuroscience
How do brains process information?
• Psychology
How do humans and animals think & act?
• Computer Engineer
How can we an efficient as computer?
• Computer Theory and Cybernetics
How can computer operate under their own control?
• Linguistic How does language relate to thought?
The History / Early works of AI:

1. The first work in AI was done by McCalloch and Pitts (1943), who proposed artificial
neurons.
They worked on three sources:
a Knowledge of the basic Physiology.
b Functions of neurons in the brain.
c A formal analysis of propositional logic and turing’s theory of
computations.
2. Later in 1949, Donald Hebb demonstrated a simple updating rule for modifying the
connection strengths between neurons, called Hebbbian Learning.
3. Alan Turing (1950), was the first to articulate the complete vision of AI in his article (1950)
– “Computing Machinery and Intelligence”, he introduced the Turing Test, Machine
learning, genetic algorithms and reinforcement learning.
4. John McCarthy of Dart Month College called the father of AI, introduced automata theory,
neural networks, and the study of intelligence (1956). Also he designed the language LISP
in 1958.
5. Newell and Simon (1976) designed a program called General Problem Solver (GPS) that
imitated human problem solving protocols. GPS was the first program to embody the
“Thinking humanly” approach.
6. Rosenblatt (1962) developed perceptrons by enhancing the Hebban learning algorithm.
7. DENDRAL and MYCIN were the two expert systems – one for Molecule structure analysis
and the other for diagnose the blood infections, developed during 1969, at MIT.
8. Recent years have seen a revolution in both the content and the methodology of work in AI.
9. Complete Agent architecture proposed during 1990 aims to understand the workings of
agents embedded real environments, with continuous sensory inputs. One of the most
important environments for intelligent agents in the Internet.
10. AI Systems have become so common in web-based applications including search engines,
recommended systems, and website construction systems.
What can AI do today?

These are just examples of AI systems that exist today.
Autonomous Planning and scheduling: NASA’S Remote Agent program became the first
on board autonomous planning program to control the scheduling of operations for a
spacecraft (Jonsson et.al., 2000), such as detection, diagnosis and recovery from problems as
that occurred.
Game Planning: IBM’S Deep Blue, Became the first computer to defect the world champion
in chess match when it beated Garry Karpasov (1997), the value of IBM’S stock increased
by $18 billion.
Autonomous Control: The ALVINN computer vision system was trained to steer a car to
keep it following a lane (driving autonomously 98% of the time from Pittsburgh to San Diego,
2850 miles)
Diagnosis: Medical diagnosis programs based on probabilistic analysis have been able to
perform at the level of an expert physician is several areas of medicine.
Logistics Planning: During the Persian Gulf crisis of 1991, US forces deployed a Dynamic
Analysis and Re-planning Tool, (DART) in1994, to do automated logistics planning and
scheduling for transportation. This involved up to 50,000 vehicles, cargo, and people at a
time, and had to account for starting points, destinations, routes and conflict resolution among
all parameters.
Robotics: Many surgeries now use robot assistants in microsurgery. Ex: HIPNAV (1996) is
a system that uses computer vision techniques to create a three- dimensional model of a
patient’s internal anatomy and the uses robotic control to guide the insertion of a hip
replacement prosthesis.
Language Understanding and Problem Solving: PROVERB (1999) is a program that
solves crossword puzzles setter than most humans.
CRITERIA FOR SUCCESS

• One of the most important questions to answer in any scientific or engineering
research project is “How will we know if we have succeeded?” Artificial
intelligence is no exception. How will we know if we have constructed a machine that
is intelligent? That question is at least as hard as the unanswerable question “What is
intelligence?” But can we do anything to measure our progress? - Yes
• In 1950, Alan Turing proposed the following method for determining whether a
machine can think. His method has since become known as the Turing Test. To
conduct this test, we need two people and the machine to be evaluated. One person
plays the role of the interrogator, who is in a separate room from the computer and
the other person. The interrogator can ask questions of either the person or the
computer by typing questions and receiving typed responses. However, the
interrogator knows them only as A and B and aims to determine which is the person
and which is the machine. The goal of the machine is to fool the interrogator into
believing that it is the person. If the machine succeeds at this, then we will conclude
that the machine can think. The machine is allowed to do whatever it can to fool the
interrogator. So, for example, if asked the question “How much is 12,324 times
73,9817” it could wait several minutes and then respond with the wrong answer
[Turing, 1963]. The more serious issue, though, is the amount of knowledge that a
machine would need to pass the Turing test.
• For example, DENDRAL is a program that analyses organic compounds to determine
their structure. It is hard to get a precise measure of DENDRAL's level of achievement
compared to human chemists, but it has produced analyses that have been published as
original research results. Thus, it is certainly performing competently.
• Programs called R1 require minutes to perform tasks that previously required hours of
a skilled engineer’s time. Such programs are usually evaluated by looking at the bottom
line— whether they save (or make) money.
• If our goal in writing a program is to simulate human performance at a task, then the
measure of success is the extent to which the program’s behavior corresponds to
that performance, as measured by various kinds of experiments and protocol analyses.
• Various techniques developed by psychologists for comparing individuals and for
testing models can be used to do this analysis.
• We are forced to conclude that the question of whether a machine has intelligence or
can think is too vague to answer precisely. But it is often possible to construct a
computer program that meets some performance standard for a particular task. That
does not mean that the program does the task in the best possible way. It means only
that we understand at least one way of doing at least part of a task.
LANGUAGE for AI
LISP was the most commonly used language for Al programming. Among the various LISPs,
Common Lisp is now accepted as a standard. Another language that is often used for AI
programming is PROLOG. Al systems are being written in general purpose programming
languages such as C. One reason for this is that AI programs are ceasing to be standalone
systems; instead, they are becoming components of larger systems, which may include
conventional programs and databases of various forms. Robots form the ultimate test-bed for
AL. While Al researchers have brought forth a reasonably large repository of techniques and
programs that are based on the symbol system, implementing them on robots have posed
several problems.
CHAPTER II
PROBLEMS, PROBLEM SPACES, AND SEARCH
Problem
To build an AI system to solve a particular problem, we need to do four things:
1. Define the problem precisely. This definition must include precise specifications of the
initial situation (s) and the final situations that are acceptable solutions to the problem.
2. Analyze the problem. A few very important features can have a huge impact on the
appropriateness of various possible techniques for solving the problem.
3. Isolate and represent the task knowledge that is necessary to solve the problem.
4. Choose the best problem-solving technique(s) and apply it (them) to the particular
problem.
Formal Description of the problem
DEFINING THE PROBLEM AS A STATE SPACE SEARCH
Problem solving is a method of deriving solution steps beginning from initial description of the
problem to the desired solution. In AI, the problems are frequently modelled as a state space
problem where the state space is a set of all possible states from start state to goal states.
In order to provide a formal description of a problem, we must do the following:
1. Define a state space that contains all the possible configurations of the relevant objects.
2. Specify one or more states within that space that describe the situation from which the
problem-solving process starts. These states are called the initial states.
3. Specify one or more states that would be acceptable as solutions to the problem. These
states are called goal states.
4. Specify a set of rules that describe the actions/operators available.
5. Doing this will require the following issues: the unstated assumptions, generalization
of the rules and how much of the work can be precomputed and represented in the rules
6. The problem can then be solved by using the rules, with a control strategy, to move
through the problem space until a path from an initial state to a goal state is found.
7. Thus, the process of search is fundamental to the problem-solving process.
Playing Chess:
We need to specify the starting position of the chess board, the rules that define the legal
moves, and the board positions that represent a win for one side or the other.
• The starting position can be described as an 8 x 8 array where each position
contains a symbol standing for the appropriate piece in the official chess opening
position.
• We can define as our goal any board position in which the opponent does not have
a legal move and his or her king is under attack.
• The legal moves provide the way of getting from the initial state to a goal state.
• They can be described easily as a set of rules consisting of two parts: a left side
that serves as a pattern to be matched against the current board position and a right
side that describes the change to be made to the board position to reflect the move.
• There are several ways in which these rules can be written.
• No person could ever supply a complete set of such rules.
• No program could easily handle all those rules.
So, the problem of playing chess is defined as a problem of moving around in a state space,
where each state corresponds to a legal position of the board. This state space representation
will be suitable for chess.
Water Jug Problem

Problem definition:
There are two jugs, a 4-gallon one and a 3-gallon one. There is no measurement marked
on it. There is a pump that can be used to fill the jugs with water. How can you get exactly 2
gallons of water into the 4-gallon jug?
State Space representation of Water Jug problem
The state space for this problem can be described as the set of ordered pairs of integers (x, y),
such that
x = 0, 1, 2, 3, or 4 and y=0, 1, 2, or 3
x represents the number of gallons of water in the 4-gallon jug, and
y represents the number of gallons of water in the 3-gallon jug.
The start state is (0, 0). The goal state is (2, n) for any value of n because the problem does
not specify how many gallons need to be in the 3-gallon jug
Assumptions:
It is necessary to make explicit some assumptions that are not mentioned in the problem
statement.
1. We can fill a jug from the pump & we can pour water out of a jug onto the ground
2. We can pour water from one jug to another
3. There are no other measuring devices available.
Production Rules:
The word “operator” refers to some representation of an action. The operators to be used to
solve the problem are described in Fig. 2.3. They are represented as rules whose left sides are
matched against the current state and whose right sides describe the new state that results from
applying the rule.
Control structure
we need a control structure that loops through a simple cycle in which some rule whose left
side matches the current state is chosen. Then the appropriate change is made as described in
the right side. The resulting state is checked to see if it corresponds to a goal state. If it is not a
goal state, the cycle continues.
Clearly the speed with which the problem gets solved depends on the mechanism that is used
to select the next operation to be performed.
For the water jug problem, as with many others, there are several sequences of operators that
solve the problem. One such sequence is shown in Fig. 2.4.
PRODUCTION SYSTEMS or (Rule based systems) or (Inferential systems)
The search forms the core of many intelligent processes, it is useful to structure AI programs
in a way that facilities describing the search process. So, production systems provide the
structure for AI programs that makes searching process easy. Production systems provide a
structure to facilitate describing and performing the search process. A definition of a production
system is given below:
A production system consists of:
• A set of rules, each consisting of a left side (a pattern) that determines the applicability
of the rule and a right side that describes the operation to be performed if the rule is
applied.
• One or more knowledge/databases that contain whatever information is appropriate
for the particular task. Some parts of the database may be permanent, while other parts
of it may pertain only to the solution of the current problem. The information in these
databases may be structured in any appropriate way.
• A control strategy that specifies the order in which the rules will be compared to the
database and a way of resolving the conflicts that arise when several rules match at
once.
• A tule applier.
Some of them are:
• Basic production system languages, such as OPS5and ACT*
• More complex hybrid systems called expert system shells, which provide complete
environment for the construction of knowledge- based expert systems.
• General problem-solving architectures like SOAR, a system based on a specific set of
cognitively motivated hypotheses about the nature of problem-solving.
The process of solving the problem can be modelled as a production system.
Advantages of production system
In general, the process of solving a problem can usefully be modeled as a production system.
i.e, any computable procedure can be modelled as production system.
It models strong data-driven nature of intelligent action
New rules can easily be added to take care of new situations without disturbing the
rest of the system
Handling of changes in knowledge base is dynamic
Helps in making inference mechanism easy.
PRODUCTION SYSTEM CHARACTERISTICS
There are 4 categories of Production Systems
Monotonic Non-Monotonic
Partially Ex: theorem proving Ex: Blocks world, 8-
commutative puzzle
Non-
Ex: chemical Ex: Bridge/cards
partially
Synthesis problem
commutative
• A Monotonic Production System: is one in which the application of a rule never prevents
the later application of another rule that could also have been applied at the time the first rule
was selected.
• A Non-Monotonic Production System: is a production system is one in which the above
statement is not true.
• A partially commutative production system: is a production system with the property that
if the application of a particulars sequence of rules transforms state ‘X’ into state ‘Y’ then any
permutation of those rules that is allowable also transforms state ‘X’ into state ‘Y’. Partially
commutative monotonic production systems are useful for solving ignorable problems and
these can be easily implemented without the ability to backtrack to previous states when it is
discovered that an incorrect path has been followed.
• Non- Partially commutative production system: are useful for many problems in which
irreversible changes occur.
• Non-Monotonic Partially commutative systems: are useful for problems in which changes
occur but can be reversed in which order of operations is not critical.
• Commutative production system is one which is both monotonic and partially
commutative.
Control Strategies
Control strategy describes the order of application of the rules to the current state. A control
strategy is needed to decide which rule to apply next during the process of searching for a
solution to a problem when more than one rule matches the current state. There are many
control Strategies exist uniformed search strategies like DFS, BFS and informed / heuristic
Strategies like Hill climbing, Backtracking, Branch and Bound, Best first search etc.,.
• They should cause motion, so that it lead to a solution

• They should be systematic, so that it lead to a solution Ex: BFS and DFS strategies
Systematic Control Strategy
Breadth First Search
Construct a tree with initial state as root. Generate all possible successors to this root node.
Continue this process until some rule produces a goal state. The tree for two levels is as
follows:
This process, called breadth-first search, can be described precisely as follows.

Algorithm: Breadth-First Search
1. Create a variable called NODE-LIST and set it to the initial state.
2. Until a goal state is found or NODE-LIST is empty:
(a) Remove the first element from NODE-LIST and call it E. If NODE-LIST was
empty. quit.
(b) For each way that each rule can match the state described in E do:
(i) Apply the rule to generate a new state,
(ii) If the new state is a goal state. quit and return this state.
(iii) Otherwise, add the new state to the end of NODE-LIST.
Search Tree for Water Jug Problem:
Depth First Search
we could pursue a single branch of the tree until it yields a solution or until a decision to
terminate the path is made. It makes sense to terminate a path if it reaches a dead-end,
produces a previous state, or becomes longer than some prespecified “futility” limit. In such a
case, backtracking occurs. The most recently created state from which alternative moves are
available will be revisited and a new state will be created. This form of backtracking is called
chronological backtracking because the order in which steps are undone depends only on the
temporal sequence in which the steps were originally made. Specifically, the most recent step
is always the first to be undone. This form of backtracking is what is usually meant by the
simple term backtracking. But there are other ways of retracting steps of a computation. We
discuss one important such way, dependency-directed backtracking. in Chapter 7. Until
then, though, when we use the term backtracking, it means chronological backtracking.
The search procedure we have just described is also called depth-first search. The following
algorithm
describes this precisely.
Algorithm: Depth-First Search

1. If the initial state is a goal state, quit and return success.
2. Otherwise, do the following until success or failure is signalled:
(a) Generate a successor, E, of the initial state. If there are no more successors, signal
failure.
(b) Call Depth-First Search with E as the initial state.
(c) If success is returned, signal success. Otherwise continue in this loop.
Figure 2.7 shows a snapshot of a depth-first search for the water jug problem. A comparison
of these two simple methods produce the following observations:
Advantages of Depth-First Search
Depth-first search requires less memory since only the nodes on the current path are stored.
This contrasts with breadth-first search, where all of the tree that has so far been generated
must be stored.
By chance (or if care is taken in ordering the alternative successor states), depth-first search
may find a solution without examining much of the search space at all. This contrasts with
breadth-first search in which all parts of the tree must be examined to level # before any nodes
on level n+ 1 can be examined.
This is particularly significant if many acceptable solutions exist. Depth-first search can stop
when one of them is found.
Advantages of Breadth-First Search
s Breadth-first search will not get trapped exploring a blind alley. This contrasts with depth-
{irst searching, which may follow a single, unfruitful path for a very long time, perhaps forever,
before the path actually terminates in a state that has no successors, This is a particular problem
in depth-first search if there are loops (i.e., a state has a successor that is also one of is ancestors)
unless special care is expended to test for such a situation. The example in Fig. 2.7, if it
continues always choosing the first (in numerical sequence) rule that applies, will have exactly
this problem.
o If there is a solution, then breadth-first search is guaranteed to find it. Furthermore, if there
are multiple solutions, then a minimal solution (i.e., one that requires the minimum number of
steps) will be found.
This is guaranteed by the fact that longer paths are never explored until 4ll shorter ones have
already been examined. This contrasts with depth-first search, which may find 4 long path to a
solution in one part of the tree, when a shorter path exists in some other. unexplored part of the
tree.
Clearly what we would like is a way to combine the advantages of both of these methods. In
Section 3.3 we
will talk about one way of doing this when we have some additional information. Later, in
Section 12.5, we will describe an uninformed way of doing so.
For the water jug problem, most control strategies that cause motion and are systematic will
lead to an answer. The problem is sitnple. But this 1s not always the case. In order to solve
some problems during our lifetime, we must also demand a control structure that is efficient.
The Travelling Salesman Problem
A Salesman has a list of cities, each of which he must visit exactly once. There are direct
roads between each pair of cities on the list. Find the route the Salesman should follow for
the shortest possible round trip that both starts and finishes at any one of the cities.
A simple, motion-causing and systematic control structure could, in principle, solve this
problem. It would simply explore all possible paths in the tree and return the one with the
shortest length. This approach will work in practice for very short lists of cities. But it breaks
down quickly as the number of cities grows. If there are N cities, then the number of
different paths among them is 1.2… (N-1), or (N-1)! . The time to examine a single path is
proportional to N. So the total time required to perform this search is proportional to N! .
Assuming there are only 10 cities, 10! is 3,628,800, which is a very large number. The
salesman could easily have 25 cities to visit. To solve this problem would take more time
than he would be willing to spend. This phenomenon is called combinatorial explosion. To
combat it, we need a new control strategy.
Heuristic Search
✓
A Heuristic function is a function that maps from problem state description to
measures of desirability, usually represented as numbers.
✓
Improves the efficiency of a search process and always finds a very goal solution for
hard problems.
✓
In general, it points in interesting directions.
✓
Like tour guides, helps to guide a search process. Ex: nearest Neighbor Heuristic
✓
Well designed Heuristic functions can play an important part in efficiently guiding a
search process toward a solution.
✓
Sometimes very simple Heuristic functions can provide a fairly good estimate of
whether a path is good or not. In other situations, more complex Heuristic functions should
be employed.
Note:
1. Production systems represent both knowledge as well as action.

2. Production systems provide a language in which the representation of expert
knowledge is very natural.
3. Production systems provide a Heuristic model for human behavior.
4. A good solution to a problem requires an efficient control strategy.
AI PROBLEM CHARACTERISTICS
In order to choose the most appropriate method for a particular problem, it is necessary to
analyze the problem along several key dimensions / Characteristics:
1 Is the problem Decomposable into a set of independent smaller and/or easier sub
problem? (D)
2 Can solution steps be Ignored or at least undone if they prove unwise? (I)
3 Is the problem’s universe Predictable? [predictability (P)]
4 Is a good solution to the problem obvious without comparison to all other possible
solutions? [Comparability ( C )]
5 Is the desired solution a state of the world OR a path to a state? (R)
6 Is a large amount of knowledge absolutely required to solve the problem, OR is
knowledge important only to constrain the search? [knowledge based consistency (C
)]
7 Can a computer that is simply given the problem return the solution, or will the solution
of the problem require interaction between the computer and a person? [ Interactions
(I) ]
Note: Above characteristics may be abbreviated as D I P C R C I for easy remembrance.
1. Is the problem decomposable?

Suppose the following expression is given, then it could be solved as follows. It can be
broken down or decomposed into three smaller problems, each of which can be solved
using small collection of specific rules.
2. Can solution steps be Ignored or Undone?
These three problems: theorem proving, 8-puzzle and chess illustrate the differences
between three important classes of problems:
Example:
• Ignorable (e.g.., theorem proving), in which solution steps can be ignored
• Recoverable (e.g.., 8-puzzle), in which solution steps can be undone
• Irrecoverable (e.g.., chess), in which solution steps cannot be undone
These three definitions make reference to the steps of the solution to a problem and thus
may appear to characterize particular production system for solving a problem. This is
true for each of the problem used as examples above. When this is the case, it makes
sense to view the recoverability of a problem as equivalent to recoverability of a natural
formulation of it.
3. Is the problem’s Universe Predictable?
The planning process can only be done effectively for certain-outcome. A few examples
of such problems are:
Playing bridge: we can do fairly well since we have available accurate estimates of the
probabilities of each of the possible outcomes.
Controlling a robot arm: The outcome is uncertain for a variety of reasons. Someone
might move something into the path of the arm. The gears of the arm might stick. A slight
error could cause the arm to knock over a whole stack of things.
Helping a lawyer decide how to defend his client against a murder charge: Here we
probably cannot even list all the possible outcomes, much less assess their probabilities.
Examples:
In 8-puzzle problem  certain outcome
In bridge problem  uncertain outcome (cards)
In controlling a Robot arm  uncertain outcome
In helping a lawyer decides how to depend his client against a murder charge. (cannot
list all possible outcomes)
4. Is a good solution absolute OR Relative?

Example: In the problem of drawing conclusions from a given set of premises, we may
need to consider the facts in some sequence and try to relate them to find new facts
which can intern be used as premises in conclusion drawing process. So, we will have a
relative path from facts / premises to the conclusion.
Example: Marcus was a man.

Marcus was a Pompeian.
Marcus was born in 40 AD.
All men are mortal.
All Pompeian’s died when the volcano erupted in 79 AD. No
mortal lives longer than 150 years it is now 1991 AD.
Suppose, we ask the question, “Is Marcus alive?”
No. Here, Solution will be a relative path.
5. Is the solution a state or a path?

Example: In man, Tiger, Cow, & Grass problem, solution is a
state. In Travelling salesman problem solution is a path.
6. What is the Role of the Knowledge? Is the Knowledge base consistent?

Example: In playing chess, role of knowledge is little-just the rules for legal moves &
simple control mechanisms whereas, in the problem of scanning daily news papers and
deciding which are supporting the democrats and which are supporting republicans in
some election, the role of knowledge is extensive. Knowledge base used for solving a
problem should be consistent.
7. Does the task require interactions with a person?

Can we compute simply the given problem and return the solution? OR will the solution
of a problem requires user interaction?
8. Problem Classification
There are several broad classes into which the problems may fall.
Example: In Diagnostic tasks, i.e. both in medical and fault diagnosis, we need to
examine the input and decide which of a set of known classes, the input is an instance
of?
ISSUES IN THE DESIGN OF SEARCH PROGRAM
Every search process can be viewed as a traversal of a tree structure in which each node
represents a problem state and each arc represents a relationship between the states represented
by the nodes it connects. For example, Figure below shows part of a search tree for a water
jug problem. The search process must find a path or paths through the tree that connect an
initial state with one or more final states. The tree that must be searched could in principle,
be constructed in it’s entirely from the rules that define allowable moves in the problem space.
Some important issues that arise in all of them:
The direction in which to conduct the search (forward versus backward reasoning).
How to select applicable rules (matching). Production systems typically spend most of their
time looking for rules to apply, so it is critical to have efficient procedures for matching rules
against states.
How to represent each node of the search process. For problems like chess, a node can be
fully represented by a simple array. In more complex problem solving, however, it is
inefficient and/or impossible to represent all of the facts in the world and to determine all of
the side effects an action may have.
A Search Tree for the Water Jug Problem
For example, in the tree shown above, the node (4, 3), can be generated either by first filling
the 4-gallon jug and then the 3-gallon one. This example also illustrates another problem that
often arises when the search process operates as a tree walk. On the third level, the node (0,
0) appears. But this is the same as the top node of the tree, which has already been expanded.
Those two paths have not gotten us anywhere, so we would like to eliminate them and
continue only along the other branches. The waste of effort that arises when the same node is
generated more than once can be avoided at the price of additional bookkeeping. Instead of
traversing a search tree, we traverse a directed graph. This graph differs from a tree in that
several paths may come together at a node. The graph corresponding to the tree is shown in
figure below. Any tree search procedure that keeps track of all the nodes that have been
generated so far can be converted to a graph search procedure by modifying the action
performed each time a node is generated. Graph search procedures are useful for dealing
with partially commutative production systems.
A Search Graph for the Water Jug Problem
ADDITIONAL AI PROBLEMS
• The Missionaries and Cannibals Problem

• The Tower of Hanoi
• The Monkey and Bananas Problem
• Cryptarithmetic Problems

Machine Learning-Unit-V-Notes

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Machine Learning-Unit-V-Notes

Uploaded by

Copyright:

Available Formats

ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

THE UNDERLYING ASSUMPTION

• Although AI techniques must be designed in keeping with these constraints imposed

There are three important AI techniques:

1. Search — Provides a way of solving problems for which no direct approach is

But it has several disadvantages:

ii)Question Answering Problem

Russia massed troops on the Czech border.

THE LEVEL OF THE MODEL

• To test psychological theories of human performance: An example of a

Disciplines that contribute to AI

The History / Early works of AI:

What can AI do today?

CRITERIA FOR SUCCESS

Water Jug Problem

PRODUCTION SYSTEM CHARACTERISTICS

There are 4 categories of Production Systems

• They should cause motion, so that it lead to a solution

This process, called breadth-first search, can be described precisely as follows.

Algorithm: Depth-First Search

The Travelling Salesman Problem

1. Production systems represent both knowledge as well as action.

Note: Above characteristics may be abbreviated as D I P C R C I for easy remembrance.

1. Is the problem decomposable?

• Recoverable (e.g.., 8-puzzle), in which solution steps can be undone

• Irrecoverable (e.g.., chess), in which solution steps cannot be undone

3. Is the problem’s Universe Predictable?

4. Is a good solution absolute OR Relative?

Example: Marcus was a man.

5. Is the solution a state or a path?

6. What is the Role of the Knowledge? Is the Knowledge base consistent?

7. Does the task require interactions with a person?

ISSUES IN THE DESIGN OF SEARCH PROGRAM

Some important issues that arise in all of them:

A Search Graph for the Water Jug Problem

• The Missionaries and Cannibals Problem

You might also like