
The Intellectual History of AI
CS158-2: Introduction to Artificial Intelligence
4th Term AY 2019-2020
Outline
• The gestation of artificial intelligence (1943-1955)
• The birth of artificial intelligence (1956)
• Early enthusiasm, great expectations (1952-1969)
• A dose of reality (1966-1973)
• Knowledge-based systems: The key to power? (1969-1979)
• AI becomes an industry (1980-present)
• The return of neural networks (1986-present)
• AI adopts the scientific method (1987-present)
• The emergence of intelligent agents (1995-present)
• The availability of large data sets (2001-present)

The gestation of artificial
intelligence (1943-1955)
First Work Recognized as Artificial Intelligence
• In 1943, Warren McCulloch & Walter Pitts proposed a model of artificial neurons in which each neuron is characterized as being “on” or “off”.
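The on/off behavior of a McCulloch-Pitts unit can be sketched as a simple threshold function. The snippet below is an illustrative Python sketch; the particular weights and thresholds used for AND and OR are my own choice for demonstration, not from the slide:

```python
def mcp_neuron(inputs, weights, threshold):
    """McCulloch-Pitts unit: fires ("on" = 1) iff the weighted sum
    of its binary inputs reaches the threshold; otherwise "off" = 0."""
    total = sum(x * w for x, w in zip(inputs, weights))
    return 1 if total >= threshold else 0

# With unit weights, the threshold alone selects the logic function:
assert mcp_neuron([1, 1], [1, 1], 2) == 1   # threshold 2 acts as AND
assert mcp_neuron([1, 0], [1, 1], 2) == 0
assert mcp_neuron([1, 0], [1, 1], 1) == 1   # threshold 1 acts as OR
```

McCulloch and Pitts showed that networks of such units can compute any logical function, which is why this model is regarded as the first work on artificial intelligence.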
Hebbian Learning
• In 1949, Donald Hebb
demonstrated a simple updating
rule for modifying the connection
strengths between neurons.
• His rule, now called Hebbian
learning, remains an influential
model to this day.
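Hebb's updating rule can be sketched as: strengthen a connection in proportion to the co-activity of the neurons it joins ("cells that fire together wire together"). A minimal Python sketch, with a hypothetical learning rate `lr`:

```python
def hebbian_update(w, pre, post, lr=0.1):
    """Hebb's rule sketch: increase each weight in proportion to the
    co-activity of its presynaptic input and the postsynaptic output."""
    return [wi + lr * pre_i * post for wi, pre_i in zip(w, pre)]

w = [0.0, 0.0]
# Repeatedly present a pattern in which input 0 and the output co-fire:
for _ in range(5):
    w = hebbian_update(w, pre=[1, 0], post=1)
# w[0] has grown (to ~0.5); w[1], whose input was never active, stays 0.0.
```

Only the connection carrying the co-active input is strengthened, which is the essential behavior of the rule.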
SNARC (1950)
• Stochastic Neural Analog
Reinforcement Calculator 
• First neural network computer built
by Harvard undergraduate students
Marvin Minsky and Dean
Edmonds.
• Used 3000 vacuum tubes and a
surplus automatic pilot mechanism
from a B-24 bomber to simulate a
network of 40 neurons.
Turing Test (1950)
In his article “Computing Machinery
and Intelligence”, Alan Turing
introduced the Turing Test, machine
learning, genetic algorithms, and
reinforcement learning.
The birth of
artificial
intelligence (1956)
Dartmouth workshop proposal

“We propose that a 2 month, 10 man study of artificial intelligence be carried out during the summer of 1956 at Dartmouth College in Hanover, New Hampshire.

…The study is to proceed on the basis of the conjecture that every aspect of learning or any other feature of intelligence can in principle be so precisely described that a machine can be made to simulate it.

…An attempt will be made to find how to make machines use language, form abstractions and concepts, solve kinds of problems now reserved for humans, and improve themselves.

…We think that a significant advance can be made in one or more of these problems if a carefully selected group of scientists work on it together for a summer.”
Early enthusiasm, great
expectations (1952-1969)
Logic Theorist (1956)
• Written by Allen Newell, Herbert Simon, & John Clifford Shaw
• First program deliberately engineered to perform automated reasoning
• Regarded as the “first artificial intelligence program”
• Coded using an early version of the IPL (Information Processing Language) programming language
• Established the field of heuristic programming
• Proved 38 of the first 52 theorems in chapter 2 of Principia Mathematica
General Problem Solver (1959)
• Written by Allen Newell, Herbert Simon, & John Clifford Shaw
• Designed to imitate human problem-solving protocols
• Probably the first program to embody the “thinking humanly” approach
Physical Symbol System (1976)
“A physical symbol system has the necessary and sufficient means for general intelligent action.”
- Allen Newell & Herbert Simon
First AI Programs at IBM
• Herbert Gelernter (1959): Geometry Theorem Prover, which was able to prove theorems that many students of mathematics would find quite tricky
• Arthur Samuel (1952): checkers program, which was able to learn to play at a strong amateur level
John McCarthy’s Crucial Contributions in 1958
• Defined the high-level language Lisp, which was to become the dominant AI programming language for the next 30 years.
• Invented time-sharing, the sharing of a computing resource among many users at the same time.
• Published a paper entitled Programs with Common Sense, in which he described the Advice Taker, a hypothetical program that can be seen as the first AI system.
Microworlds
• James Slagle’s SAINT program (1963) was able to solve closed-form calculus integration problems typical of first-year college courses.
• Tom Evans’s ANALOGY program (1968) solved geometric analogy problems that appear in IQ tests.
• Daniel Bobrow’s STUDENT program (1967) solved algebra story problems.
Blocks world
A scene from the blocks world. SHRDLU (Winograd, 1972) has just completed the command “Find a block which is taller than the one you are holding and put it in the box.”
A dose of reality
(1966-1973)
Difficulties that have arisen
1. Most early programs knew nothing of their subject matter; they succeeded by means of simple syntactic manipulations.
2. Intractability of many of the problems that AI was attempting to solve.
3. Some fundamental limitations on the basic structures being used to generate intelligent behavior.
Knowledge-based systems: The
key to power? (1969-1979)
Expert System: The DENDRAL Program (1969)
• Developed at Stanford by Ed Feigenbaum, Bruce Buchanan, and Joshua Lederberg
• Aimed to solve the problem of inferring molecular structure from the information provided by a mass spectrometer
• Regarded as the first successful knowledge-intensive system: its expertise derived from large numbers of special-purpose rules
MYCIN Expert System (1972)
• Feigenbaum, Buchanan, and Dr. Edward Shortliffe developed MYCIN to diagnose blood infections.
• Two major differences from DENDRAL:
1. Unlike the DENDRAL rules, no general theoretical model existed from which the MYCIN rules could be deduced.
2. The rules had to reflect the uncertainty associated with medical knowledge.
SHRDLU System (1972)
• Developed by Terry Winograd at MIT as an early natural language understanding computer program
• Its dependence on syntactic analysis caused some of the same problems as occurred in the early machine translation work
• It was able to overcome ambiguity and understand pronoun references
AI becomes an industry
(1980-present)
R1 Program / XCON (for eXpert CONfigurer)
• Developed by John P. McDermott of CMU in 1978 to assist in the ordering of Digital Equipment Corporation’s VAX computer systems by automatically selecting the computer system components based on the customer’s requirements.
• Regarded as the first successful commercial expert system.
• By 1986, it was saving the company an estimated $40 million a year.
AI Industry Boom
• In 1981, Japan announced the “Fifth Generation” project, a 10-year plan to build intelligent computers running Prolog.
• In response, the USA formed the Microelectronics and Computer Technology Corporation (MCC) as a research consortium designed to assure national competitiveness.
• In Great Britain, the Alvey report reinstated the funding that was cut by the Lighthill report.
The return of neural
networks (1986-
present)
AI adopts the scientific
method (1987-present)
Hidden Markov Models (HMMs) in Speech Recognition
Two aspects of HMMs are relevant:
1. Based on a rigorous mathematical theory
2. Generated by a process of training on a large corpus of real speech data
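Part of that rigorous mathematical theory is the forward algorithm, which computes the probability of an observation sequence by summing over all hidden state paths. A toy Python sketch; the two-state model and its probabilities are invented for illustration and are not real speech data:

```python
def forward(obs, states, start_p, trans_p, emit_p):
    """Forward algorithm: P(observation sequence) under an HMM,
    summing over hidden state paths in O(T * |S|^2) time."""
    alpha = {s: start_p[s] * emit_p[s][obs[0]] for s in states}
    for o in obs[1:]:
        alpha = {s: emit_p[s][o] * sum(alpha[r] * trans_p[r][s] for r in states)
                 for s in states}
    return sum(alpha.values())

# Hypothetical two-state toy model over a two-symbol alphabet:
states = ["vowel", "consonant"]
start_p = {"vowel": 0.5, "consonant": 0.5}
trans_p = {"vowel": {"vowel": 0.3, "consonant": 0.7},
           "consonant": {"vowel": 0.6, "consonant": 0.4}}
emit_p = {"vowel": {"a": 0.8, "t": 0.2},
          "consonant": {"a": 0.1, "t": 0.9}}
p = forward(["a", "t", "a"], states, start_p, trans_p, emit_p)
```

In speech recognition the states would model sub-word units and the parameters would be estimated from a large training corpus, which is the second aspect the slide mentions.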
Other fields
• Machine translation
• Neural networks

Bayesian Network
• In 1988, Judea Pearl’s Probabilistic Reasoning in Intelligent Systems led to a new acceptance of probability and decision theory in AI.
• The Bayesian network formalism was invented to allow efficient representation of, and rigorous reasoning with, uncertain knowledge.
• This approach allows for learning from experience, and combines the best of classical AI and neural nets.
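The efficiency comes from factoring the full joint distribution into small local conditional tables. A minimal Python sketch using the well-known burglary/earthquake/alarm example; inference here is by brute-force enumeration purely to show the factorization, whereas practical systems use far more efficient algorithms:

```python
# Network: Burglary -> Alarm <- Earthquake.
# The joint factors as P(B) * P(E) * P(A | B, E), so three small
# tables replace a table over all 8 joint assignments.
P_B = {True: 0.001, False: 0.999}
P_E = {True: 0.002, False: 0.998}
P_A = {  # P(Alarm=true | B, E)
    (True, True): 0.95, (True, False): 0.94,
    (False, True): 0.29, (False, False): 0.001,
}

def joint(b, e, a):
    """Probability of one full assignment via the chain-rule factorization."""
    pa = P_A[(b, e)] if a else 1 - P_A[(b, e)]
    return P_B[b] * P_E[e] * pa

# Query P(Burglary | Alarm=true): sum out Earthquake, then normalize.
num = sum(joint(True, e, True) for e in (True, False))
den = sum(joint(b, e, True) for b in (True, False) for e in (True, False))
posterior = num / den   # roughly 0.37: the alarm makes burglary far likelier
```

Rigorous reasoning with uncertain knowledge reduces to sums and products over these local tables, which is exactly what the formalism was invented to support.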
The emergence of
intelligent agents (1995-
present)
Intelligent Agent
• An agent is just something that acts: it operates autonomously, perceives its environment, persists over a prolonged time period, adapts to change, and creates and pursues goals.
• A rational agent is one that acts so as to achieve the best outcome or,
when there is uncertainty, the best expected outcome.
• An intelligent agent refers to an autonomous entity which acts upon
an environment using observation through sensors and consequent
actuators towards achieving goals.
SOAR (1990)
• A complete agent architecture created by John Laird, Allen Newell, and Paul Rosenbloom at Carnegie Mellon University.
• Its goal is to develop the fixed computational building blocks necessary for general intelligent agents.
• Both a theory of what cognition is and a computational implementation of that theory.
Consequences of building complete agents
• Realization that the previously isolated subfields of AI might need to be reorganized somewhat when their results are to be tied together.
• AI has been drawn into much closer contact with other fields, such as control theory and economics, that also deal with agents.
Human-Level AI (HLAI)
• A symposium first held in 2004; its goals require very large knowledge bases for AI, and it discussed where these knowledge bases might come from.
• Based on the belief that AI should return to its roots of striving for, in Simon’s words, “machines that think, that learn, and that create.”
Artificial General Intelligence (AGI)
• A related idea which held its first conference in 2007.
• Organized the Journal of Artificial General Intelligence in 2008.
• Looks for a universal algorithm for learning and acting in any environment, and has its roots in the work of Ray Solomonoff.
The availability of very large
data sets (2001-present)
Examples
• Yarowsky (1995): work on word-sense disambiguation showed accuracy above 96% with no labelled examples.
• Banko and Brill (2001): showed that a mediocre algorithm with 100 million words of unlabeled training data outperforms the best known algorithm with 1 million words.
• Hays and Efros (2007): defined an algorithm to find matches in photographs and found that its performance became excellent when they grew the collection from 10,000 to 2 million photos.
Questions?
