Professional Documents
Culture Documents
CH 1
CH 1
Why Machine Learning? What is a well-de ned learning problem? An example: learning to play checkers What questions should we ask about Machine Learning?
Patient103 time=2
Age: 23 FirstPregnancy: no Anemia: no Diabetes: YES PreviousPrematureBirth: no Ultrasound: abnormal Elective CSection: no Emergency CSection: ? ...
...
Patient103 time=n
Age: 23 FirstPregnancy: no Anemia: no Diabetes: no PreviousPrematureBirth: no Ultrasound: ? Elective CSection: no
Given: 9714 patient records, each describing a pregnancy and birth Each patient record contains 215 features Learn to predict: Classes of future patients at high risk for Emergency Cesarean Section
Datamining Result
Data:
Patient103 time=1
Age: 23 FirstPregnancy: no Anemia: no Diabetes: no PreviousPrematureBirth: no Ultrasound: ? Elective CSection: ? Emergency CSection: ? ...
Patient103 time=2
Age: 23 FirstPregnancy: no Anemia: no Diabetes: YES PreviousPrematureBirth: no Ultrasound: abnormal Elective CSection: no Emergency CSection: ? ...
...
Patient103 time=n
Age: 23 FirstPregnancy: no Anemia: no Diabetes: no PreviousPrematureBirth: no Ultrasound: ? Elective CSection: no
Customer103: (time=t1)
Years of credit: 9 Loan balance: $3,250 Income: ? Own House: Yes Other delinquent accts: 2 Max billing cycles late: 4 Profitable customer?: ? ...
...
Customer103: (time=tn)
Years of credit: 9 Loan balance: $4,500 Income: ? Own House: Yes Other delinquent accts: 3 Max billing cycles late: 6
Profitable customer?: No
...
Other-Delinquent-Accounts = 0, and (Income > $30k) OR (Years-of-Credit > 3) Then Profitable-Customer? = Yes Accept Credit Card application]
Customer103: (time=t1)
Sex: M Age: 53 Income: $50k Own House: Yes MS Products: Word Computer: Pentium Purchase Excel?: ? ...
...
Customer103: (time=tn)
Sex: M Age: 53 Income: $50k Own House: Yes MS Products: Word Computer: Pentium
Customer retention:
Customer103: (time=t0)
Sex: M Age: 53 Income: $50k Own House: Yes Checking: $5k Savings: $15k Currentcustomer?: yes ...
Customer103: (time=t1)
Sex: M Age: 53 Income: $50k Own House: Yes Checking: $20k Savings: $0 ... Currentcustomer?: yes
...
Customer103: (time=tn)
Sex: M Age: 53 Income: $50k Own House: Yes Checking: $0 Savings: $0
Currentcustomer?: No
Process optimization:
Product72:
(time=t0)
Product72:
(time=t1)
...
Product72:
(time=tn)
Stage: mix Mixingspeed: 60rpm Viscosity: 1.3 Fat content: 15% Density: 2.8 Spectral peak: 2800 Product underweight?: ?? ...
Stage: cook Temperature: 325 Viscosity: 3.2 Fat content: 12% Density: 1.1 Spectral peak: 3200 Product underweight?: ?? ...
Stage: cool Fanspeed: medium Viscosity: 1.3 Fat content: 12% Density: 1.2 Spectral peak: 3100
Problems Too Di cult to Program by Hand ALVINN Pomerleau] drives 70 mph on highways
Sharp Left
Straight Ahead
Sharp Right
30 Output Units
4 Hidden Units
http://www.wisewire.com
Machine Learning, T. Mitchell, McGraw Hill, 1997
Relevant Disciplines
Arti cial intelligence Bayesian methods Computational complexity theory Control theory Information theory Philosophy Psychology and neurobiology Statistics
:::
10
E.g., Learn to play checkers : Play checkers : % of games won in world tournament : opportunity to play against self
T P E
11
: Play checkers : Percent of games won in world tournament What experience? What exactly should be learned? How shall it be represented? What speci c algorithm to learn it?
12
13
...
Board
! < ??
Board
M ove
??
14
if is a nal board state that is won, then ( ) = 100 if is a nal board state that is lost, then ( ) = ?100 if is a nal board state that is drawn, then ( )=0 if is a not a nal state in the game, then ( ) = ( ), where is the best nal board state that can be achieved starting from and playing optimally until the end of the game. This gives correct values, but is not operational
b V b b V b b V b b V b V b
0
15
16
rt b
( )+ 2 ( )+ 3 ( )+ 4 ( )+ 5 ( )+ ( ): number of black pieces on board ( ): number of red pieces on ( ): number of black kings on ( ): number of red kings on ( ): number of red pieces threatened by black (i.e., which can be taken on black's next turn) ( ): number of black pieces threatened by red
w1 bp b w rp b w bk b w rk b w bt b bp b b rp b b bk b b rk b b bt b
w6 rt b
17
V V
( ): the true target function ^ ( ) : the learned function train( ): the training value
b b b b V S uccessor b
18
Do repeatedly: Select a training example at random 1. Compute ( ): ( ) = train( ) ? ^ ( ) 2. For each board feature i, update weight i: () i i+ i is some small constant, say 0.1, to moderate the rate of learning
b error b error b V b V b f w w w c f error b
19
Design Choices
Determine Type of Training Experience Games against experts
...
Board move
...
Polynomial
...
Linear function of six features Artificial neural network
Linear programming
...
20
21