Professional Documents
Culture Documents
Akshay Krishnamurthy
akshay@cs.umass.edu
With Alekh Agarwal, Tzu-Kuo Huang, Hal Daumé III, John Langford
Cost-Sensitive Learning
Multi-class prediction where different predictions incur different cost.
• Data (x, c) where c(y ) is cost for predicting label y on x.
Cost-Sensitive Learning
Multi-class prediction where different predictions incur different cost.
• Data (x, c) where c(y ) is cost for predicting label y on x.
• Training: With data (xi , ci )
X
gy = argmin (g (xi ) − ci (y ))2
g ∈G
i
Cost-Sensitive Learning
Multi-class prediction where different predictions incur different cost.
• Data (x, c) where c(y ) is cost for predicting label y on x.
• Training: With data (xi , ci )
X
gy = argmin (g (xi ) − ci (y ))2
g ∈G
i
• Prediction: On example x
ŷ = argmin gy (x)
y
Cost-Sensitive Learning
Multi-class prediction where different predictions incur different cost.
• Data (x, c) where c(y ) is cost for predicting label y on x.
• Training: With data (xi , ci )
X
gy = argmin (g (xi ) − ci (y ))2
g ∈G
i
• Prediction: On example x
ŷ = argmin gy (x)
y
• In VW:
vw --csoaa k
Cost-Sensitive Learning
Multi-class prediction where different predictions incur different cost.
• Data (x, c) where c(y ) is cost for predicting label y on x.
• Training: With data (xi , ci )
X
gy = argmin (g (xi ) − ci (y ))2
g ∈G
i
• Prediction: On example x
ŷ = argmin gy (x)
y
• In VW:
vw --csoaa k
0.08
0.06
0.04216 2 18 2 20 2 22 2 24 2 26
Number of Queries
c+ (y2 )
g? (y1 )
y1 y2 y3 y4
Cost Overlapped Active Learning (COAL)
On each xi
1. Compute version space Gi (y ) of good regressors for each y .
2. Compute cost ranges
c+ (y2 )
g? (y1 )
y1 y2 y3 y4
Properties
go
Approximate cost ranges
In practice, one pass, linear time.
1. Use online least squares optimization.
2. Compute cost range with sensitivity analysis.
3. Look for large weight w such that with new weighted example, loss
is still close
0.12 ImageNet 40
Passive
0.10 COAL (1e-1)
COAL (1e-2)
0.08 COAL (1e-3)
Test Cost
0.06
0.04
0.02212 2 13 2 14 2 15 2 16 2 17 2 18 2 19
Number of Queries
Experiments
0.12 ImageNet 40
Passive
0.10 COAL (1e-1)
COAL (1e-2)
0.08 COAL (1e-3)
Test Cost
0.06
0.04
0.02212 2 13 2 14 2 15 2 16 2 17 2 18 2 19
Number of Queries