Professional Documents
Culture Documents
a map
Introduction
Disclosure risk
Risk measure
Example
Discussion
2
Introduction
NSIs often have Geographical output:
– Traditionally using tables (counts, magnitudes, …)
– Using administrative regions, defined by e.g.
- property (fire brigade area)
- size of ground area (grid)
- number of units (NUTS)
3
Introduction
NSIs often have Geographical output:
– Traditionally using tables (counts, magnitudes, …)
– Using administrative regions, defined by e.g.
- property (fire brigade area)
- size of ground area (grid)
- number of units (NUTS)
– Aggregated data plotted on map
- e.g. Electricity consumption enterprises
3
Introduction
NSIs often have Geographical output:
– Traditionally using tables (counts, magnitudes, …)
– Using administrative regions, defined by e.g.
- property (fire brigade area)
- size of ground area (grid)
- number of units (NUTS)
– Aggregated data plotted on map
- e.g. Electricity consumption enterprises
- Electricity consumption private dwellings
3
Introduction
Plotting a continuous variable on a map
Continuous in value
Not necessarily continuous in location
4
Introduction
Plotting a continuous variable on a map
Continuous in value
Not necessarily continuous in location
4
Introduction
Plotting a continuous variable on a map
Continuous in value
Not necessarily continuous in location
4
Some Notation
Population 𝒰 = {r , … , r } with r = (𝑥 , 𝑦 )
Measurements 𝑔 , … , 𝑔
Spatial distribution 𝑔(r) ∶ ℝ → ℝ
Total (energy consumption) in area 𝒜
𝐺(𝒜) = 𝑔(r) 𝑑r
𝒜
𝐺(𝒜) 𝐺(𝒜)
𝐺̄ (𝒜) = or 𝐺̄ (𝒜) =
||𝒜|| ∑ 𝟙(r ∈ 𝒜)
5
Disclosure scenario
Attacker scenario
Determine ‘hot-spots’
Zoom in into region of interest
Link value of spatial distribution to individuals
6
Disclosure scenario
Sub-scenarios:
External attacker
Attacker is not a population unit with location in area of hot-spot
and can only use information from the spatial distribution
Internal attacker
Attacker is a population unit with location in area of hot-spot and
can additionally use its own information to derive more accurate
information about another unit in that area
7
Risk measure
0, if 𝒜 ∩ 𝒰 = Ø
RC (𝒜; 𝑝) = (1 + 𝑝/100)𝑔 − 𝐺(𝒜), if 𝒜 ∩ 𝒰 = {r }
(1 + 𝑝/100)𝑔 + 𝑔 − 𝐺(𝒜), otherwise
where
8
Risk measure
Which area 𝒜 to be used to determine risk?
Note:
For any unit 𝑗 we can find an area 𝒜∗ such that
|𝐺(𝒜∗ ) − 𝑔 | 𝑝
<
𝑔 100
Practical approach
Calculate RC (𝒜; 𝑝) on the lowest level to which zooming is allowed
9
Example: risk vs resolution
75% 75%
p% criterium p% criterium
5% 5%
50% 50%
10% 10%
15% 15%
25% 25%
0% 0%
10
Example
𝐺̂ (𝒜) = 𝑔̂ (r) 𝑑r
𝒜
𝐺̂ (𝒜) < 𝑔
or
𝐺̂ (𝒜) > 0 while 𝒜 ∩ 𝒰 = Ø
11
Example
0, if 𝒜 ∩ 𝒰 = Ø
R̂ C, (𝒜; 𝑝) = (1 + 𝑝/100)𝑔̂ , − 𝐺̂ (𝒜), if 𝒜 ∩ 𝒰 = {r }
(1 + 𝑝/100)𝑔̂ , + 𝑔̂ , − 𝐺̂ (𝒜), otherwise
Alternatively, use
0, if 𝒜 ∩ 𝒰 = Ø
∗
R̂ C, (𝒜; 𝑝) = (𝑝/100)𝑔 − |𝐺̂ (𝒜) − 𝑔 |, if 𝒜 ∩ 𝒰 = {r }
(𝑝/100)𝑔 − |𝐺̂ (𝒜) − 𝑔 − 𝑔 |, otherwise
12
Example
0, if 𝒜 ∩ 𝒰 = Ø
R̂ C, (𝒜; 𝑝) = (1 + 𝑝/100)𝑔̂ , − 𝐺̂ (𝒜), if 𝒜 ∩ 𝒰 = {r }
(1 + 𝑝/100)𝑔̂ , + 𝑔̂ , − 𝐺̂ (𝒜), otherwise
Alternatively, use
0, if 𝒜 ∩ 𝒰 = Ø
∗
R̂ C, (𝒜; 𝑝) = (𝑝/100)𝑔 − |𝐺̂ (𝒜) − 𝑔 |, if 𝒜 ∩ 𝒰 = {r }
(𝑝/100)𝑔 − |𝐺̂ (𝒜) − 𝑔 − 𝑔 |, otherwise
12
Example: spatial smoothing
13
Example: spatial smoothing
13
Example: spatial smoothing
13
Example: risk vs bandwidth
𝑝%-rule with 𝑝 = 10
●
●
● ●
● ● ● ● ● ● ●
● ● ●
● ● ●
● ●
● ● ●
● ● ● ●
● ● ●
●
75% 75%
PC6 / zip code
●
●
resolution resolution
●
●
●
●
●
100m ●
100m
●
●
● ● ● ● ●
200m 200m
50% ●
● PC6 / zip code 50%
● 300m 300m
●
●
400m 400m
● 500m 500m
●
●
●
●
25% 25%
●
0% 0%
14
Discussion
15
Discussion
16