Professional Documents
Culture Documents
Presentation PSD 2016
Presentation PSD 2016
Introduc on
Smoothing methods
Examples
Discussion
2
Introduction
3
Introduction
3
Introduction
4
Introduction
5
Introduction
Tradi onally
— Produce safe tables and plot on a map
— Perturb microdata and plot on a map
6
Introduction
Tradi onally
— Produce safe tables and plot on a map
— Perturb microdata and plot on a map
6
Some de initions
What we es mate:
Spa al distribu on of rela ve frequencies
7
Some de initions
8
Approach
9
Smoothing
10
Kernel Density Estimator (KDE)
where
1 𝑣 scores on 𝑐
𝟙 (𝑣 ) =
0 otherwise
and
𝑘𝑛𝑛(𝑥, 𝑦) the set of 𝑘 observa ons nearest to (𝑥, 𝑦)
12
Additional settings
kNN related
— Restrict the search area for 𝑘 nearest neighbors to
observa ons with distance no more than 𝑀 meters
— Otherwise areas with no popula on could s ll show posi ve
rela ve frequencies
— Loca ons with less than 𝑘 nearest neighbors within 𝑀
meters, get an undefined frac on assigned
General
— Par on [0, 𝑓 ] into at most 5 levels
— Generally accepted conven on in data-visualiza on
— Easier to ‘top-code’
13
Examples
14
KDE (h = 50m)
15
KDE (h = 100m)
16
kNN (k = 20, distance ≤ 250m)
17
Discussion
U lity related
— ‘Oversmoothing’ may hide or remove spa al pa erns
— How to objec vely measure u lity of such maps?
18
Discussion
Anomalies
— Probability mass may leak into ‘empty’ areas (rivers, lakes,
woods, …)
— False sensi ve loca ons: leaking addi onal mass to
non-sensi ve loca ons
— Dislocated modes: two adjacent modes may be blend into
single mode in between
— kNN shows ar ficial boundaries
19
Discussion
Future work?
— Variable bandwidth
— Automa c bandwidth selec on
— Boundary kernels
— Disclosure risk measures for spa al distribu on plots
— U lity measures for spa al distribu on plots
Thank you
20