Quantifying Location Privacy
Reza Shokri, George Theodorakopoulos, Jean-Yves Le Boudec, and Jean-Pierre Hubaux
Presented By:
Solomon Njorombe
Abstract
• Privacy concerns raised by advances in personal communication devices
• Many Location-Privacy Protection Mechanisms (LPPMs) proposed
• But no systematic quantification of their effectiveness, and incomplete assumptions
• This paper: a framework for analyzing LPPMs
• Models the information and attacks available to the adversary
• Formalizes attack performance
• Evaluates adversary inference attacks along accuracy, certainty, and correctness
• Implements the Location-Privacy Meter tool
• Assesses popular metrics (entropy and k-anonymity)
• They show low correlation with the adversary's success
Introduction
• Smartphones with location sensors: GPS / triangulation
• Convenient, but they leave traces of your whereabouts
• These allow inferences about habits, interests, relationships, secrets
• Increased computing power: data-mining algorithms, parallel database analysis
• A threat to privacy
• Users have the right to control the information they share
• Share minimal information, or only with trusted entities
Introduction: Motivation
Aim: advance the quantification of LPPM performance
• Why?
• Humans are bad estimators of risk
• We need a meaningful way to compare LPPMs
• The literature has not yet matured on this point
• The field lacks a unified, generic formal framework; contributions diverge and it is unclear which LPPM is more effective
Introduction: Contributions
1. A generic model to formalize adversarial attacks
• Defines tracking and localization on anonymous traces as statistical inference problems
2. Statistical methods to evaluate the performance of such inference attacks
• Expected estimation error as the right metric
3. The Location-Privacy Meter tool
4. Evidence of the inappropriateness of existing metrics
Framework
• Location privacy is a tuple ⟨𝒰, 𝒜, LPPM, 𝒪, ADV, METRIC⟩
𝓤: Set of mobile users
𝓐: Actual traces of the users
LPPM: Location-Privacy Preserving Mechanism
• Acts on a ∈ 𝒜 and produces o ∈ 𝒪
𝓞: Traces observed by the adversary
ADV: Adversary
• Tries to infer a having observed o, relying on knowledge of the LPPM and of the users' mobility models
METRIC: measures the performance and success of ADV, which in turn determines the users' location privacy
[Framework pipeline: 𝒰 → 𝒜 → LPPM → 𝒪 → ADV → METRIC]
Framework: Mobile Users
• 𝒰 = {u_1, u_2, …, u_N}: set of N mobile users within an area partitioned into M regions ℛ = {r_1, r_2, …, r_M}
• 𝒯 = {1, …, T}: discrete set of time instants at which users can be observed
• The spatiotemporal position of users is modeled through events and traces
• Event: a tuple ⟨u, r, t⟩, where u ∈ 𝒰, r ∈ ℛ, t ∈ 𝒯
• Trace of user u: a T-long vector of events a_u = (a_u(1), a_u(2), …, a_u(T)), e.g. a_u(1) = ⟨u, r_j, t_1⟩ and a_u(T) = ⟨u, r_x, t_T⟩
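To make the notation concrete, here is a minimal Python sketch of events and traces (an illustrative representation, not code from the paper or the tool):

```python
from dataclasses import dataclass
from typing import List

@dataclass(frozen=True)
class Event:
    user: int    # u in U
    region: int  # r in R
    time: int    # t in T

Trace = List[Event]  # a_u = (a_u(1), ..., a_u(T))

def actual_trace(user: int, regions: List[int]) -> Trace:
    """Build a T-long trace from a user's region sequence."""
    return [Event(user, r, t) for t, r in enumerate(regions, start=1)]
```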
Framework: Mobile Users
• 𝒜_u: set of all traces that may belong to user u
• Actual trace of u: the one true trace of u for the period t = 1…T, (a_u(1), a_u(2), …, a_u(T))
• Actual events: the events of the actual trace of user u, ⟨u, r_x, t_1⟩, ⟨u, r_e, t_2⟩, …, ⟨u, r_i, t_T⟩
• 𝒜 = 𝒜_{u_1} × 𝒜_{u_2} × ⋯ × 𝒜_{u_N}: set of all possible traces of all users
Framework: Location-Privacy Preserving Mechanisms (LPPM)
• LPPM: a mechanism that modifies and distorts actual traces before they are exposed
• Different implementations
• Offline (e.g. on a database) vs. online (on the fly)
• Centralized (a central anonymity server) vs. distributed (on users' phones)
• Receives the N actual traces and modifies them in two steps
• Obfuscation: each location is replaced with a location pseudonym from ℛ′ = {r′_1, r′_2, …, r′_{M′}}
• Anonymization: the user part of each trace is replaced with a user pseudonym from 𝒰′ = {u′_1, …, u′_{N′}}
Framework: Location-Privacy Preserving Mechanisms (LPPM)
• Obfuscated event: ⟨u, r′, t⟩, where u ∈ 𝒰, r′ ∈ ℛ′, t ∈ 𝒯
• Obfuscated trace: o_u = (o_u(1), o_u(2), …, o_u(T))
• 𝓞_u: set of all possible obfuscated traces of user u
Framework: Location-Privacy Preserving Mechanisms (LPPM)
• Obfuscation mechanism: a function that maps a trace a_u ∈ 𝒜_u to a random variable O_u taking values in the set 𝒪_u
• Probability density function: f_{a_u}(o_u) = Pr{O_u = o_u | A_u = a_u}
• Methods used by LPPMs to reduce the accuracy and/or precision of the events' spatiotemporal information:
• Perturbation
• Adding dummy regions
• Reducing precision (merging regions)
• Location hiding
Framework: Location-Privacy Preserving Mechanisms (LPPM)
• Anonymization mechanism: a function Σ chosen at random from the functions that map 𝒰 to 𝒰′
• Drawn according to the probability function g(σ) = Pr(Σ = σ)
• Here: a random permutation chosen uniformly among the N! possible permutations (see the sketch below)
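For instance, a uniformly random permutation over N users can be drawn as follows (an illustrative sketch, not the tool's code):

```python
import numpy as np

N = 20
sigma = np.random.permutation(N)  # uniform over all N! permutations
# sigma[u] is the pseudonym index assigned to user u
```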
Framework: Location-Privacy Preserving Mechanisms (LPPM)
• An LPPM is the pair (f, g):
• Obfuscation f: maps the set of actual traces {a_{u_1}, …, a_{u_N}} to a set of obfuscated traces {o_{u_1}, …, o_{u_N}}, an instantiation of the random variables {O_{u_1}, …, O_{u_N}}
• Anonymization g: maps the obfuscated traces {o_{u_1}, …, o_{u_N}} to the anonymized traces {o_{σ(u_1)}, …, o_{σ(u_N)}}
Framework: Location-Privacy Preserving Mechanisms (LPPM)
• Summarize the LPPM by the probability distribution that gives the probability of mapping a ∈ 𝒜 to o ∈ 𝒪 = 𝒪_1 × 𝒪_2 × ⋯ × 𝒪_N:
LPPM_a(o) = Pr{ ∩_{i=1}^{N} O_{Σ(u_i)} = o_{σ(u_i)} | ∩_{i=1}^{N} A_{u_i} = a_{u_i} }
• The adversary's aim is to reconstruct a, given o
• 𝓞_{σ(u)}: set of all observable traces of user u
Framework: Adversary
• Knows the anonymization and obfuscation probability distribution functions f and g
• Has access to training traces plus the users' public information
• From this information, constructs a mobility profile P^u for each user
• Given the LPPM (i.e. f and g), the users' profiles {(u, P^u)}, and the observed traces {o_1, o_2, …, o_N}, the attacker runs inference attacks, formulating objectives over 𝒰–ℛ–𝒯 (subsets of users, regions, and times)
Framework: Adversary
• Presence/absence disclosure attacks
• Infer the user-region relationship over time
1. Tracking attacks: ADV tries to recover the full or a partial sequence of a user's trace
2. Localization attacks: ADV targets a single event in a user's trace
• Meeting disclosure attacks
• ADV is interested in the proximity between two users (e.g. whether they meet at a given time)
• The paper's algorithms implement the general attack: recovering the traces of all users
Framework: Evaluation
• Traces are probabilistically generated
• Actual traces – probabilistic over user mobility profile
• Observed traces – probabilistic over LPPM
• Attack output can be
• Probability distribution of possible outcomes
• Most probable outcome
• Expected outcome under distribution of possible outcomes
• Any function of the actual trace
Framework: Evaluation
• φ(·): the function expressing the attacker's objective
• If its argument is a, then φ(a) is the correct answer to the attack
• 𝒳: set of values φ(·) can take for a given attack (e.g. the M regions, the N users, or the M^T possible traces of one user)
• But the attacker cannot obtain φ(a) exactly; the task is inherently probabilistic
• Best hope: extract all available information about it from the observed traces
Framework: Evaluation
• The extracted information takes the form Pr(x|o), x ∈ 𝒳
• x ranges over all possible values of φ(·) derivable from the observation o
• Uncertainty: the ambiguity of Pr(x|o) with respect to finding a unique answer (maximal under a uniform distribution)
• Inaccuracy: the difference between Pr(x|o) and P̂r(x|o)
• P̂r(x|o): the adversary's estimate, since ADV does not have infinite resources
• But uncertainty and inaccuracy do not quantify the user's privacy; correctness does
Framework: Evaluation
• Correctness: the distance between the result of the attack and the true answer
• Only correctness really matters for the user's privacy
• Accuracy and certainty need not be equivalent to correctness: consider a situation with insufficient training traces, where the adversary can be certain yet wrong
Framework: Evaluation
Accuracy: quantified with a confidence interval and a confidence level — the probability that P̂r(x|o), for some x, lies within the confidence interval. Pushing the confidence level to 1 for the true value x = x_c is prohibitively costly.
Certainty: quantified through entropy — concentrated vs. uniform Pr(x|o); higher entropy means lower certainty (a small code sketch follows):
H(X) = Σ_x Pr(x|o) · log(1 / Pr(x|o))
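A sketch of the certainty metric under this definition (function name and array layout are illustrative assumptions):

```python
import numpy as np

def certainty_entropy(posterior: np.ndarray) -> float:
    """H(X) = sum_x Pr(x|o) * log(1/Pr(x|o)); higher entropy = lower certainty."""
    p = posterior[posterior > 0]  # drop zero-probability outcomes to avoid log(0)
    return float(np.sum(p * np.log2(1.0 / p)))
```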
Framework: Evaluation
Correctness: quantified as the expected distance between the true value x_c and the estimate P̂r(x|o).
• If there is a distance ||·|| between members of 𝒳, the expected estimation error is
Σ_x P̂r(x|o) · ||x − x_c||
• If the distance is 0 iff x = x_c and 1 otherwise, the incorrectness reduces to the probability of error: 1 − P̂r(x_c|o)
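Both incorrectness formulas translate directly into code; this sketch assumes the posterior is given as parallel lists of probabilities and values (names are illustrative):

```python
def expected_estimation_error(posterior, values, x_true, dist):
    """Sum_x Pr(x|o) * ||x - x_c|| for an arbitrary distance `dist`."""
    return sum(p * dist(x, x_true) for p, x in zip(posterior, values))

def probability_of_error(posterior, values, x_true):
    """Special case of the 0/1 distance: 1 - Pr(x_c | o)."""
    return 1.0 - posterior[values.index(x_true)]
```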
Framework: Evaluation
• So correctness is the metric that determines the user's privacy
• The adversary does not know x_c and cannot observe this parameter
• Moreover, accuracy, certainty, and correctness are largely independent of one another
Location Privacy Meter
Location-Privacy Preserving Mechanisms
• Two obfuscation mechanisms are implemented (sketched in code below):
1. Precision reduction (merging regions): drop the low-order bits of the region identifier, e.g. drop µx and µy bits of the x and y coordinates
2. Location hiding: events are independently eliminated — the location is replaced by Ø with probability λh, the location-hiding level
• To import an LPPM into the tool, specify its probability functions by providing
• the anonymization function
• the obfuscation function
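A minimal sketch of the two mechanisms, assuming region identifiers are integer (x, y) grid coordinates (parameter names mirror the slides; this is not the tool's actual code):

```python
import random

def reduce_precision(x: int, y: int, mu_x: int, mu_y: int):
    """Merge regions by dropping mu_x / mu_y low-order bits of the coordinates."""
    return (x >> mu_x, y >> mu_y)

def hide_location(region, lambda_h: float):
    """Independently replace the location with None (the empty location) w.p. lambda_h."""
    return None if random.random() < lambda_h else region

def obfuscate_event(x: int, y: int, mu_x: int, mu_y: int, lambda_h: float):
    return hide_location(reduce_precision(x, y, mu_x, mu_y), lambda_h)
```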
Knowledge of the Adversary
• Adversary collects information about user mobility
• This may take the form of events, transitions, or full/partial traces
• It can be encoded as:
• traces, or
• a matrix of transition counts TC
• TC is an M × M matrix whose entry (i, j) counts the user's transitions from region r_i to r_j that are not already encoded in the traces
• The adversary also takes user mobility constraints into account
Knowledge of the Adversary
• ADV models user mobility as a Markov chain
• P^u: the transition matrix of user u's Markov chain
• P^u_{ij}: the probability that the user moves from region r_i to r_j in the next time slot
• Objective: construct P^u starting from the prior mobility information
• With the broader goals of:
• estimating the underlying Markov chain
• completing the training traces TT into estimated traces ET
• exploiting the convergence of Gibbs sampling (a simplified sketch follows)
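As a simplified stand-in for the Gibbs-sampling construction, a mobility profile can be estimated from the transition-count matrix TC with additive smoothing (an assumption made for illustration, not the paper's estimator):

```python
import numpy as np

def mobility_profile(TC: np.ndarray, alpha: float = 1.0) -> np.ndarray:
    """Row-normalize an M x M transition-count matrix with a Dirichlet(alpha)
    prior, so P[i, j] estimates Pr(next region = r_j | current region = r_i)."""
    counts = TC + alpha
    return counts / counts.sum(axis=1, keepdims=True)
```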
Tracking Attack
• ADV tries to reconstruct partial/complete actual traces
Maximum Likelihood Tracking Attack
• Objective: find the jointly most likely traces of all users, given the observed traces
• This search space has N!·M^{NT} elements, so a brute-force approach is impractical
Tracking Attack: Maximum Likelihood Tracking Attack
• Proceeds in two steps:
• Deanonymization
We cannot simply assign each user their most probable trace: multiple users might be assigned the same trace
Instead, compute the likelihood of every user-trace pair
Create an edge-weighted bipartite graph (the set of users on one side, the set of observed traces on the other)
Each edge weight is the corresponding user-trace likelihood
Find the maximum-weight assignment using the Hungarian algorithm (see the sketch below)
• De-obfuscation (next slide)
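A sketch of the deanonymization step, assuming a precomputed matrix of user-trace log-likelihoods; SciPy's linear_sum_assignment solves the maximum-weight bipartite assignment (Hungarian-style):

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def deanonymize(log_lik: np.ndarray) -> np.ndarray:
    """log_lik[u, i] = log Pr(observed trace i | profile of user u).
    Returns assignment[u] = index of the observed trace matched to user u,
    maximizing the joint likelihood over all one-to-one assignments."""
    users, traces = linear_sum_assignment(-log_lik)  # negate cost to maximize
    return traces
```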
Tracking Attack: Maximum Likelihood Tracking Attack
• De-obfuscation
• Uses the Viterbi algorithm, which maximizes the joint probability of the most likely trace (sketched below)
• Recursively compute the maximum probability of each region at each time up to T
• Since the interest is in the trace itself, keep back-pointers and backtrack
• Analogous to finding a shortest path in an edge-weighted directed graph whose vertex set is ℛ × 𝒯
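A log-domain sketch of this step, assuming a mobility profile P, per-time emission probabilities emit derived from the obfuscation function f, and an initial distribution init (all names are illustrative):

```python
import numpy as np

def viterbi(P, emit, init):
    """Most likely region sequence for one user.
    P[i, j]   : Pr(r_i -> r_j)                       (M x M)
    emit[t, j]: Pr(observed event at t | region r_j) (T x M)
    init[j]   : Pr(region at t=1 is r_j)             (M,)"""
    T, M = emit.shape
    logP = np.log(P + 1e-300)
    delta = np.log(init + 1e-300) + np.log(emit[0] + 1e-300)
    back = np.zeros((T, M), dtype=int)
    for t in range(1, T):
        scores = delta[:, None] + logP   # scores[i, j]: best path ending with i -> j
        back[t] = scores.argmax(axis=0)
        delta = scores.max(axis=0) + np.log(emit[t] + 1e-300)
    path = [int(delta.argmax())]
    for t in range(T - 1, 0, -1):        # backtrack through the stored pointers
        path.append(int(back[t][path[-1]]))
    return path[::-1]                    # region indices for t = 1..T
```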
Tracking Attack: Distribution Tracking Attack
• Computes the distribution over traces for each user, rather than only the most likely trace
• Uses the Metropolis-Hastings (MH) algorithm
• Draws samples from 𝒜 × Σ that are distributed according to the desired posterior distribution
• MH performs a random walk over the possible values of (a, σ)
• Can answer a wide range of 𝒰–ℛ–𝒯 questions, but is computationally very intensive
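A generic Metropolis-Hastings skeleton of such a random walk (the paper's sampler uses problem-specific proposals over joint (a, σ) states; this sketch assumes a symmetric proposal):

```python
import math
import random

def metropolis_hastings(log_post, propose, state, n_samples, burn_in=1000):
    """Draw samples whose stationary distribution is proportional to
    exp(log_post(state)); `propose` must be a symmetric proposal."""
    samples, lp = [], log_post(state)
    for i in range(burn_in + n_samples):
        cand = propose(state)
        lp_cand = log_post(cand)
        if random.random() < math.exp(min(0.0, lp_cand - lp)):  # accept w.p. min(1, ratio)
            state, lp = cand, lp_cand
        if i >= burn_in:
            samples.append(state)
    return samples
```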
Localization Attack
• Find the location of user u at time t
• Output: a distribution over possible regions, from which the most probable one can be selected
• The attacker first needs the de-anonymized observed trace (via the maximum-weight assignment)
• The posterior can then be computed with the Forward-Backward algorithm (sketched below)
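A sketch of the localization posterior via the Forward-Backward algorithm, reusing the P/emit/init conventions assumed in the Viterbi sketch:

```python
import numpy as np

def localize(P, emit, init, t):
    """Posterior Pr(region at time t | whole observed trace) for one user."""
    T, M = emit.shape
    fwd = np.zeros((T, M))
    bwd = np.ones((T, M))
    fwd[0] = init * emit[0]
    fwd[0] /= fwd[0].sum()
    for s in range(1, T):
        fwd[s] = (fwd[s - 1] @ P) * emit[s]
        fwd[s] /= fwd[s].sum()               # rescale to avoid underflow
    for s in range(T - 2, -1, -1):
        bwd[s] = P @ (emit[s + 1] * bwd[s + 1])
        bwd[s] /= bwd[s].sum()
    post = fwd[t] * bwd[t]
    return post / post.sum()
```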
Meeting Disclosure Attack
• Objective 1: given a pair of users u and v, a region r, and a time t, decide whether they meet there
• Computed as the product of the two users' location distributions for that event
• These are obtained through localization attacks
• Objective 2: given just a pair of users, how often they would have met, and where
• Also answered using localization attacks
• Objective 3: given a location and a time, the expected number of users present
• Again through localization attacks
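A sketch of these objectives, treating the per-user localization posteriors (e.g. from the hypothetical localize() above, as NumPy arrays) as independent:

```python
def meeting_probability(post_u, post_v, r):
    """Objective 1: Pr(u and v are both in region r at the given time)."""
    return post_u[r] * post_v[r]

def expected_meetings(posts_u, posts_v):
    """Objective 2: expected number of time instants at which u and v co-locate,
    summing the co-location probability over all regions and times."""
    return sum(float(pu @ pv) for pu, pv in zip(posts_u, posts_v))

def expected_presence(posts_at_rt):
    """Objective 3: expected number of users in region r at time t, given each
    user's posterior probability of being there."""
    return sum(posts_at_rt)
```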
Using The Tool: Evaluation of LPPMs
• Goals:
1. Use Location Privacy Meter to quantify effectiveness of LPPMs
2. Evaluate effectiveness of entropy and k-anonymity to quantify
location privacy
• Location samples: N = 20 users, 5-minute intervals over 8 hours (T = 96), Bay Area partitioned into M = 40 regions (a 5 × 8 grid)
• Privacy mechanisms:
• Precision reduction
• Anonymization via a random permutation (unique pseudonyms 1…N)
Using The Tool: Evaluation of LPPMs
• To consider the strongest adversary:
• Feed the Knowledge Constructor (KC) with the users' actual traces
• 𝒰–ℛ–𝒯 attack scenarios:
• LO-ATT (localization attack): for user u and time t, what is the user's location at t?
• MD-ATT (meeting disclosure attack): in how many of the T time instants are two given users in the same region?
• AP-ATT (aggregate presence attack): for a region r and time t, what is the expected number of users present in r at t?
• Metric: the adversary's incorrectness
Using The Tool: Evaluation of LPPMs
[Figure: LP_LO-ATT(u, t) for all users u and times t, under LPPMs parameterized by (µx, µy, λh); the metric is the adversary's incorrectness in locating users.]
Using The Tool: Evaluation of LPPMs
[Figure: LP_MD-ATT(u, v) for all pairs of users u, v, under LPPMs parameterized by (µx, µy, λh); the metric is the adversary's incorrectness on the number of meetings.]
Using The Tool: Evaluation of LPPMs
[Figure: LP_AP-ATT(r, t) for all regions r and times t, under LPPMs parameterized by (µx, µy, λh); the metric is the adversary's incorrectness on the number of users in a region.]
Using The Tool: Evaluation of LPPMs
[Figure: normalized entropy (y-axis) vs. users' location privacy (x-axis). ***: LPPM(2, 3, 0.9), strong mechanism; ···: LPPM(1, 2, 0.5), medium; ooo: LPPM(1, 0, 0.0), weak.]
Using The Tool: Evaluation of LPPMs
[Figure: normalized k-anonymity (y-axis) vs. users' location privacy (x-axis). ***: LPPM(2, 3, 0.9), strong mechanism; ···: LPPM(1, 2, 0.5), medium; ooo: LPPM(1, 0, 0.0), weak.]
Conclusion
• A unified formal framework to describe and evaluate a
variety of location-privacy preserving mechanisms with
respect to various inference attacks
• LPPM evaluation is modeled as an estimation problem, with expected estimation error as the proposed metric
• Designed Location-Privacy Meter tool to evaluate and
compare the location-privacy preserving mechanisms
Questions
Framework
𝓤 : Set of mobile users
𝓡 : Set of regions that partition the whole area
𝓣 : Time period under consideration
𝓐 : Set of all possible traces
𝓞 : Set of all observable traces
𝓤′ : Set of user pseudonyms
𝓡′ : Set of location pseudonyms
𝑵 : Number of users
𝑴 : Number of regions
𝑻 : Number of considered time instants
𝑵′ : Number of user pseudonyms
𝑴′ : Number of location pseudonyms
𝒇 : Obfuscation function
𝒈 : Anonymization function
𝒂𝒖 : Actual trace of user u
𝒐𝒖 : Obfuscated trace of user u
𝒐𝒊 : Observed trace of user with pseudonym i
𝓐𝒖 : Set of all possible (actual) traces of user u
𝓞𝒖 : Set of all possible obfuscated traces of user u
𝓞𝝈(𝒖) : Set of all observable traces of user u
𝑷𝒖 : Profile of user u
𝝋(·) : Attacker's objective
𝓧 : Set of values that 𝝋(·) can take