Private Learning and Sanitization: Pure vs. Approx

Private Learning and
Sanitization: Pure vs.
Approx. Differential Privacy
Uri Stemmer
Ben-Gurion University
Join work with Amos Beimel and Kobbi Nissim
Why Private Learners?
Database
z1
z2
z3
๏
zn-1
zn
Users
Learning
Algorithm
๐“
Government,
Businesses,
Researchers
(or)
Malicious
adversary
Often, this algorithmic task can be abstracted as a learning problem:
โ€ข Bank is interested in predicting (based on past customers) whether
new customers are good/bad credit
Approx. Differential Privacy
Dwork, McSherry, Nissim, Smith 2006
Changing one record does not change the output
distribution โ€œtoo muchโ€
z1
z2
z3
๏
zn-1
zn
๐“
z1
z2
zโ€™3
๐“ ๐‘บ๐Ÿ
๏
zn-1
zn
๐“
๐“ ๐‘บ๐Ÿ
Approx. Differential Privacy
Dwork, McSherry, Nissim, Smith 2006
Changing one record does not change the output
distribution โ€œtoo muchโ€
A (rand) algorithm ๐“ is ๐, ๐œน differentially
private if for all neighboring databases ๐‘บ๐Ÿ , ๐‘บ๐Ÿ
and for all sets of outputs ๐‘ญ:
๐๐ซ ๐“ ๐‘บ๐Ÿ โˆˆ ๐‘ญ โ‰ค โ‰ˆ
๐’†๐ โ‹… ๐๐ซ ๐“ ๐‘บ๐Ÿ โˆˆ ๐‘ญ + ๐œน
Approx.
Pure Differential Privacy
Dwork, McSherry, Nissim, Smith 2006
Changing one record does not change the output
distribution โ€œtoo muchโ€
A (rand) algorithm ๐“ is ๐, ๐œน differentially
private if for all neighboring databases ๐‘บ๐Ÿ , ๐‘บ๐Ÿ
and for all sets of outputs ๐‘ญ:
๐๐ซ ๐“ ๐‘บ๐Ÿ โˆˆ ๐‘ญ โ‰ค ๐’†๐ โ‹… ๐๐ซ ๐“ ๐‘บ๐Ÿ โˆˆ ๐‘ญ + ๐œน
Approx. Differential Privacy
Dwork, McSherry, Nissim, Smith 2006
Dwork, Kenthapadi, McSherry, Mironov, Naor 2006
Changing one record does not change the output
distribution โ€œtoo muchโ€
A (rand) algorithm ๐“ is ๐, ๐œน differentially
private if for all neighboring databases ๐‘บ๐Ÿ , ๐‘บ๐Ÿ
and for all sets of outputs ๐‘ญ:
๐๐ซ ๐“ ๐‘บ๐Ÿ โˆˆ ๐‘ญ โ‰ค ๐’†๐ โ‹… ๐๐ซ ๐“ ๐‘บ๐Ÿ โˆˆ ๐‘ญ + ๐œน
Our Results:
โ€ข Sample complexity of Private Learning and
Sanitization can be drastically smaller if we settle
for approximate differential privacy.
โ€ข Label Privacy [Chaudhuri and Hsu 2011]
Learning model with weakened privacy demands.
We settle the question of sample complexity: O(VC).
โ€“ Same as non-private learning.
โ€“ Not is this talk.
โ€ข Natural connection between Private Learning and
Sanitization, leads to lower bounds on Sanitization.
โ€“ Not in this talk.
What is Private Learning?
Kasiviswanathan, Lee, Nissim, Raskhodnikova, Smith 08
Definition:
PAC Learning
+
Differential Privacy
__________________
Private Learning
โ€œPACโ€ Model
โ€ข
[Valiant 84]
Domain ๐‘ฟ.
๐Ÿ
๐ŸŽ
๐‘ฟ
๐Ÿ ๐Ÿ โ€ฆ
๐Ÿ๐’…
โ€œPACโ€ Model
โ€ข
โ€ข
[Valiant 84]
Domain ๐‘ฟ.
Set ๐“’ of boolean functions over ๐‘ฟ.
- for example: ๐ˆ๐๐“๐„๐‘๐•๐€๐‹๐’…
๐Ÿ
๐ŸŽ
๐‘ฟ
๐Ÿ ๐Ÿ โ€ฆ
๐Ÿ๐’…
โ€œPACโ€ Model
โ€ข
โ€ข
[Valiant 84]
Domain ๐‘ฟ.
Set ๐“’ of boolean functions over ๐‘ฟ.
- for example: ๐ˆ๐๐“๐„๐‘๐•๐€๐‹๐’…
๐Ÿ
๐ŸŽ
๐‘ฟ
๐Ÿ ๐Ÿ โ€ฆ
๐Ÿ๐’…
โ€œPACโ€ Model
[Valiant 84]
โ€ข
โ€ข
Domain ๐‘ฟ.
Set ๐“’ of boolean functions over ๐‘ฟ.
โ€ข
Labeled sample.
- for example: ๐ˆ๐๐“๐„๐‘๐•๐€๐‹๐’…
๐Ÿ
๐ŸŽ
๐‘ฟ
๐Ÿ ๐Ÿ โ€ฆ
๐Ÿ๐’…
โ€œPACโ€ Model
[Valiant 84]
โ€ข
โ€ข
Domain ๐‘ฟ.
Set ๐“’ of boolean functions over ๐‘ฟ.
โ€ข
โ€ข
Labeled sample.
Output a classifier ๐’‰.
- for example: ๐ˆ๐๐“๐„๐‘๐•๐€๐‹๐’…
๐Ÿ
๐ŸŽ
๐‘ฟ
๐Ÿ ๐Ÿ โ€ฆ
๐Ÿ๐’…
โ€œPACโ€ Model
[Valiant 84]
โ€ข
โ€ข
Domain ๐‘ฟ.
Set ๐“’ of boolean functions over ๐‘ฟ.
โ€ข
โ€ข
Labeled sample.
Output a classifier ๐’‰.
- for example: ๐ˆ๐๐“๐„๐‘๐•๐€๐‹๐’…
๐Ÿ
๐ŸŽ
๐‘ฟ
๐Ÿ ๐Ÿ โ€ฆ
๐Ÿ๐’…
Related work in Private
Learning (partial list)
[BDMN 05] First private learning algorithms. SQ based.
[KLNRS 08] Define private learning, and showed:
Every class ๐“’ can be privately learned using ๐ฅ๐จ๐  ๐“’ labeled
samples.
[BKN 10] Sample complexity of private learning.
[CH 11] Learning in continuous domain, label privacy.
[CM 08, CMS 11, KST 12] Machine learning.
[BLR 08, DNRRV 09, โ€ฆ] Synthetic Data.
[DRV 10] Private Boosting.
Running Example: INTERVALd
๐’„๐’‹
1
๐’„๐’‹ ๐’™ = ๐Ÿ โŸบ ๐’™ < ๐’‹
0
1 2 3 โ€ฆ
Facts:
๐‘—
โ€ฆ 2๐‘‘
โ€ข non-private proper learner with ๐‘ถ ๐Ÿ samples.
โ€ข ๐-private proper learner: ๐šฏ ๐’… samples [BBKN 10].
Running Example: INTERVALd
๐’„๐’‹
1
๐’„๐’‹ ๐’™ = ๐Ÿ โŸบ ๐’™ < ๐’‹
0
1 2 3 โ€ฆ
๐‘—
โ€ฆ 2๐‘‘
Facts:
โ€ข non-private proper learner with ๐‘ถ ๐Ÿ samples.
โ€ข ๐-private proper learner: ๐šฏ ๐’… samples [BBKN 10].
We show:
๐, ๐œน -private proper learner with ๐Ÿ๐‘ถ
๐ฅ๐จ๐  โˆ— ๐’…
samples.
Privately Learning intervals:
Facts: Ideas and Intuition.
โ€ข non-private proper learner with ๐‘ถ ๐Ÿ samples.
โ€ข ๐-private proper learner: ๐šฏ ๐’… samples [BBKN 10].
We show:
๐, ๐œน -private proper learner with ๐Ÿ๐‘ถ
๐ฅ๐จ๐  โˆ— ๐’…
samples.
The Goal:
Given a labeled sample, choose
a concept with small error.
๐Ÿ
๐ŸŽ
๐‘ฟ
4-good interval ๐‘ฎ
Assume we can (privately) obtain an interval ๐‘ฎ โІ ๐‘ฟ s.t.
โ€ข
โ€ข
Contains โ€œa lotโ€ of ones, and โ€œa lotโ€ of zeroes.
Every interval ๐‘ฐ โІ ๐‘ฟ of length โ‰ค |๐‘ฎ|/๐Ÿ’ either does not
contain โ€œtoo manyโ€ ones or does not contain โ€œtoo manyโ€
zeroes.
๐Ÿ
๐ŸŽ
๐‘ฟ
4-good interval ๐‘ฎ
Assume we can (privately) obtain an interval ๐‘ฎ โІ ๐‘ฟ s.t.
โ€ข
โ€ข
Contains โ€œa lotโ€ of ones, and โ€œa lotโ€ of zeroes.
Every interval ๐‘ฐ โІ ๐‘ฟ of length โ‰ค |๐‘ฎ|/๐Ÿ’ either does not
contain โ€œtoo manyโ€ ones or does not contain โ€œtoo manyโ€
zeroes.
๐Ÿ
๐ŸŽ
๐‘ฟ
๐‘ฎ
4-good interval ๐‘ฎ
Assume we can (privately) obtain an interval ๐‘ฎ โІ ๐‘ฟ s.t.
โ€ข
โ€ข
Contains โ€œa lotโ€ of ones, and โ€œa lotโ€ of zeroes.
Every interval ๐‘ฐ โІ ๐‘ฟ of length โ‰ค |๐‘ฎ|/๐Ÿ’ either does not
contain โ€œtoo manyโ€ ones or does not contain โ€œtoo manyโ€
zeroes.
๐Ÿ
๐ŸŽ
๐‘ฟ
๐‘ฎ
4-good interval ๐‘ฎ
Assume we can (privately) obtain an interval ๐‘ฎ โІ ๐‘ฟ s.t.
โ€ข
โ€ข
Contains โ€œa lotโ€ of ones, and โ€œa lotโ€ of zeroes.
Every interval ๐‘ฐ โІ ๐‘ฟ of length โ‰ค |๐‘ฎ|/๐Ÿ’ either does not
contain โ€œtoo manyโ€ ones or does not contain โ€œtoo manyโ€
zeroes.
๐Ÿ
๐ŸŽ
๐‘ฟ
๐‘ฎ
4-good interval ๐‘ฎ
Assume we can (privately) obtain an interval ๐‘ฎ โІ ๐‘ฟ s.t.
โ€ข
โ€ข
Contains โ€œa lotโ€ of ones, and โ€œa lotโ€ of zeroes.
Every interval ๐‘ฐ โІ ๐‘ฟ of length โ‰ค |๐‘ฎ|/๐Ÿ’ either does not
contain โ€œtoo manyโ€ ones or does not contain โ€œtoo manyโ€
zeroes.
๐Ÿ
๐ŸŽ
๐‘ฟ
๐‘ฎ
4-good interval ๐‘ฎ
Assume we can (privately) obtain an interval ๐‘ฎ โІ ๐‘ฟ s.t.
โ€ข
โ€ข
Contains โ€œa lotโ€ of ones, and โ€œa lotโ€ of zeroes.
Every interval ๐‘ฐ โІ ๐‘ฟ of length โ‰ค |๐‘ฎ|/๐Ÿ’ either does not
contain โ€œtoo manyโ€ ones or does not contain โ€œtoo manyโ€
zeroes.
๐Ÿ
๐ŸŽ
๐‘ฟ
๐‘ฎ
4-good interval ๐‘ฎ โ‡’ done!
โ€ข
Divide ๐‘ฎ into 4 equal intervals, and define 5 โ€œequally spreadโ€
concepts in ๐‘ฎ.
โ€ข
At least one concept has small error.
๐’‰๐Ÿ
๐’‰๐Ÿ
๐’‰๐Ÿ‘
๐’‰๐Ÿ’
๐’‰๐Ÿ“
๐‘ฟ
๐‘ฎ
4-good interval ๐‘ฎ โ‡’ done!
โ€ข
Divide ๐‘ฎ into 4 equal intervals, and define 5 โ€œequally spreadโ€
concepts in ๐‘ฎ.
โ€ข
At least one concept has small error.
๐’‰๐Ÿ
๐’‰๐Ÿ
๐’‰๐Ÿ‘
๐’‰๐Ÿ’
๐’‰๐Ÿ“
๐‘ฟ
๐‘ฎ
4-good interval ๐‘ฎ โ‡’ done!
โ€ข
Divide ๐‘ฎ into 4 equal intervals, and define 5 โ€œequally spreadโ€
concepts in ๐‘ฎ.
โ€ข
โ€ข
At least one concept has small error.
Choose one using the Exp. Mechanism [McSherry and Talwar 07]
(requires ๐‘ถ ๐Ÿ samples).
๐’‰๐Ÿ
๐’‰๐Ÿ
๐’‰๐Ÿ‘
๐’‰๐Ÿ’
๐’‰๐Ÿ“
๐‘ฟ
๐‘ฎ
4-good interval ๐‘ฎ โ‡’ done!
โ€ข
Divide ๐‘ฎ into 4 equal intervals, and define 5 โ€œequally spreadโ€
concepts in ๐‘ฎ.
โ€ข
โ€ข
At least one concept has small error.
Choose one using the Exp. Mechanism [McSherry and Talwar 07]
(requires ๐‘ถ ๐Ÿ samples).
Conclusion: suffices to find a 4-good interval.
๐’‰๐Ÿ
๐’‰๐Ÿ
๐’‰๐Ÿ‘
๐’‰๐Ÿ’
๐’‰๐Ÿ“
๐‘ฟ
๐‘ฎ
Finding a 4-good interval
Assume we can (privately) obtain a Jโˆˆ โ„ s.t. there exists a 2-good
interval ๐‘ฎ of length J.
๐‘ฟ
Finding a 4-good interval
Assume we can (privately) obtain a Jโˆˆ โ„ s.t. there exists a 2-good
interval ๐‘ฎ of length J.
โ€ข
Divide ๐‘ฟ into intervals ๐‘จ๐’Š and ๐‘ฉ๐’Š of length 2J, where the
{๐‘ฉ๐’Š }โ€™s are right-shifted by J.
โ€ข
At least one interval contains ๐‘ฎ.
๐‘ฉ๐Ÿ
๐‘ฉ๐Ÿ
๐‘ฉ๐Ÿ‘
๐‘ฉ๐Ÿ’
๐‘ฉ๐Ÿ“
๐‘ฟ
๐‘จ๐Ÿ
๐‘จ๐Ÿ
๐‘จ๐Ÿ‘
๐‘จ๐Ÿ’
๐‘จ๐Ÿ“
๐‘จ๐Ÿ”
Finding a 4-good interval
Assume we can (privately) obtain a Jโˆˆ โ„ s.t. there exists a 2-good
interval ๐‘ฎ of length J.
โ€ข
Divide ๐‘ฟ into intervals ๐‘จ๐’Š and ๐‘ฉ๐’Š of length 2J, where the
{๐‘ฉ๐’Š }โ€™s are right-shifted by J.
โ€ข
At least one interval contains ๐‘ฎ.
๐‘ฉ๐Ÿ
๐‘ฉ๐Ÿ
๐‘ฉ๐Ÿ‘
๐‘ฉ๐Ÿ’
๐‘ฉ๐Ÿ“
๐‘ฎ
๐‘จ๐Ÿ
๐‘ฟ
๐‘จ๐Ÿ
๐‘จ๐Ÿ‘
๐‘จ๐Ÿ’
๐‘จ๐Ÿ“
๐‘จ๐Ÿ”
Finding a 4-good interval
Assume we can (privately) obtain a Jโˆˆ โ„ s.t. there exists a 2-good
interval ๐‘ฎ of length J.
โ€ข
Divide ๐‘ฟ into intervals ๐‘จ๐’Š and ๐‘ฉ๐’Š of length 2J, where the
{๐‘ฉ๐’Š }โ€™s are right-shifted by J.
โ€ข
At least one interval contains ๐‘ฎ.
๐‘ฉ๐Ÿ
๐‘ฉ๐Ÿ
๐‘ฉ๐Ÿ‘
๐‘ฉ๐Ÿ’
๐‘ฉ๐Ÿ“
๐‘ฎ
๐‘จ๐Ÿ
๐‘ฟ
๐‘จ๐Ÿ
๐‘จ๐Ÿ‘
๐‘จ๐Ÿ’
๐‘จ๐Ÿ“
๐‘จ๐Ÿ”
Finding a 4-good interval
Assume we can (privately) obtain a Jโˆˆ โ„ s.t. there exists a 2-good
interval ๐‘ฎ of length J.
โ€ข
Divide ๐‘ฟ into intervals ๐‘จ๐’Š and ๐‘ฉ๐’Š of length 2J, where the
{๐‘ฉ๐’Š }โ€™s are right-shifted by J.
โ€ข
At least one interval contains ๐‘ฎ.
๐‘ฉ๐Ÿ
๐‘ฉ๐Ÿ
๐‘ฉ๐Ÿ‘
๐‘ฉ๐Ÿ’
๐‘ฉ๐Ÿ“
๐‘ฎ
๐‘จ๐Ÿ
๐‘จ๐Ÿ
๐‘จ๐Ÿ‘
๐‘ฟ
๐‘จ๐Ÿ’
๐‘จ๐Ÿ“
๐‘จ๐Ÿ”
Finding a 4-good interval
Assume we can (privately) obtain a Jโˆˆ โ„ s.t. there exists a 2-good
interval ๐‘ฎ of length J.
โ€ข
Divide ๐‘ฟ into intervals ๐‘จ๐’Š and ๐‘ฉ๐’Š of length 2J, where the
{๐‘ฉ๐’Š }โ€™s are right-shifted by J.
โ€ข
At least one interval contains ๐‘ฎ.
๐‘ฉ๐Ÿ
๐‘ฉ๐Ÿ
๐‘ฉ๐Ÿ‘
๐‘ฉ๐Ÿ’
๐‘ฉ๐Ÿ“
๐‘ฎ
๐‘จ๐Ÿ
๐‘จ๐Ÿ
๐‘จ๐Ÿ‘
๐‘จ๐Ÿ’
๐‘จ๐Ÿ“
๐‘ฟ
๐‘จ๐Ÿ”
Finding a 4-good interval
Assume we can (privately) obtain a Jโˆˆ โ„ s.t. there exists a 2-good
interval ๐‘ฎ of length J.
โ€ข
โ€ข
โ€ข
โ€ข
Divide ๐‘ฟ into intervals ๐‘จ๐’Š and ๐‘ฉ๐’Š of length 2J, where the
{๐‘ฉ๐’Š }โ€™s are right-shifted by J.
โ€ข
At least one interval contains ๐‘ฎ.
Say ๐‘ฎ โˆˆ ๐‘จ๐Ÿ‘ . Then ๐‘จ๐Ÿ‘ contains โ€œlotsโ€ of ones and zeroes.
Every other ๐‘จ๐’Š cannot contain both ones and zeroes.
Look for ๐‘จ๐’Š with โ€œlotsโ€ of ones and zeroes.
๐‘ฉ๐Ÿ
๐‘ฉ๐Ÿ
๐‘ฉ๐Ÿ‘
๐‘ฉ๐Ÿ’
๐‘ฉ๐Ÿ“
๐‘ฎ
๐‘จ๐Ÿ
๐‘จ๐Ÿ
๐‘จ๐Ÿ‘
๐‘ฟ
๐‘จ๐Ÿ’
๐‘จ๐Ÿ“
๐‘จ๐Ÿ”
Finding a 4-good interval
Assume we can (privately) obtain a Jโˆˆ โ„ s.t. there exists a 2-good
interval ๐‘ฎ of length J.
โ€ข
Divide ๐‘ฟ into intervals ๐‘จ๐’Š and ๐‘ฉ๐’Š of length 2J, where the
{๐‘ฉ๐’Š }โ€™s are right-shifted by J.
โ€ข
At least one interval contains ๐‘ฎ.
๐‘ฉ๐Ÿ
๐‘ฉ๐Ÿ
๐‘ฉ๐Ÿ‘
๐‘ฉ๐Ÿ’
๐‘ฉ๐Ÿ“
๐‘ฟ
๐‘จ๐Ÿ
๐‘จ๐Ÿ
๐‘จ๐Ÿ‘
๐‘จ๐Ÿ’
๐‘จ๐Ÿ“
๐‘จ๐Ÿ”
Finding a 4-good interval
Assume we can (privately) obtain a Jโˆˆ โ„ s.t. there exists a 2-good
interval ๐‘ฎ of length J.
โ€ข
Divide ๐‘ฟ into intervals ๐‘จ๐’Š and ๐‘ฉ๐’Š of length 2J, where the
{๐‘ฉ๐’Š }โ€™s are right-shifted by J.
โ€ข
โ€ข
At least one interval contains ๐‘ฎ.
Choose an interval using Adist
๐‘ฉ๐Ÿ
๐‘ฉ๐Ÿ
[ST 2013]
๐‘ฉ๐Ÿ‘
(requires ๐‘ถ ๐Ÿ samples).
๐‘ฉ๐Ÿ’
๐‘ฉ๐Ÿ“
๐‘ฟ
๐‘จ๐Ÿ
๐‘จ๐Ÿ
๐‘จ๐Ÿ‘
๐‘จ๐Ÿ’
๐‘จ๐Ÿ“
๐‘จ๐Ÿ”
Finding a 4-good interval
Assume we can (privately) obtain a Jโˆˆ โ„ s.t. there exists a 2-good
interval ๐‘ฎ of length J.
โ€ข
Divide ๐‘ฟ into intervals ๐‘จ๐’Š and ๐‘ฉ๐’Š of length 2J, where the
{๐‘ฉ๐’Š }โ€™s are right-shifted by J.
โ€ข
โ€ข
โ€ข
At least one interval contains ๐‘ฎ.
Choose an interval using Adist
[ST 2013]
The chosen interval is of length ๐Ÿ ๐‘ฎ
๐‘ฉ๐Ÿ
๐‘ฉ๐Ÿ
๐‘ฉ๐Ÿ‘
(requires ๐‘ถ ๐Ÿ samples).
โŸน
4-good !
๐‘ฉ๐Ÿ’
๐‘ฉ๐Ÿ“
๐‘ฟ
๐‘จ๐Ÿ
๐‘จ๐Ÿ
๐‘จ๐Ÿ‘
๐‘จ๐Ÿ’
๐‘จ๐Ÿ“
๐‘จ๐Ÿ”
Finding a 4-good interval
Assume we can (privately) obtain a Jโˆˆ โ„ s.t. there exists a 2-good
interval ๐‘ฎ of length J.
โ€ข
Divide ๐‘ฟ into intervals ๐‘จ๐’Š and ๐‘ฉ๐’Š of length 2J, where the
{๐‘ฉ๐’Š }โ€™s are right-shifted by J.
โ€ข
โ€ข
โ€ข
At least one interval contains ๐‘ฎ.
Choose an interval using Adist
[ST 2013]
The chosen interval is of length ๐Ÿ ๐‘ฎ
(requires ๐‘ถ ๐Ÿ samples).
โŸน
4-good !
Conclusion: suffices to find a length J
๐‘ฉ๐Ÿ of a
๐‘ฉ๐Ÿ2-good
๐‘ฉ๐Ÿ‘ interval.
๐‘ฉ๐Ÿ’
๐‘ฉ๐Ÿ“
๐‘ฟ
๐‘จ๐Ÿ
๐‘จ๐Ÿ
๐‘จ๐Ÿ‘
๐‘จ๐Ÿ’
๐‘จ๐Ÿ“
๐‘จ๐Ÿ”
Computing the length J
Easy solution:
โ€ข Noisy binary search on ๐ŸŽ โ‰ค J โ‰ค ๐Ÿ๐’… .
โ€ข ๐’… noisy comparisons requires ๐’… samples.
Better solution:
โ€ข Noisy binary search on the power ๐ŸŽ โ‰ค ๐’‹ โ‰ค ๐ฅ๐จ๐  ๐‘ฟ of a 2-good
interval of length J= ๐Ÿ๐’‹ .
โ€ข ๐ฅ๐จ๐  ๐ฅ๐จ๐  ๐‘ฟ noisy comparisons requires ๐ฅ๐จ๐  ๐ฅ๐จ๐  ๐‘ฟ samples.
In the paper:
Use recursion on binary search and significantly reduce the
costs.
Computing the length J
Easy solution:
โ€ข Noisy binary search on ๐ŸŽ โ‰ค J โ‰ค ๐Ÿ๐’… .
โ€ข ๐’… noisy comparisons requires ๐’… samples.
Better solution:
โ€ข Noisy binary search on the power ๐ŸŽ โ‰ค ๐’‹ โ‰ค ๐ฅ๐จ๐  ๐’… of a 2-good
interval of length J= ๐Ÿ๐’‹ .
โ€ข ๐ฅ๐จ๐  ๐’… noisy comparisons requires ๐ฅ๐จ๐  ๐’… samples.
In the paper:
Use recursion on binary search and significantly reduce the
costs.
Computing the length J
Easy solution:
โ€ข Noisy binary search on ๐ŸŽ โ‰ค J โ‰ค ๐Ÿ๐’… .
โ€ข ๐’… noisy comparisons requires ๐’… samples.
Better solution:
โ€ข Noisy binary search on the power ๐ŸŽ โ‰ค ๐’‹ โ‰ค ๐ฅ๐จ๐  ๐’… of a 2-good
interval of length J= ๐Ÿ๐’‹ .
โ€ข ๐ฅ๐จ๐  ๐’… noisy comparisons requires ๐ฅ๐จ๐  ๐’… samples.
In the paper:
Use recursion on binary search and significantly reduce the
costs.
Theorem:
There exists an ๐œ–, ๐›ฟ -private learner for
โˆ—๐’…
๐ฅ๐จ๐ 
๐ˆ๐๐“๐„๐‘๐•๐€๐‹๐’… with sample complexity ๐Ÿ
.
Summary and Open Problems
โ€ข What we saw:
Efficient ๐, ๐œน -private learner for ๐ˆ๐๐“๐„๐‘๐•๐€๐‹๐’… with low
sample complexity.
โ€“ This separates the sample complexity of ๐œ–, ๐›ฟ -private and
๐œ–-private learners.
โ€ข Other results:
โ€“ Efficient ๐, ๐œน -private for other concept classes with
even lower sample complexity (independent of the
domain).
โ€“ Similar results for Data Sanitization.
โ€ข Open problem:
Lower bounds on the sample complexity of ๐, ๐œน -private
learners?