RANDOMNESS AND PSEUDORANDOMNESS Omer Reingold, Microsoft Research and Weizmann Randomness and Pseudorandomness When Randomness is Useful When Randomness can be reduced or eliminated – derandomization Basic Tool: Pseudorandomness An object is pseudorandom if it “looks random” (indistinguishable from uniform), though it is not. Expander Graphs Randomness In Computation (1) Can’t live without you Distributed computing (breaking symmetry) Cryptography: Secrets, Semantic Security, … Sampling, Simulations, … Randomness In Computation (2) You change my world Communication Complexity (e.g., equality) Routing (on the cube [Valiant]) drastically reduces congestion Randomness In Computation (3) Do I really need you? In algorithms – useful design tool, but many times can derandomize (e.g., PRIMES in P). Is it always the case? BPP=P means that every randomized algorithm can be derandomized with only polynomial increase in time RL=L means that every randomized algorithm can be derandomized with only a constant factor increase in memory In Distributed Computing Dining Philosophers: Don’t Attack breaking symmetry Byzantine Agreement Deterministic Randomized Synchronous t failures t+1 rounds O(1) Asynchronous impossible possible Attack Now Randomness Saves Communication Deterministic: need to send the entire file! Randomness in the Sky: O(1) bits (or log in 1/error) Private Randomness: Logarithmic number of bits (derandomization). Original File ? = Copy In Cryptography Private Keys: no randomness no secrets and no identities Encryption: two encryptions of same message with same key need to be different Randomized (interactive) Proofs: Give rise to wonderful new notions: Zero-Knowledge, PCPs, … Random Walks and Markov Chains When in doubt, flip a coin: Explore graph: minimal memory Page Rank: stationary distribution of Markov Chains Sampling vs. Approx counting. Estimating size of Web Simulations of Physical Systems … Shake Your Input Communication network (n-dimensional cube) Every deterministic routing scheme will incur exponentially busy links (in worse case) Valiant: To send a message from x y, select node z at random, send x z y. Now: O(1) expected load for every edge Another example – randomized quicksort Smoothed Analysis: small perturbations, big impact In Private Data Analysis Hide Presence/Absence of Any Individual How many people in the database have the BC1 gene? Add random noise to true answer distributed as Lap(/) More questions? More privacy? Need more noise. ratio bounded -4 -3 -2 - 0 2 3 4 Randomness and Pseudorandomness When Randomness is Useful When Randomness can be reduced or eliminated – derandomization Basic Tool: Pseudorandomness An object is pseudorandom if it “looks random” (indistinguishable from uniform), though it is not. Expander Graphs Cryptography: Good Pseudorandom Generators are Crucial With them, we have one-time pad (and more): plaintext data: 00001111 Without, keys are bad, algorithms are worthless (theoretical & practical) short key K0: 110 derived key K: 01100100 E D ciphertext = plaintext K = 01101011 plaintext data: (ciphertext K) = 00001111 Data Structures & Hash Functions Linear Probing: Bob F(Bob) Alice Mike Bob Mary If F is random then insertion time and query time are O(1) (in expectation). But where do you store a random function ?!? Derandomize! Heuristic: use SHA1, MD4, … Recently (2007): 5-wise independent functions are sufficient* Similar considerations all over: bloom filters, cuckoo hashing, bit-vectors, … Weak Sources & Randomness Extractors Available random bits are biased and correlated Von Neumann sources: b1 b2 … bi … are i.i.d. 0/1 variables and bi =1 with some probability p < 1 then translate 01 1 10 0 Randomness Extractors produce randomness from general weak sources, many other applications Algorithms: Can Randomness Save Time or Memory? Conjecture - No* (*moderate overheads may still apply) Examples of derandomization: Primality Testing in Polynomial Time Graph Connectivity logarithmic Memory Holdouts: Identity testing, approximation algorithms, … (Bipartite) Expander Graphs N S, |S| K N D |(S)| A |S| (A > 1) Important: every (not too large) set expands. (Bipartite) Expander Graphs N S, |S| K N D |(S)| A |S| (A > 1) Main goal: minimize D (i.e. constant D) • Degree 3 random graphs are expanders! [Pin73] (Bipartite) Expander Graphs N S, |S| K N D |(S)| A |S| (A > 1) Also: maximize A. Trivial upper bound: A D even A ≲ D-1 Random graphs: AD-1 Applications of Expanders These “innocent” looking objects are intimately related to various fundamental problems: Network design (fault tolerance), Sorting networks, Complexity and proof theory, Derandomization, Error correcting codes, Cryptography, Ramsey theory And more ... Non-blocking Network with On-line Path Selection [ALM] N (Inputs) N (Outputs) Depth O(log N), size O(N log N), bounded degree. Allows connection between input nodes and output nodes using vertex disjoint paths. Non-blocking Network with On-line Path Selection [ALM] N (Inputs) N (Outputs) Every request for connection (or disconnection) is satisfied in O(log N) bit steps: • On line. Handles many requests in parallel. The Network N (Inputs) N (outputs) “Lossless” Expander Slightly Unbalanced, “Lossless” Expanders N S, |S| K M= N D |(S)| 0.9 D |S| 0< 1 is an arbitrary constant D is constant & K= (M/D) = (N/D). [CRVW 02]: such expanders (with D = polylog(1/)) Property 1: A Very Strong Unique Neighbor Property S, |S| K, |(S)| 0.9 D |S| S Unique neighbor of S Non Unique neighbor S has 0.8 D |S| unique neighbors ! Using Unique Neighbors for Distributed Routing Task: match S to its neighbors (|S| K) S S` Step I: match S to its unique neighbors. Continue recursively with unmatched vertices S’. Reminder: The Network Adding new paths: think of vertices used by previous paths as faulty. Property 2: Incredibly Fault Tolerant S, |S| K, |(S)| 0.9 D |S| Remains a lossless expander even if adversary removes (0.7 D) edges from each vertex. Simple Expander Codes [G63,Z71,ZP76,T81,SS96] N (Variables) M= N (Parity Checks) 1 0 0 + + 1 +0 1 + Linear code; Rate 1 – M/N = (1 - ). Minimum distance K. Relative distance K/N= ( / D) = / polylog (1/). For small beats the Zyablov bound and is quite close to the Gilbert-Varshamov bound of / log (1/). Simple Decoding Algorithm in Linear Time (& log n parallel phases) [SS 96] N (Variables) Error set B, |B| K/2 1 0 10 |Flip\B| |B|/4 0 1 |B\Flip| |B|/4 |Bnew| |B|/2 1 M= N (Constraints) + + 0 1 1 |(B)| > .9 D |B| + 0 |(B)Sat|< .2 D|B| + 0 Algorithm: At each phase, flip every variable that “sees” a majority of 1’s (i.e, unsatisfied constraints). Random Walk on Expanders [AKS 87] x1 x0 ... x2 xi xi converges to uniform fast (for arbitrary x0). For a random x0: the sequence x0, x1, x2 . . . has interesting “random-like” properties. Thanks
© Copyright 2026 Paperzz