Advisor : Huilan Chang Student : Yi-Lin Tsai Department of Applied Mathematics National University of Kaohsiung 2014/08/02 Outline Introduction • Group testing • Group testing with consecutive positives • Threshold group testing Main result • Sequential algorithm for T.G.T.C • Nonadaptive algorithm for T.G.T.C Concluding Reference 2 Classical group testing • Given a set 𝒩 of 𝑛 items, each is either positive or negative, and a set 𝒟 of at most 𝑑 positives. • Goal : identify all positives by group test. • Group Test : a test on a subset S ⊆ 𝒩. positive negative Positive outcome: S contains at least one positive item. 3 Types of algorithm • Sequential algorithm : A test can be specified after the previous test outcome. • Nonadaptive algorithm : All test are specified beforehand and are conducted simultaneously. items 𝑐1 𝑐2 𝑐3 𝑐4 𝑐5 𝑐6 𝑐7 𝑐8 𝑐9 𝑡1 1 1 0 1 0 1 0 0 0 𝑡2 0 1 0 0 1 1 0 1 0 𝑡3 1 0 1 0 0 1 1 1 0 𝑡4 1 0 0 1 0 0 1 0 1 tests 𝑡5 𝑡6 0 1 1 0 1 0 0 1 0 1 1 0 0 1 0 1 1 0 4 Types of algorithm • Sequential algorithm : A test can be specified after the previous test outcome. • Nonadaptive algorithm : All test are specified beforehand and are conducted simultaneously. items 𝑐1 𝑐2 𝑐3 𝑐4 𝑐5 𝑐6 𝑐7 𝑐8 𝑐9 𝑡1 1 1 0 1 0 1 0 0 0 1 𝑡2 0 1 0 0 1 1 0 1 0 0 Outcome 𝑡3 1 0 1 0 0 1 1 1 0 0 vector 𝑡4 1 0 0 1 0 0 1 0 1 1 𝑡5 0 1 1 0 1 0 0 1 0 0 tests 𝑡6 1 1 0 0 1 0 1 1 0 0 4 Consecutive model • 𝒩 = 𝑎1 , 𝑎2 , 𝑎3 , … 𝑎𝑛 is a set of items with the linear order 𝑎𝑖 ≺ 𝑎𝑖+1 for 1 ≤ 𝑖 < 𝑛. • 𝒟 : is a set of positive items which is consecutive (under the ordering ≺), and contains at most 𝒅 items. • Test : choose arbitrary subset 𝑆 of 𝒩. 5 Consecutive model • Balding and Torney (1997) and Colbourn (1999) first studied this model. • Colbourn (1999) sequential : nonadaptive : 𝐥𝐨𝐠 𝟐 𝒏 + 𝐥𝐨𝐠 𝟐 𝒅 + 𝒄 𝐥𝐨𝐠 𝟐 𝒏 𝒅−𝟏 + 𝟐𝒅 + 𝟏 • Muller and Jimbo (2004) nonadaptive : 𝐥𝐨𝐠 𝟐 • Juan and Chang (2008) sequential : 𝒏 𝒅−𝟏 + 𝟐𝒅 − 𝟏 Lower bound : 𝐥𝐨𝐠 𝟐 (𝒅𝒏 + 𝟏 − 𝒅(𝒅 − 𝟏)/𝟐) 𝐥𝐨𝐠 𝟐 𝒏𝒅 + 𝟏, for 𝑛 ≥ 𝑑 − 1 6 Threshold group testing • Peter Damaschke (2006) arbitrary answer negative 𝒍 lower threshold positive 𝒖 upper threshold 7 Threshold group testing • Peter Damaschke (2006) 𝒈 𝒍 lower threshold 𝒖 upper threshold • 𝑔 =𝑢−𝑙−1 • If 𝑔 = 0 then we can find all positives. If 𝑔 > 0 then we can only find a 𝑔-approximate set. 7 Threshold group testing • A set 𝑃 is called 𝑔-approximate if 𝒟\𝑃 ≤ 𝑔 and 𝑃\𝒟 ≤ 𝑔. EX1 Let 𝑢 = 3, 𝑙 = 1 ⇒ 𝑔 = 1. a c b d 𝟏-approximate set e EX2 The classical group testing is the case of 𝑢 = 1, 𝑙 = 0. 8 Our work Group testing with consecutive positives Threshold group testing Threshold group testing with consecutive positives 9 Our work Threshold group testing with consecutive positives • Lower bound 𝐥𝐨𝐠 𝟐 𝒏(𝒅 − 𝒖 + 𝟏) − 𝟏, 𝑛 ≥ 𝑑 + 𝑢 − 2. • Sequential algorithm 𝐥𝐨𝐠 𝟐 𝒏/𝒖 − 𝟏 + 𝟐 𝐥𝐨𝐠 𝟐 (𝒖 + 𝟐) + 𝐥𝐨𝐠 𝟐 (𝒅 − 𝒍) − 𝟐. • Nonadaptive algorithm 𝟏𝟓 𝐥𝐨𝐠 𝟐 𝒏𝒅 + 𝟒𝒅 + 𝟕𝟏 and decoding complexity : 𝐎(𝒏𝒅 𝒍𝒐𝒈𝟐 𝒏𝒅 + 𝒖𝒅𝟐 ). 10 Sequential algorithm Nonadaptive algorithm Sequential algo. for T.G.T.C Recall : at most 𝑑 positives … It is usually assumed that 𝒖 ≤ 𝓓 ≤ 𝒅. 12 Sequential algo. for T.G.T.C Information-theoretic lower bound : Proposition (Chang and Tsai, 2014) If 𝑛 ≥ 𝑑 + 𝑢 − 2, then the number of group tests required to identify all positive items from 𝒩 is at least 𝐥𝐨𝐠 𝟐 𝒏(𝒅 − 𝒖 + 𝟏) − 𝟏. 13 Sequential algo. for T.G.T.C Our job : Provide an algorithm to locate all positive items from linear order 𝒩 and compare with the lower bound. at most 𝑑 positives … 14 Sequential algo. for T.G.T.C Our job : Provide an algorithm to locate all positive items from linear order 𝒩 and compare with the lower bound. at most 𝑑 positives … 14 Sequential algo. for T.G.T.C Our job : Provide an algorithm to locate all positive items from linear order 𝒩 and compare with the lower bound. at most 𝑑 positives … min(𝒟) max(𝒟) ≥ 𝒖 → positive. We start with the case gap 𝑔 = 0. < 𝒖 → negative. 14 Threshold without gap Theorem 1 (Chang and Tsai, 2014) For gap-free T.G.T.C, all positives can be identified in 𝐥𝐨𝐠 𝟐 𝒏/𝒖 − 𝟏 + 𝟐 𝐥𝐨𝐠 𝟐 (𝒖 + 𝟐) + 𝐥𝐨𝐠 𝟐 (𝒅 − 𝒖 + 𝟏) − 𝟐 tests. 15 Threshold without gap Proof of Theorem 1 • First partition 𝒩 into 𝑛/𝑢 parts of 𝒖 consecutive items and add some dummy negative items to the last part. dummy items 𝓝 𝑋1 𝑋2 𝑋3 𝑋4 ⋯⋯⋯⋯ 𝑋 𝑛/𝑢 ⋯⋯⋯⋯ 𝑋 𝑛/𝑢 • Let 𝒳 = 𝑋1 , 𝑋2 , ⋯ , 𝑋 𝑛/𝑢 . • Goal : find min(𝒟). 𝓝 𝑋1 𝑋2 𝑋3 𝑋4 𝑋5 Algorithm 1 and Algorithm 2 16 Threshold without gap Proof of Theorem 1 After Algorithm 1, 2, we have : 𝑋𝑖 𝑋𝑖+1 𝓝 min(𝒟) Next, find max(𝒟) : 𝑥𝑖 𝑃 = 𝑥𝑖+𝑢 ↑𝑑−𝑢 𝒖 Apply a binary search algorithm to 𝑃 where each group test is composed of 𝑢 consecutive items. 17 Algorithm 1 FIND-TWO-CANDIDATES 𝓝 𝑋1 𝑋2 𝑋3 𝑋4 𝑋5 𝑋6 ⋯⋯⋯⋯ 𝑋 𝑛/𝑢 Positive : Negative : 18 Threshold without gap Proof of theorem 1 Lemma 1 (Chang and Tsai, 2014) FIND-TWO-CANDIDATES returns 𝑋𝑖 , 𝑋𝑖+1 that min 𝒟 ∈ 𝑋𝑖 ∪ 𝑋𝑖+1 in 𝐥𝐨𝐠 𝟐 𝒏/𝒖 − 𝟏 tests. 19 Algorithm 2 LOCATE-STARTER 𝑋𝑖 𝑋𝑖+1 𝒖 + 𝑋𝑖 𝑋𝑖+1 𝒖 − 𝑋𝑖 𝑋𝑖+1 𝒖 20 Algorithm 2 LOCATE-STARTER 𝑋𝑖 𝑋𝑖+1 𝒖 − + 𝑋𝑖 𝑋𝑖+1 𝑋𝑖 𝑋𝑖+1 𝒖 𝒖 + 𝑋𝑖 𝑋𝑖+1 𝒖 − 𝑋𝑖 𝑋𝑖+1 𝒖 20 Threshold without gap Proof of theorem 1 Lemma 2 (Chang and Tsai, 2014) LOCATE-STARTER can identify min(𝒟) from 𝑋𝑖 ∪ 𝑋𝑖+1 in 𝟐 𝐥𝐨𝐠 𝟐 (𝒖 + 𝟐) − 𝟐 tests. 21 Threshold without gap Theorem 1 (Chang and Tsai, 2014) For gap-free T.G.T.C, all positives can be identified in 𝐥𝐨𝐠 𝟐 𝒏/𝒖 − 𝟏 + 𝟐 𝐥𝐨𝐠 𝟐 (𝒖 + 𝟐) + 𝐥𝐨𝐠 𝟐 (𝒅 − 𝒖 + 𝟏) − 𝟐 tests. 22 Threshold with gap Theorem 2 (Chang and Tsai, 2014) For T.G.T.C with 𝑔 > 0, a 𝑔-consecutive-approximate set can be identified in 𝐥𝐨𝐠 𝟐 𝒏/𝒖 − 𝟏 + 𝟐 𝐥𝐨𝐠 𝟐 (𝒖 + 𝟐) + 𝐥𝐨𝐠 𝟐 (𝒅 − 𝒍) − 𝟐 tests. 23 Sequential algorithm Nonadaptive algorithm Nonadaptive algo. for T.G.T.C Recall : 𝑡1 𝑡2 𝑡3 𝑡4 𝑡5 𝑡6 𝑐1 1 0 1 1 0 1 𝑐2 1 1 0 0 1 1 𝑐3 0 0 1 0 1 0 𝑐4 1 0 0 1 0 0 𝑐5 0 1 0 0 1 1 𝑐6 1 1 1 0 0 0 𝑐7 0 0 1 1 0 1 𝑐8 0 1 1 0 1 1 𝑐9 0 0 0 1 0 0 1 0 0 1 0 0 25 Consecutive-disjunct matrix Definition 1 (Chang, Chiu and Tsai, 2014) A binary matrix is (𝒓, 𝒘]-consecutive-disjunct if for any 𝑤 cyclically consecutive columns 𝐶1 , ⋯ , 𝐶𝑤 and other 𝑟 cyclically consecutive columns 𝐶𝑤+1 , ⋯ , 𝐶𝑤+𝑟 , there exists one row intersecting 𝐶1 , ⋯ , 𝐶𝑤 but none of 𝐶𝑤+1 , ⋯ , 𝐶𝑤+𝑟 . 𝒘 𝒓 1111111 000000000 𝒕 𝒏, 𝒓, 𝒘 ≔ the minimum number of rows among of all (𝑟, 𝑤]-consecutive-disjunct matrices of 𝑛 columns. 26 Consecutive-disjunct matrix • Probabilistic method Lovasz Local Lemma (1974) • Greedy construction Lovasz-Stein Theorem (1975) 27 Probabilistic method Lemma 3 (Lovasz Local Lemma) 1. 𝐴𝑖 ∶ event. 2. For each 𝐴𝑖 ∶ is dependent of at most 𝜇 events. 3. 𝑃𝑟(𝐴𝑖 ) ≤ 𝑝 for all 1 ≤ 𝑖 ≤ 𝑛. If 𝑒𝑝 𝜇 + 1 ≤ 1, then 𝑃𝑟 𝐴1 , 𝐴2 , ⋯ , 𝐴𝑛 > 0. 28 Probabilistic method Theorem 3 (Chang, Chiu and Tsai, 2014) ∀ 𝑟, 𝑤, 𝑛 > 0 with 𝑛 ≥ 3𝑟 + 2𝑤 − 1 and 𝑟 ≥ 𝑤, 𝑡 𝑛, 𝑟, 𝑤 ≤ 2 + 𝑤2 4𝑟 + 4𝑤 − 4 𝑛 − 6 𝑟 1 𝑟 𝑤 (𝑟 + 1) 1 + 𝑟 ln −13𝑤𝑟 + 12𝑟 + 13𝑤 − 5 +1 . Example. 27 𝑡 𝑛, 2, 1 ≤ 27 ln 8𝑛 − 24 + 4 4. 81 𝑡 𝑛, 2, 2 ≤ 81 ln 12𝑛 − 55 + . 4 4 29 Greedy construction Theorem 7 (Chang, Chiu and Tsai, 2014) ∀ 𝑟, 𝑛 > 0 with 𝑤 ≥ 1 and 𝑛 ≥ 𝑤 2 𝑘, 𝑡 𝑛, 𝑟, 𝑤 < 𝑒 2 𝑘 𝑤 ln 𝑘−1 2 𝑤 𝑘−2 𝑛 − 𝑘 𝑛+ 𝑟−1 𝑤−1 𝑘2 +1 , where 𝑘 = 𝑟 + 𝑤. Example. 𝑡 𝑛, 2, 1 < 9𝑒 2 ln((2/9)𝑛2 − (1/3) 𝑛 + 1) 𝑡 𝑛, 2, 2 < 16𝑒 2 ln((3/16)𝑛2 − 𝑛 + 1 + 1) 30 Nonadaptive algo. for T.G.T.C Goal : Identify a 𝑔-approximate set. 𝑑 𝑑 𝑑 𝑑 𝑑 𝑑 𝑑 ⋯ 31 Nonadaptive algo. for T.G.T.C Apply a (2,2]-consecutive-disjunct matrix with 𝑛 𝑑 columns. Given 1 1 0 1 0 0 𝑃 0 1 1 = 3 𝑎𝑛𝑑 𝑢 1 1 0 0 1 1 1 0 1 =2 1 1 0 0 1 1 1 0 1 32 Nonadaptive algo. for T.G.T.C Theorem 8 (Chang, Chiu and Tsai, 2014) For T.G.T.C with 𝑢 > 1, nonadaptive algorithm can identify a 𝑔-approximate set in 𝟏𝟓 𝐥𝐨𝐠 𝟐 𝒏𝒅 + 𝟒𝒅 + 𝟕𝟏 tests. Furthermore, the decoding complexity is 𝑶(𝒏𝒅 𝐥𝐨𝐠𝟐 𝒏𝒅 + 𝒖𝒅𝟐 ). Proof. 𝑡 𝑛 𝑛 81 , 2, 2 + 4𝑑 ≤ 4 ln 12 − 55 + 81 + 4𝑑 4 𝑑 𝑑 𝑛 ≤ 15 log 2 𝑑 + 4𝑑 + 71 . 33 Nonadaptive algo. for T.G.T.C Theorem 9 (Chang, Chiu and Tsai, 2014) For G.T.C, nonadaptive algorithm can identify all positives in 𝟓 𝐥𝐨𝐠 𝟐 𝒏𝒅 + 𝟐𝒅 + 𝟐𝟏 tests. Furthermore, the decoding complexity is 𝑶(𝒏𝒅 𝐥𝐨𝐠𝟐 𝒏𝒅 + 𝒅𝟐 ). Proof. 𝑡 𝑛 𝑛 27 , 2, 1 + 2𝑑 ≤ 4 ln 8 − 24 + 27 + 2𝑑 4 𝑑 𝑑 𝑛 ≤ 5 log 2 𝑑 + 2𝑑 + 21 . 34 Concluding Threshold group testing with consecutive positives • Lower bound 𝐥𝐨𝐠 𝟐 𝒏(𝒅 − 𝒖 + 𝟏) − 𝟏, 𝑛 ≥ 𝑑 + 𝑢 − 2. • Sequential algorithm 𝐥𝐨𝐠 𝟐 𝒏/𝒖 − 𝟏 + 𝟐 𝐥𝐨𝐠 𝟐 (𝒖 + 𝟐) + 𝐥𝐨𝐠 𝟐 (𝒅 − 𝒍) − 𝟐. • Nonadaptive algorithm 𝟏𝟓 𝐥𝐨𝐠 𝟐 𝒏𝒅 + 𝟒𝒅 + 𝟕𝟏 and decoding complexity : 𝐎(𝒏𝒅 𝐥𝐨𝐠𝟐 𝒏𝒅 + 𝒖𝒅𝟐 ). 35 References 1. D. J. Balding and D. C. Torney, The design of pooling experiments for screening a clone map, Fungal Genet. Biol. 21 (1997) 302-307. 2. H. Chang, Y.-C Chiu and Y.-L Tsai, A variation of cover-free families and its applications, preprint. 3. H. Chang and Y.-L Tsai, Threshold group testing with consecutive positives, Discrete Appl. Math. 169 (2014) 68-72. 4. C. J. Colbourn, Group testing for consecutive positives, Ann. Combin. 3 (1999) 37-41. 5. P. Damaschke, Threshold group testing, In: General Theory of Information Transfer and Combinatorics, Lect. Notes Comput. Sci. 4123 (2006) 707-718. 6. P. Erdos and L. Lovasz, Infinite and finite sets, Colloq. Math. Soc. Janos Bolyai 10 (1974) 609-627. 7. J. S.-T. Juan and G. J. Chang, Adaptive group testing for consecutive positives, Discrete Math. 308 (2008) 1124-1129. 36 References 8. L. Lovasz, On the ratio of optimal integral and fractional covers, Discrete Math. 13 (1975) 383-390. 9. R. A. Moser and G. Tardos, A constructive proof of the general Lovasz Local Lemma, Journal of the ACM (JACM). 57 (2010) 1-15. 10. M. Muller and M. Jimbo, Consecutive positive detectable matrices and group testing for consecutive positives, Discrete Math. 279 (2004) 369-381. 11. S. K. Stein, Two combinatorial covering problems, J. Combinatorial Theory. 16 (1974) 391-397. 37
© Copyright 2026 Paperzz