NMETH-BC07899D Supplementary Figure 1. The principle of 7DDNA haplotyping. A diploid cell is shown with 5 chromosome pairs. By computer-directed laser microdissection at the dotted line, 5 chromosomes are collected. In this harvest, chromosomes 2, 3 and 5 are monosomic, chromosome 1 is disomic, and chromosome 4 is null. Conventional genotyping using this harvest will directly reveal haplotypes of chromosomes 2, 3, and 5. NMETH-BC07899D Supplementary Figure 2. Monitoring the copy number of chromosome-1 in a microdissection harvest. The copy number of each sister chromosome pair in the harvest can be readily monitored by the genotypes of the 7DDNA sample. If a 7DDNA sample shows “homozygous” genotypes at all loci known to be heterozygous (known from genomic DNA genotyping results) along a chromosome in a person, this microdissection harvest should contain one single copy of this chromosome. This figure shows observed heterozygous SNPs along the chromosome-1 of our microdissection harvest-2 in our genotyping result using the Illumina CNV370 platform. Each red bar represents a known heterozygous SNP that remained to be heterozygous in the genotype call with WGA4 amplified microdissection harvest. Those SNPs located on repeats were excluded. These observed heterozygous SNP loci enabled us to distinguish the haploid portion (containing only one of two sister chromosome copies) and the diploid portion (containing both sister chromosome copies) in this microdissected sample. The boundary between the haploid portion and the diploid portion is indicated by a horizontal green line in this figure. One of the homologous copies of chromosome-1 was broken in the microdissection procedure, the allele calls of chromsome-1 segment below this line constituted the haplotype output. NMETH-BC07899D Supplementary Figure 3. Monitoring the copy number of chromosomes in three microdissection harvests. The copy number of each sister chromosome pair in a microdissection harvest can be monitored through the remaining heterozygous loci in a sample. If a 7DDNA sample shows “homozygous” genotypes at all loci known to be heterozygous (known from genomic DNA genotyping results) along a chromosome in a person, this microdissection harvest should contain one single copy of this chromosome. This figure shows two typical windows (one for haploid chromosomes and one for diploid chromosomes) for each of our three microdissection harvests. Each red bar represents a known heterozygous SNP that remained to be heterozygous in the genotype call with WGA4 amplified microdissection harvest. Those SNPs located on repeats were excluded. NMETH-BC07899D Supplementary Figure 4. The relationship between chromosome coverage and the cumulative number of microdissection in the 7DDNA haplotyping. For example, 6 cumulative cut will ensure ~72.9% of the total individuals receive a whole-genome haplotype report , in which 20.1% of individuals may have completed their allchromosome haplotyping when it reaches 4 cuts, 26.2% of individuals may have completed in the 5th round, and 20.7% of individuals at the 6th round. The probability data is provided in Supplementary Table 6. The detailed calculation procedure is provided in Supplementary Equation-1. NMETH-BC07899D Supplementary Table 1. A zoom-in window of the haplotype report on chromosome 5. SNP Name rs10076494 rs1108867 rs28416084 rs6869925 rs10055993 rs4957023 rs6868535 rs4541696 rs4045344 rs9313132 rs7702501 rs920980 rs4957112 rs4594899 rs4975605 rs2736100 rs402710 rs31489 rs6347 rs27048 Chr 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 Pos (bp) 91139 222902 270640 317036 317296 389952 548377 634053 648870 651237 918411 949983 1E+06 1E+06 1E+06 1E+06 1E+06 1E+06 1E+06 1E+06 SNP [A/G] [A/G] [A/G] [A/G] [A/C] [T/C] [T/C] [A/G] [A/C] [A/G] [A/C] [T/C] [T/C] [A/G] [T/G] [T/G] [T/C] [A/C] [A/G] [T/C] Conventional genotyping output gDNA 7DDNA TC AG AG AG AC TC TC TC AC AG AC TC TC AG AC TG TC AC AG TC Haplotype output Observed Derived TT AA AA GG CC TT CC TT AA AA AA CC CC GG CC TT TT AA GG CC T A A G C T C T A A A C C G C T T A G C C G G A A C T C C G C T T A A G C C A T One chromosomal haplotype (observed) was directly readout from the Illumina output, the other chromosomal haplotype (derived) is composed of opposite alleles that were not observed in the genotyping report. gDNA, genomic DNA. Conventional genotyping, we used a high-throughput Illumina genotyping platform in this experiment. NMETH-BC07899D Supplementary Table 3. Fidelity of whole genome amplification in the 7DDNA haplotyping. Error rate (%) Samples haploid chromosomes diploid chromosomes all chromosomes Sample-1 1.66 1.60 1.63 Sample-2 0.63 0.50 0.54 Sample-3 0.72 0.46 0.50 All samples 0.98 0.64 0.73 Through observation on the genotype calls of known homozygous SNPs after whole genome amplification of our microdissection harvests, we were able to access the fidelity of the WGA in our experiment. SNPs on repeats were excluded. We observed the opposite allele calls on only 0.73% of these homozygous SNPs. These errors may be caused by WGA errors, Illumina genotyping errors, and overlapping with un-annotated repeats, CNVs, or segmental duplications. NMETH-BC07899D Supplementary Table 4. Call rates of the Illumina genotyping with whole genome amplification products of microdissection harvests in the 7DDNA haplotyping. . Haploid chromosomes Diploid chromosomes Samples Total SNPs Called SNPs % Total SNPs Called SNPs % Sample-1 148565 36961 24.9 191502 55092 28.8 Sample-2 139103 43295 31.1 200964 108057 53.8 Sample-3 67703 25232 37.3 283296 159662 56.4 All 355371 105488 29.7 675762 322811 47.8 All SNPs, including both homozygous SNPs and heterozygous SNPs reported in the genotyping with the genomic DNA of GM10847, were included in this analysis. NMETH-BC07899D Supplementary Table 5. Allele dropout of the Illumina genotyping with whole genome amplification products of microdissection harvests in the 7DDNA haplotyping. Total Number of Samples Observed heterozygous heterozygous chromosomes SNPs Observed Obs/Total Heterozygous (%) SNP/chromosome SNPs Haploid chromosomes Sample-1 9 13641 8 0.059 0.9 Sample-2 10 15649 13 0.083 1.3 Sample-3 4 7193 8 0.111 2.0 All 23 36483 29 0.079 1.3 Diploid chromosomes Sample-1 13 13241 832 6.3 64 Sample-2 12 27011 2827 10.5 236 Sample-3 18 40287 5029 12.5 279 All 43 80539 8688 10.8 202 Only heterozygous SNPs reported in the genotyping with the genomic DNA of GM10847 were included in this analysis. Those SNPs located on repeats were excluded. Repeated were identified by RepeatMasker. NMETH-BC07899D Supplementary Table 6. Probability for haplotyping whole-genome after m cuts. The number pm ([1-P(A)]) of cuts (m) Assumption A1 Assumption A3 1 1.02E-06 1.19E-07 2 0.00201 0.00134 3 0.05830 0.04636 4 0.25960 0.22664 5 0.52190 0.48180 6 0.72910 0.69613 7 0.85730 0.83494 8 0.92760 0.91391 9 0.96390 0.95603 10 0.98220 0.97778 11 0.99130 0.98883 12 0.99570 0.99440 13 0.99786 0.99720 14 0.99893 0.99860 15 0.99945 0.99930 16 0.99971 0.99965 17 0.99983 0.99982 18 0.99989 0.99991 19 0.99992 0.99996 20 0.99994 0.99998 The detailed calculation procedure of this table is provided in Supplementary Equation-1. NMETH-BC07899D Supplementary Table 7. The SNP coverage in cumulative 7DDNA haplotyping. SNP coverage% Number of Haplotyped chromosomes chromosomes Mean SD Min Max chromosomes haplotyped by 1 harvests 8 27.5 10.0 15.9 39.0 chromosomes haplotyped by 2 harvests 5 49.4 2.6 45.2 51.6 chromosomes haplotyped by 3 harvests 1 59.5 59.5 59.5 NMETH-BC07899D Supplementary Note-1. The probability that at least one chromosome has not been separated in cumulative haplotyping. Assumptions: A1: Each cut results half of chromosomes in one side and the other half in the other side; A2: The cut from one cell is independent to the cut from another cell; Notations n : the number of chromosome pairs ( n 23 ); m : the number of cells used (the number of cuts); Ai : the event that the i th chromosome pair is not separated ( i 1, , n ) after m cuts; Aij : the event that the i th chromosome pair is not separated in the j th cut ( i 1, , n; j 1, , m ) A : the event that at least one chromosome is not separated after m cuts; n m We have A i 1 Ai and Ai j 1 Aij . So n P ( A) P( i 1 Ai ) k n k 1 (1) k 1 P( A1 A2 Ak ) n m k n k 1 (1) k 1 P( j 1 ( A1 j A2 j Akj )) n k n k 1 (1) k 1 [ P( A11 A21 Ak1 )]m , n where A11 A21 Ak 1 is the event that the first k chromosome pairs are not separated in one cut. We also have NMETH-BC07899D l n 2l k 2n 2k k . P ( A11 A21 Ak1 ) l 0,2l n n 2n We can understand this calculation as follows: n : The total number of combinations; 2n l : k The number of combinations that l chromosome pairs are not separated in one side ( l chromosome pairs in one side while k l haplotype pairs in the other side; n 2l : The number of combinations when l chromosome pairs are not separated in 2n 2 k one side; If we change the aforementioned assumption A1 as the following assumption: A3: each chromosome has a probability of 0.5 to enter one side in the cut, then n n P ( A) P( i 1 Ai ) 1 P(( i 1 Ai )c ) n 1 P( i 1 Aic ) 1 P( A1c ) n , m m where P ( A1c ) P (( j 1 A1 j )c ) 1 P ( j 1 A1 j ) 1 P ( A11 ) m 1 0.5m. So we have P ( A) 1 (1 0.5m ) n . Supplementary Table 6 showed the probability data that all chromosome pairs are separated after m cuts, which is 1 P( A) . NMETH-BC07899D Supplementary Note-2. The estimation of 7DDNA Cost. Assume that for each individual, we will make up to 12 microdissection sampling cut. Assume that in each round we will process only one of an individual’s microdissection harvests (whole genome amplification and Illumina SNP genotyping). At the end of each round, successfully haplotyped chromosomes are summarized for each individual. When an individual has already received a haplotype report for all 23 chromosomes, his/her remaining microdissection harvests will not be processed in the next round. Our statistical analysis (Supplementary Equation-1) showed that 4-8 microdissection samplings of each individual will provide a 93% probability to obtain his/her whole genome haplotypes (Supplementary Table 6 and Supplementary Fig. 4). In other words, ~93% of individuals will need no more than 8 microdissections to complete his/her genome haplotyping. When we reach 12 microdissection samplings, 99.6% of all individuals will have his/her whole genome haplotypes (Supplementary Table 6). Assume that the cost for first round haplotyping is x and the cost for each addition round for m repeats (from m cells) is ym (m 1, 2,) . Denote the probability of that the procedure is repeated m times is pm (m 1, 2,) , then the expected cost of the experiment is: x m 1 pm ym The probability pm is the probability that at least one chromosome pair that is not separated after m 1 microdissections but all chromosome pairs are separated after m cuts. The pm values are provided in the Supplementary Table 6. NMETH-BC07899D For each specimen, there is a basal cost on a genotyping assay with its genomic DNA, and a cost ($5.20) on tubes, tips, chemicals, and a foiled slide. Based on the outsource service price, $473 for Illumina Omni1-Quad BeadChip and $342 for Illumina CNV370 BeadChip, x = $473+5.20 = $478.20 (Illumina Omni1-Quad BeadChip) or x = $342+5.20 = $347.20 (Illumina CNV370 BeadChip) The cost for each cut includes a collecting tube ($0.07), whole genome amplification (WGA) ($10) and a high-throughput SNP genotyping ($473). So ym= $483 (Illumina Omni1-Quad BeadChip) or $352 (Illumina CNV370 BeadChip) ym = ymx m The total outsource service cost per individual will be, (Illumina Omni1-Quad) x m 1 pm ym = $478 + m-1 pm x 483 = $2675/individual (Illumina CNV370) x m 1 pm ym = $347 + m-1 pm x 352 = $1968/individual * The pm values are provided in the Supplementary Table 6. The total outsource service cost per individual per SNP will be, (Illumina Omni1-Quad) $2675/1140419 = $0.0023/individual/SNP (Illumina CNV370) $1968/351507 = $0.0056/individual/SNP
© Copyright 2026 Paperzz