Supplementary tables & figures: High resolution pseudotemporal ordering of single-cell RNA-seq profiles Kieran Campbell, Chris P Ponting, Caleb Webber [email protected] Sept 15 Supplementary tables Monocle classification Embeddr classification Number of cells Non-mesenchymal Mesenchymal 69 Mesenchymal Mesenchymal 43 Non-mesenchymal Non-Mesenchymal 159 Mesenchymal Non-Mesenchymal 0 Supplementary Table 1 Classification of different cell types using Monocle and Embeddr. Embeddr classifies significantly more cells as mesenchymal compared to non-mesenchymal. Gene cluster 1 Category Log10(q-val) Description GO:0000278 -43.79775 mitotic cell cycle GO:0007049 -39.25601 cell cycle GO:0000280 -38.43667 nuclear division GO:1903047 -36.59466 mitotic cell cycle process 2 GO:0048285 -35.83482 organelle fission GO:0022402 -34.52133 cell cycle process GO:0006613 -13.67789 cotranslational 1 protein targeting to membrane GO:0045047 -13.67789 protein targeting to ER GO:0030198 -13.67789 extracellular matrix organization GO:0043062 -13.67789 extracellular structure organization GO:0006614 -13.67789 SRP-dependent cotranslational protein targeting to membrane GO:0072599 -13.56176 establishment of protein localization to endoplasmic reticulum 3 GO:0061061 -24.65233 muscle structure development GO:0007517 -22.65526 muscle organ development GO:0003012 -19.99099 muscle system process GO:0006936 -19.03772 muscle contraction GO:0030049 -18.35997 muscle filament sliding GO:0033275 -18.35997 actin-myosin filament sliding 4 GO:0051084 -4.037972 ‘de novo’ posttranslational protein folding GO:0006458 -3.962691 ‘de novo’ protein folding GO:0006457 -3.556899 protein folding Supplementary Table 2 Enriched GO biological process terms in different gene clusters. Only the top 6 in each cluster with a Benjamini-Hochberg corrected p-value < 0.01 are reported. 2 Marker genes identified by Embeddr and Quake et al. Marker genes identified by Quake et al. only. Marker genes identified by Embeddr only S100g Fabp5 Lamp3 Cd36 Scd1 Sftpb Slc34a2 Sftpa1 Egfl6 Soat1 Bex2 Sftpc Lcn2 Hc Trf Lyz2 Lyz1 Abca3 Muc1 4930420K17Rik Acsl4 Cited2 Cxcl15 Dlk1 Dram1 Etv5 Lpcat1 Mid1ip1 Napsa Nek6 Pgm2 Pla2g1b Ppp1r14c Rrp1 Sgms1 Supplementary Table 3 Marker genes for AT2 cells identified by differential silencing of genes on the BP-AT1 transition. Marker genes identified by Embeddr and Quake et al. Marker genes identified by Quake et al. only. Marker genes identified by Embeddr only Pdpn Rtkn2 Emp2 Cav1 Clic5 Lmo7 S100a6 Col4a3 Akap5 Cryab Sdpr Aqp5 Ager S100a14 1190002H23Rik 2200002D01Rik A930038C07Rik Ablim1 Actb Agrn Ahnak Akap2 Anxa3 Arhgap29 Bola2 Cav2 Cd9 Cish Clic3 Col4a4 Crip2 Cttnbp2nl Dpysl2 Elovl5 Errfi1 Fam108c Fam174b Fbln5 Frmd4a Gm6548 Gprc5a Hes1 Hopx Hs2st1 Hspg2 Igf2bp2 Igfbp6 Il18r1 Ip6k2 Itgb6 Krt7 Lgals3 Limch1 Lims1 Magi3 Mal2 Mir682 Msln Myh14 Myl12a Myo1c Phyh Pmp22 Ppp3ca Prdx6 Ptrf Pxdn Rps19 Rps19-ps3 Rps4y2 S100a11 Scnn1a Sec14l3 Sema3e Serpinb6b Sfn Slc1a5 Slco3a1 Sparc Tcf7l2 Tgfb2 Timp3 Tinagl1 Tmem37 Trp53bp2 Tspan8 Usp46 Vegfa Yes1 Zfp703 Supplementary Table 4 Marker genes for AT1 cells. 3 Supplementary figures Supplementary Figure 1 Pseudotime ordering of single-cell RNA-seq data. Beginning with a set of unordered profiles, the algorithm will assign to each cell a pseudotime after which differential expression across pseudotime can be found. 4 Supplementary Figure 2 Principal curve fitting in pseudotime. A principal curve (black solid line) is fit through the centre of the 2-dimensional laplacian eigenmaps embedding of the cells (red). The arclength from the beginning of the curve to the orthogonal project of each cell (blue dotted line) is used as to assign each cell a pseudotime. 5 Supplementary Figure 3 Laplacian eigenmaps emedding using entire gene set and clusters assigned using reduced (high variance) genes. 6 Supplementary Figure 4 Expression of mesenchymal marker genes. A large number of cells designated as non-mesenchymal by monocle show high expression of mesenchymal marker genes. Supplementary Figure 5 Principal curve and pseudotime fitting to clusters 1 & 2 (differentiation trajectory) of the Monocle dataset. 7 Supplementary Figure 6 Scatterplot of pseudotimes assigned by Monocle and Embeddr using the Monocle dataset. Spearman correlation coefficient 0.8. 8 Supplementary Figure 7 Four clusters identified using complete linkage clustering on the predicted values of the tobit regression models for pseudotime dynamics. 9 Supplementary Figure 8 Laplacian eigenmaps embedding and principal curve fitting (dashed, black) for 5, 6, 8, 10, 15, 20 nearest neighbours in the embedding. The clusters are coloured by their original assignment (k = 6), showing the method is highly robust to the choice of k. 10 Supplementary Figure 9 Embeddings and principal curves re-fit removing a randomly selected 50% of cells. In each case cluster assignments remain well separated and the pseudotime trajectory fits correctly. 11 Supplementary Figure 10 Spearman correlations of 30 pseudotime fittings using only 50% of cells compared to the pseudotime fit using the full set. Median spearman correlation 0.98. 12 Supplementary Figure 11 Spearman correlations of 30 pseudotime fittings using only 50% of genes used for original ordering. Median spearman correlation 0.91. 13 Supplementary Figure 12 Cell density as a function of pseudotime. BP cells are spread out along the beginning of the pseudotime trajectory before moving into the AT1 and AT2 states with high density. This suggests BP cells display more variability than AT cells, with AT1 cells representing the highest transcriptional state. Note that in this representation, cells begin as BP and move `forwards' in pseudotime to differentiate into AT1 cells or `backwards' in pseudotime to differentiate into AT2 cells. 14 Supplementary Figure 13 Pseudotime gene models for differentially expressed genes in the BP-AT1 transition. This transition is marked exclusively by the selective silencing of genes. 15 Supplementary Figure 14 Clusters of co-expression in the BP-AT2 transition. Three non-degenerate clusters were identified corresponding to up-regulation (1), down-regulation (2) and transient upregulation (3). Supplementary Figure 15 Differential gene expression as BP cells differentiate into AT cells. BP cells (blue) differentiate into AT1 cells (purple) or AT2 cells (pink) mainly through selective silencing of genes. The figure shows previously identified ‘perfect’ marker genes identified (purple), including those missed by Embeddr (orange), as well as genes discovered exclusively by Embeddr (green). While 16 cells are placed horizontally in pseudotime order they were assigned but that the width of each is constant for visualisation purposes (cellular density is not taken into account). 17
© Copyright 2026 Paperzz