Highly multiplexed in situ protein imaging with signal amplification by Immuno-SABER, bioRxiv, 2018-12-29
AbstractProbing the molecular organization of tissues requires in situ analysis by microscopy. However current limitations in multiplexing, sensitivity, and throughput collectively constitute a major barrier for comprehensive single-cell profiling of proteins. Here, we report Immunostaining with Signal Amplification By Exchange Reaction (Immuno-SABER), a rapid, highly multiplexed signal amplification method that simultaneously tackles these key challenges. Immuno-SABER utilizes DNA-barcoded antibodies and provides a method for highly multiplexed signal amplification via modular orthogonal DNA concatemers generated by Primer Exchange Reaction. This approach offers the capability to preprogram and control the amplification level independently for multiple targets without in situ enzymatic reactions, and the intrinsic scalability to rapidly amplify and image a large number of protein targets. We validated our approach in diverse sample types including cultured cells, cryosections, FFPE sections, and whole mount tissues. We demonstrated independently tunable 5-180-fold amplification for multiple targets, covering the full signal range conventionally achieved by secondary antibodies to tyramide signal amplification, as well as simultaneous signal amplification for 10 different proteins using standard equipment and workflow. We further combined Immuno-SABER with Expansion Microscopy to enable rapid and highly multiplexed super-resolution tissue imaging. Overall, Immuno-SABER presents an effective and accessible platform for rapid, multiplexed imaging of proteins across scales with high sensitivity.
biorxiv cell-biology 200-500-users 2018Human microbiome aging clocks based on deep learning and tandem of permutation feature importance and accumulated local effects, bioRxiv, 2018-12-29
The human gut microbiome is a complex ecosystem that both affects and is affected by its host status. Previous analyses of gut microflora revealed associations between specific microbes and host health and disease status, genotype and diet. Here, we developed a method of predicting the biological age of the host based on the microbiological profiles of gut microbiota using a curated dataset of 1,165 healthy individuals (1,663 microbiome samples). Our predictive model, a human microbiome clock, has an architecture of a deep neural network and achieves the accuracy of 3.94 years mean absolute error in cross-validation. The performance of the deep microbiome clock was also evaluated on several additional populations. We further introduce a platform for biological interpretation of individual microbial features used in age models, which relies on permutation feature importance and accumulated local effects. This approach has allowed us to define two lists of 95 intestinal biomarkers of human aging. We further show that this list can be reduced to 39 taxa that convey the most information on their host's aging. Overall, we show that (a) microbiological profiles can be used to predict human age; and (b) microbial features selected by models are age-related.
biorxiv bioinformatics 200-500-users 2018Prioritisation of oncology therapeutic targets using CRISPR-Cas9 screening, bioRxiv, 2018-12-20
SummaryFunctional genomics approaches can overcome current limitations that hamper oncology drug development such as lack of robust target identification and clinical efficacy. Here we performed genome-scale CRISPR-Cas9 screens in 204 human cancer cell lines from 12 cancer-types and developed a data-driven framework to prioritise cancer therapeutic candidates. We integrated gene cell fitness effects with genomic biomarkers and target tractability for drug development to systematically prioritise new oncology targets in defined tissues and genotypes. Furthermore, we took one of our most promising dependencies, Werner syndrome RecQ helicase, and verified it as a candidate target for tumours with microsatellite instability. Our analysis provides a comprehensive resource of cancer dependencies, a framework to prioritise oncology targets, and nominates specific new candidates. The principles described in this study can transform the initial stages of the drug development process contributing to a new, diverse and more effective portfolio of oncology targets.
biorxiv cancer-biology 200-500-users 2018An RNA-binding region regulates CTCF clustering and chromatin looping, bioRxiv, 2018-12-13
Mammalian genomes are folded into Topologically Associating Domains (TADs), consisting of cell-type specific chromatin loops anchored by CTCF and cohesin. Since CTCF and cohesin are expressed ubiquitously, how cell-type specific CTCF-mediated loops are formed poses a paradox. Here we show RNase-sensitive CTCF self-association in vitro and that an RNA-binding region (RBR) mediates CTCF clustering in vivo. Intriguingly, deleting the RBR abolishes or impairs almost half of all chromatin loops in mouse embryonic stem cells. Disrupted loop formation correlates with abrogated clustering and diminished chromatin binding of the RBR mutant CTCF protein, which in turn results in a failure to halt cohesin-mediated extrusion. Thus, CTCF loops fall into at least 2 classes RBR-independent and RBR-dependent loops. We suggest that evidence for distinct classes of RBR-dependent loops may provide a mechanism for establishing cell-specific CTCF loops regulated by RNAs and other RBR partner.
biorxiv biophysics 200-500-users 2018Fast and accurate large multiple sequence alignments using root-to-leave regressive computation, bioRxiv, 2018-12-08
AbstractInferences derived from large multiple alignments of biological sequences are critical to many areas of biology, including evolution, genomics, biochemistry, and structural biology. However, the complexity of the alignment problem imposes the use of approximate solutions. The most common is the progressive algorithm, which starts by aligning the most similar sequences, incorporating the remaining ones following the order imposed by a guide-tree. We developed and validated on protein sequences a regressive algorithm that works the other way around, aligning first the most dissimilar sequences. Our algorithm produces more accurate alignments than non-regressive methods, especially on datasets larger than 10,000 sequences. By design, it can run any existing alignment method in linear time thus allowing the scale-up required for extremely large genomic analyses.One Sentence SummaryInitiating alignments with the most dissimilar sequences allows slow and accurate methods to be used on large datasets
biorxiv bioinformatics 200-500-users 2018Large-scale exome sequencing study implicates both developmental and functional changes in the neurobiology of autism, bioRxiv, 2018-12-01
SummaryWe present the largest exome sequencing study of autism spectrum disorder (ASD) to date (n=35,584 total samples, 11,986 with ASD). Using an enhanced Bayesian framework to integrate de novo and case-control rare variation, we identify 102 risk genes at a false discovery rate ≤ 0.1. Of these genes, 49 show higher frequencies of disruptive de novo variants in individuals ascertained for severe neurodevelopmental delay, while 53 show higher frequencies in individuals ascertained for ASD; comparing ASD cases with mutations in these groups reveals phenotypic differences. Expressed early in brain development, most of the risk genes have roles in regulation of gene expression or neuronal communication (i.e., mutations effect neurodevelopmental and neurophysiological changes), and 13 fall within loci recurrently hit by copy number variants. In human cortex single-cell gene expression data, expression of risk genes is enriched in both excitatory and inhibitory neuronal lineages, consistent with multiple paths to an excitatoryinhibitory imbalance underlying ASD.
biorxiv genetics 200-500-users 2018