Biological Plasticity Rescues Target Activity in CRISPR Knockouts, bioRxiv, 2019-07-27
AbstractGene knockouts (KOs) are efficiently engineered through CRISPR-Cas9-induced frameshift mutations. While DNA editing efficiency is readily verified by DNA sequencing, a systematic understanding of the efficiency of protein elimination has been lacking. Here, we devised an experimental strategy combining RNA-seq and triple-stage mass spectrometry (SPS-MS3) to characterize 193 genetically verified deletions targeting 136 distinct genes generated by CRISPR-induced frameshifts in HAP1 cells. We observed residual protein expression for about one third of the quantified targets, at variable levels from low to original, and identified two causal mechanisms, translation reinitiation leading to N-terminally truncated target proteins, or skipping of the edited exon leading to protein isoforms with internal sequence deletions. Detailed analysis of three truncated targets, BRD4, DNMT1 and NGLY1, revealed partial preservation of protein function. Our results imply that systematic characterization of residual protein expression or function in CRISPR-Cas9 generated KO lines is necessary for phenotype interpretation.
biorxiv bioengineering 100-200-users 2019Efficient de novo assembly of eleven human genomes using PromethION sequencing and a novel nanopore toolkit, bioRxiv, 2019-07-26
AbstractPresent workflows for producing human genome assemblies from long-read technologies have cost and production time bottlenecks that prohibit efficient scaling to large cohorts. We demonstrate an optimized PromethION nanopore sequencing method for eleven human genomes. The sequencing, performed on one machine in nine days, achieved an average 63x coverage, 42 Kb read N50, 90% median read identity and 6.5x coverage in 100 Kb+ reads using just three flow cells per sample. To assemble these data we introduce new computational tools Shasta - a de novo long read assembler, and MarginPolish & HELEN - a suite of nanopore assembly polishing algorithms. On a single commercial compute node Shasta can produce a complete human genome assembly in under six hours, and MarginPolish & HELEN can polish the result in just over a day, achieving 99.9% identity (QV30) for haploid samples from nanopore reads alone. We evaluate assembly performance for diploid, haploid and trio-binned human samples in terms of accuracy, cost, and time and demonstrate improvements relative to current state-of-the-art methods in all areas. We further show that addition of proximity ligation (Hi-C) sequencing yields near chromosome-level scaffolds for all eleven genomes.
biorxiv bioinformatics 200-500-users 2019On the discovery of population-specific state transitions from multi-sample multi-condition single-cell RNA sequencing data, bioRxiv, 2019-07-26
AbstractSingle-cell RNA sequencing (scRNA-seq) has quickly become an empowering technology to profile the transcriptomes of individual cells on a large scale. Many early analyses of differential expression have aimed at identifying differences between subpopulations, and thus are focused on finding subpopulation markers either in a single sample or across multiple samples. More generally, such methods can compare expression levels in multiple sets of cells, thus leading to cross-condition analyses. However, given the emergence of replicated multi-condition scRNA-seq datasets, an area of increasing focus is making sample-level inferences, termed here as differential state analysis. For example, one could investigate the condition-specific responses of cell subpopulations measured from patients from each condition; however, it is not clear which statistical framework best handles this situation. In this work, we surveyed the methods available to perform cross-condition differential state analyses, including cell-level mixed models and methods based on aggregated “pseudobulk” data. We developed a flexible simulation platform that mimics both single and multi-sample scRNA-seq data and provide robust tools for multi-condition analysis within the muscat R package.
biorxiv bioinformatics 100-200-users 2019Innovations in Primate Interneuron Repertoire, bioRxiv, 2019-07-24
ABSTRACTPrimates and rodents, which descended from a common ancestor more than 90 million years ago, exhibit profound differences in behavior and cognitive capacity. Modifications, specializations, and innovations to brain cell types may have occurred along each lineage. We used Drop-seq to profile RNA expression in more than 184,000 individual telencephalic interneurons from humans, macaques, marmosets, and mice. Conserved interneuron types varied significantly in abundance and RNA expression between mice and primates, but varied much more modestly among primates. In adult primates, the expression patterns of dozens of genes exhibited spatial expression gradients among neocortical interneurons, suggesting that adult neocortical interneurons are imprinted by their local cortical context. In addition, we found that an interneuron type previously associated with the mouse hippocampus—the “ivy cell”, which has neurogliaform characteristics—has become abundant across the neocortex of humans, macaques, and marmosets. The most striking innovation was subcortical we identified an abundant striatal interneuron type in primates that had no molecularly homologous cell population in mouse striatum, cortex, thalamus, or hippocampus. These interneurons, which expressed a unique combination of transcription factors, receptors, and neuropeptides, including the neuropeptide TAC3, constituted almost 30% of striatal interneurons in marmosets and humans. Understanding how gene and cell-type attributes changed or persisted over the evolutionary divergence of primates and rodents will guide the choice of models for human brain disorders and mutations and help to identify the cellular substrates of expanded cognition in humans and other primates.
biorxiv neuroscience 100-200-users 2019The Genetic History of France, bioRxiv, 2019-07-24
ABSTRACTThe study of the genetic structure of different countries within Europe has provided significant insights into their demographic history and their actual stratification. Although France occupies a particular location at the end of the European peninsula and at the crossroads of migration routes, few population genetic studies have been conducted so far with genome-wide data. In this study, we analyzed SNP-chip genetic data from 2 184 individuals born in France who were enrolled in two independent population cohorts. Using FineStructure, six different genetic clusters of individuals were found that were very consistent between the two cohorts. These clusters match extremely well the geography and overlap with historical and linguistic divisions of France. By modeling the relationship between genetics and geography using EEMS software, we were able to detect gene flow barriers that are similar in the two cohorts and corresponds to major French rivers or mountains. Estimations of effective population sizes using IBDNe program also revealed very similar patterns in both cohorts with a rapid increase of effective population sizes over the last 150 generations similar to what was observed in other European countries. A marked bottleneck is also consistently seen in the two datasets starting in the fourteenth century when the Black Death raged in Europe. In conclusion, by performing the first exhaustive study of the genetic structure of France, we fill a gap in the genetic studies in Europe that would be useful to medical geneticists but also historians and archeologists.
biorxiv genetics 100-200-users 2019The histone chaperone FACT induces Cas9 multi-turnover behavior and modifies genome manipulation in human cells, bioRxiv, 2019-07-24
SummaryCas9 is a prokaryotic RNA-guided DNA endonuclease that binds substrates tightly in vitro but turns over rapidly when used to manipulate genomes in eukaryotic cells. Little is known about the factors responsible for dislodging Cas9 or how they influence genome engineering. Using a proximity labeling system for unbiased detection of transient protein interactions in cell-free Xenopus laevis egg extract, we identified the dimeric histone chaperone FACT as an interactor of substrate-bound Cas9. Immunodepletion of FACT subunits from extract potently inhibits Cas9 unloading and converts Cas9’s activity from multi-turnover to single-turnover. In human cells, depletion of FACT delays genome editing and alters the balance between indel formation and homology directed repair. Depletion of FACT also increases epigenetic marking by dCas9-based transcriptional effectors with concomitant enhancement of transcriptional modulation. FACT thus shapes the intrinsic cellular response to Cas9-based genome manipulation most likely by determining Cas9 residence times.
biorxiv cell-biology 0-100-users 2019