Genome-wide Analysis of Insomnia (N=1,331,010) Identifies Novel Loci and Functional Pathways, bioRxiv, 2018-01-31
AbstractInsomnia is the second-most prevalent mental disorder, with no sufficient treatment available. Despite a substantial role of genetic factors, only a handful of genes have been implicated and insight into the associated neurobiological pathways remains limited. Here, we use an unprecedented large genetic association sample (N=1,331,010) to allow detection of a substantial number of genetic variants and gain insight into biological functions, cell types and tissues involved in insomnia complaints. We identify 202 genome-wide significant loci implicating 956 genes through positional, eQTL and chromatin interaction mapping. We show involvement of the axonal part of neurons, of specific cortical and subcortical tissues, and of two specific cell-types in insomnia striatal medium spiny neurons and hypothalamic neurons. These cell-types have been implicated previously in the regulation of reward processing, sleep and arousal in animal studies, but have never been genetically linked to insomnia in humans. We found weak genetic correlations with other sleep-related traits, but strong genetic correlations with psychiatric and metabolic traits. Mendelian randomization identified causal effects of insomnia on specific psychiatric and metabolic traits. Our findings reveal key brain areas and cells implicated in the neurobiology of insomnia and its related disorders, and provide novel targets for treatment.
biorxiv genetics 100-200-users 2018On the design of CRISPR-based single cell molecular screens, bioRxiv, 2018-01-30
AbstractSeveral groups recently reported coupling CRISPRCas9 perturbations and single cell RNA-seq as a potentially powerful approach for forward genetics. Here we demonstrate that vector designs for such screens that rely on cis linkage of guides and distally located barcodes suffer from swapping of intended guide-barcode associations at rates approaching 50% due to template switching during lentivirus production, greatly reducing sensitivity. We optimize a published strategy, CROP-seq, that instead uses a Pol II transcribed copy of the sgRNA sequence itself, doubling the rate at which guides are assigned to cells to 94%. We confirm this strategy performs robustly and further explore experimental best practices for CRISPRCas9-based single cell molecular screens.
biorxiv genomics 100-200-users 2018The Juicebox Assembly Tools module facilitates de novo assembly of mammalian genomes with chromosome-length scaffolds for under 1000, bioRxiv, 2018-01-29
Hi-C contact maps are valuable for genome assembly (Lieberman-Aiden, van Berkum et al. 2009; Burton et al. 2013; Dudchenko et al. 2017). Recently, we developed Juicebox, a system for the visual exploration of Hi-C data (Durand, Robinson et al. 2016), and 3D-DNA, an automated pipeline for using Hi-C data to assemble genomes (Dudchenko et al. 2017). Here, we introduce “Assembly Tools,” a new module for Juicebox, which provides a point-and-click interface for using Hi-C heatmaps to identify and correct errors in a genome assembly. Together, 3D-DNA and the Juicebox Assembly Tools greatly reduce the cost of accurately assembling complex eukaryotic genomes. To illustrate, we generated de novo assemblies with chromosome-length scaffolds for three mammals the wombat, Vombatus ursinus (3.3Gb), the Virginia opossum, Didelphis virginiana (3.3Gb), and the raccoon, Procyon lotor (2.5Gb). The only inputs for each assembly were Illumina reads from a short insert DNA-Seq library (300 million Illumina reads, maximum length 2x150 bases) and an in situ Hi-C library (100 million Illumina reads, maximum read length 2x150 bases), which cost <$1000.
biorxiv genomics 100-200-users 2018Integrating single-cell RNA-Seq with spatial transcriptomics in pancreatic ductal adenocarcinoma using multimodal intersection analysis, bioRxiv, 2018-01-27
To understand tissue architecture, it is necessary to understand both which cell types are present and the physical relationships among them. Single-cell RNA-Seq (scRNA-Seq) has made significant progress towards the unbiased and systematic identification of cell populations within a tissue, however, the characterization of their spatial organization within it has been more elusive. The recently introduced ‘spatial transcriptomics’ method (ST) reveals the spatial pattern of gene expression within a tissue section at a resolution of a thousand 100 µm spots across the tissue, each capturing the transcriptomes of multiple cells. Here, we present an approach for the integration of scRNA-Seq and ST data generated from the same sample, and deploy it on primary pancreatic tumors from two patients. Applying our multimodal intersection analysis (MIA), we annotated the distinct micro-environment of each cell type identified by scRNA-Seq. We further found that subpopulations of ductal cells, macrophages, dendritic cells, and cancer cells have spatially restricted localizations across the tissue, as well as distinct co-enrichments with other cell types. Our mapping approach provides an efficient framework for the integration of the scRNA-Seq-defined subpopulation structure and the ST-defined tissue architecture in any tissue.
biorxiv cancer-biology 100-200-users 2018Neural spiking for causal inference, bioRxiv, 2018-01-26
AbstractWhen a neuron is driven beyond its threshold it spikes, and the fact that it does not communicate its continuous membrane potential is usually seen as a computational liability. Here we show that this spiking mechanism allows neurons to produce an unbiased estimate of their causal influence, and a way of approximating gradient descent learning. Importantly, neither activity of upstream neurons, which act as confounders, nor downstream non-linearities bias the results. By introducing a local discontinuity with respect to their input drive, we show how spiking enables neurons to solve causal estimation and learning problems.
biorxiv neuroscience 100-200-users 2018Human Skeletal Muscle Possesses an Epigenetic Memory of Hypertrophy, Scientific Reports, 2018-01-24
It is unknown if adult human skeletal muscle has an epigenetic memory of earlier encounters with growth. We report, for the first time in humans, genome-wide DNA methylation (850,000 CpGs) and gene expression analysis after muscle hypertrophy (loading), return of muscle mass to baseline (unloading), followed by later hypertrophy (reloading). We discovered increased frequency of hypomethylation across the genome after reloading (18,816 CpGs) versus earlier loading (9,153 CpG sites). We also identified AXIN1, GRIK2, CAMK4, TRAF1 as hypomethylated genes with enhanced expression after loading that maintained their hypomethylated status even during unloading where muscle mass returned to control levels, indicating a memory of these genes methylation signatures following earlier hypertrophy. Further, UBR5, RPL35a, HEG1, PLA2G16, SETD3 displayed hypomethylation and enhanced gene expression following loading, and demonstrated the largest increases in hypomethylation, gene expression and muscle mass after later reloading, indicating an epigenetic memory in these genes. Finally, genes; GRIK2, TRAF1, BICC1, STAG1 were epigenetically sensitive to acute exercise demonstrating hypomethylation after a single bout of resistance exercise that was maintained 22 weeks later with the largest increase in gene expression and muscle mass after reloading. Overall, we identify an important epigenetic role for a number of largely unstudied genes in muscle hypertrophymemory.
scientific reports genetics 500+-users 2018