Modified penetrance of coding variants by cis-regulatory variation contributes to disease risk, Nature Genetics, 2018-08-10
Coding variants represent many of the strongest associations between genotype and phenotype; however, they exhibit inter-individual differences in effect, termed ‘variable penetrance’. Here, we study how cis-regulatory variation modifies the penetrance of coding variants. Using functional genomic and genetic data from the Genotype-Tissue Expression Project (GTEx), we observed that in the general population, purifying selection has depleted haplotype combinations predicted to increase pathogenic coding variant penetrance. Conversely, in cancer and autism patients, we observed an enrichment of penetrance increasing haplotype configurations for pathogenic variants in disease-implicated genes, providing evidence that regulatory haplotype configuration of coding variants affects disease risk. Finally, we experimentally validated this model by editing a Mendelian single-nucleotide polymorphism (SNP) using CRISPRCas9 on distinct expression haplotypes with the transcriptome as a phenotypic readout. Our results demonstrate that joint regulatory and coding variant effects are an important part of the genetic architecture of human traits and contribute to modified penetrance of disease-causing variants.
nature genetics genetics 500+-users 2018Moving beyond P values Everyday data analysis with estimation plots, bioRxiv, 2018-07-26
Over the past 75 years, a number of statisticians have advised that the data-analysis method known as null-hypothesis significance testing (NHST) should be deprecated (Berkson, 1942; Halsey et al., 2015; Wasserstein et al., 2019). The limitations of NHST have been extensively discussed, with a broad consensus that current statistical practice in the biological sciences needs reform. However, there is less agreement on reform’s specific nature, with vigorous debate surrounding what would constitute a suitable alternative (Altman et al., 2000; Benjamin et al., 2017; Cumming and Calin-Jageman, 2016). An emerging view is that a more complete analytic technique would use statistical graphics to estimate effect sizes and evaluate their uncertainty (Cohen, 1994; Cumming and Calin-Jageman, 2016). As these estimation methods require only minimal statistical retraining, they have great potential to shift the current data-analysis culture away from dichotomous thinking towards quantitative reasoning (Claridge-Chang and Assam, 2016). The evolution of statistics has been inextricably linked to the development of quantitative displays that support complex visual reasoning (Tufte, 2001). We consider that the graphic we describe here as estimation plot is the most intuitive way to display the complete statistical information about experimental data sets. However, a major obstacle to adopting estimation plots is accessibility to suitable software. To lower this hurdle, we have developed free software that makes high-quality estimation plotting available to all. Here, we explain the rationale for estimation plots by contrasting them with conventional charts used to display data with NHST results, and describe how the use of these graphs affords five major analytical advantages.
biorxiv bioinformatics 500+-users 2018Panoptic vDISCO imaging reveals neuronal connectivity, remote trauma effects and meningeal vessels in intact transparent mice, bioRxiv, 2018-07-23
Analysis of entire transparent rodent bodies could provide holistic information on biological systems in health and disease. However, it has been challenging to reliably image and quantify signal from endogenously expressed fluorescent proteins in large cleared mouse bodies due to the low signal contrast. Here, we devised a pressure driven, nanobody based whole-body immunolabeling technology to enhance the signal of fluorescent proteins by up to two orders of magnitude. This allowed us to image subcellular details in transparent mouse bodies through bones and highly autofluorescent tissues, and perform quantifications. We visualized for the first-time whole-body neuronal connectivity of an entire adult mouse and discovered that brain trauma induces degeneration of peripheral axons. We also imaged meningeal lymphatic vessels and immune cells through the intact skull and vertebra in naive animals and trauma models. Thus, our new approach can provide an unbiased holistic view of biological events affecting the nervous system and the rest of the body.
biorxiv neuroscience 500+-users 2018Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals, Nature Genetics, 2018-07-20
Here we conducted a large-scale genetic association analysis of educational attainment in a sample of approximately 1.1 million individuals and identify 1,271 independent genome-wide-significant SNPs. For the SNPs taken together, we found evidence of heterogeneous effects across environments. The SNPs implicate genes involved in brain-development processes and neuron-to-neuron communication. In a separate analysis of the X chromosome, we identify 10 independent genome-wide-significant SNPs and estimate a SNP heritability of around 0.3% in both men and women, consistent with partial dosage compensation. A joint (multi-phenotype) analysis of educational attainment and three related cognitive phenotypes generates polygenic scores that explain 11–13% of the variance in educational attainment and 7–10% of the variance in cognitive performance. This prediction accuracy substantially increases the utility of polygenic scores as tools in research.
nature genetics genetics 500+-users 2018Entomophthovirus An insect-derived iflavirus that infects a behavior manipulating fungal pathogen of dipterans, bioRxiv, 2018-07-18
AbstractWe discovered a virus infecting Entomophthora muscae, a behavior-manipulating fungal pathogen of dipterans. The virus, which we name Entomophthovirus, is a capsid-forming, positive-strand RNA virus in the viral family iflaviridae, whose known members almost exclusively infect insects. We show that the virus RNA is expressed at high levels in fungal cells in vitro and during in vivo infections of Drosophila melanogaster, and that virus particles are present in E. muscae. Two close relatives of the virus had been previously described as insect viruses based on the presence of viral genomes in transcriptomes assembled from RNA extracted from wild dipterans. By analyzing sequencing data from these earlier reports, we show that both dipteran samples were co-infected with E. muscae. We also find the virus in RNA sequencing data from samples of two other species of dipterans, Musca domestica and Delia radicum, known to be infected with E. muscae. These data establish that Entomophthovirus is widely, and seemingly obligately, associated with E. muscae. As other members of the iflaviridae cause behavioral changes in insects, we speculate on the possibility that Entomophthovirus plays a role in E. muscae involved host manipulation.
biorxiv microbiology 500+-users 2018