Low rate of somatic mutations in a long-lived oak tree, bioRxiv, 2017-06-14
Because plants do not possess a proper germline, deleterious somatic mutations can be passed to gametes and a large number of cell divisions separating zygote from gamete formation in long-lived plants may lead to many mutations. We sequenced the genome of two terminal branches of a 234-year-old oak tree and found few fixed somatic single-nucleotide variants (SNVs), whose sequential appearance in the tree could be traced along nested sectors of younger branches. Our data suggest that stem cells of shoot meristems are robustly protected from accumulation of mutations in trees.
biorxiv plant-biology 200-500-users 2017Regional missense constraint improves variant deleteriousness prediction, bioRxiv, 2017-06-13
AbstractGiven increasing numbers of patients who are undergoing exome or genome sequencing, it is critical to establish tools and methods to interpret the impact of genetic variation. While the ability to predict deleteriousness for any given variant is limited, missense variants remain a particularly challenging class of variation to interpret, since they can have drastically different effects depending on both the precise location and specific amino acid substitution of the variant. In order to better evaluate missense variation, we leveraged the exome sequencing data of 60,706 individuals from the Exome Aggregation Consortium (ExAC) dataset to identify sub-genic regions that are depleted of missense variation. We further used this depletion as part of a novel missense deleteriousness metric named MPC. We applied MPC to de novo missense variants and identified a category of de novo missense variants with the same impact on neurodevelopmental disorders as truncating mutations in intolerant genes, supporting the value of incorporating regional missense constraint in variant interpretation.
biorxiv genomics 100-200-users 2017A single-cell anatomical blueprint for intracortical information transfer from primary visual cortex, bioRxiv, 2017-06-10
The wiring diagram of the neocortex determines how information is processed across dozens of cortical areas. Each area communicates with multiple others via extensive long-range axonal projections 1–6, but the logic of inter-area information transfer is unresolved. Specifically, the extent to which individual neurons send dedicated projections to single cortical targets or distribute their signals across multiple areas remains unclear5,7–20. Distinguishing between these possibilities has been challenging because axonal projections of only a few individual neurons have been reconstructed. Here we map the projection patterns of axonal arbors from 591 individual neurons in mouse primary visual cortex (V1) using two complementary methods whole-brain fluorescence-based axonal tracing21,22 and high-throughput DNA sequencing of genetically barcoded neurons (MAPseq)23. Although our results confirm the existence of dedicated projections to certain cortical areas, we find these are the exception, and that the majority of V1 neurons broadcast information to multiple cortical targets. Furthermore, broadcasting cells do not project to all targets randomly, but rather comprise subpopulations that either avoid or preferentially innervate specific subsets of cortical areas. Our data argue against a model of dedicated lines of intracortical information transfer via “one neuron – one target area” mapping. Instead, long-range communication between a sensory cortical area and its targets may be based on a principle whereby individual neurons copy information to, and potentially coordinate activity across, specific subsets of cortical areas.
biorxiv neuroscience 100-200-users 2017Graphtyper Population-scale genotyping using pangenome graphs, bioRxiv, 2017-06-10
AbstractA fundamental requisite for genetic studies is an accurate determination of sequence variation. While human genome sequence diversity is increasingly well characterized, there is a need for efficient ways to utilize this knowledge in sequence analysis. Here we present Graphtyper, a publicly available novel algorithm and software for discovering and genotyping sequence variants. Graphtyper realigns short-read sequence data to a pangenome, a variation-aware graph structure that encodes sequence variation within a population by representing possible haplotypes as graph paths. Our results show that Graphtyper is fast, highly scalable, and provides sensitive and accurate genotype calls. Graphtyper genotyped 89.4 million sequence variants in whole-genomes of 28,075 Icelanders using less than 100,000 CPU days, including detailed genotyping of six human leukocyte antigen (HLA) genes. We show that Graphtyper is a valuable tool in characterizing sequence variation in population-scale sequencing studies.
biorxiv bioinformatics 0-100-users 2017Indexcov fast coverage quality control for whole-genome sequencing, bioRxiv, 2017-06-10
AbstractThe BAM1 and CRAM2 formats provide a supplementary linear index that facilitates rapid access to sequence alignments in arbitrary genomic regions. Comparing consecutive entries in a BAM or CRAM index allows one to infer the number of alignment records per genomic region for use as an effective proxy of sequence depth in each genomic region. Based on these properties, we have developed indexcov, an efficient estimator of whole-genome sequencing coverage to rapidly identify samples with aberrant coverage profiles, reveal large scale chromosomal anomalies, recognize potential batch effects, and infer the sex of a sample. Indexcov is available at <jatsext-link xmlnsxlink=httpwww.w3.org1999xlink ext-link-type=uri xlinkhref=httphttpsgithub.combrentpgoleft>httpsgithub.combrentpgoleft<jatsext-link> under the MIT license.
biorxiv genomics 100-200-users 2017Quantifying the impact of rare and ultra-rare coding variation across the phenotypic spectrum, bioRxiv, 2017-06-10
AbstractThere is a limited understanding about the impact of rare protein truncating variants across multiple phenotypes. We explore the impact of this class of variants on 13 quantitative traits and 10 diseases using whole-exome sequencing data from 100,296 individuals. Protein truncating variants in genes intolerant to this class of mutations increased risk of autism, schizophrenia, bipolar disorder, intellectual disability, ADHD. In individuals without these disorders, there was an association with shorter height, lower education, increased hospitalization and reduced age. Gene sets implicated from GWAS did not show a significant protein truncating variants-burden beyond what captured by established Mendelian genes. In conclusion, we provide the most thorough investigation to date of the impact of rare deleterious coding variants on complex traits, suggesting widespread pleiotropic risk.Main abbreviations<jatsdef-list><jatsdef-item>PTV= Protein Truncating Variants<jatsdef-item><jatsdef-item>PI= Protein Truncating Intolerant<jatsdef-item><jatsdef-item>PI-PTV= Protein Truncating Variant in genes that are Intolerant to Protein Truncating Variants<jatsdef-item><jatsdef-list>
biorxiv genetics 100-200-users 2017