High accuracy DNA sequencing on a small, scalable platform via electrical detection of single base incorporations, bioRxiv, 2019-04-16
AbstractHigh throughput DNA sequencing technologies have undergone tremendous development over the past decade. Although optical detection-based sequencing has constituted the majority of data output, it requires a large capital investment and aggregation of samples to achieve optimal cost per sample. We have developed a novel electronic detection-based platform capable of accurately detecting single base incorporations. The GenapSys technology with its electronic detection modality allows the system to be compact, accessible, and affordable. We demonstrate the performance of the system by sequencing several different microbial genomes with varying GC content. The platform is capable of generating 1.5 Gb of high-quality nucleic acid sequence in a single run. We routinely generate sequence data that exceeds 99% raw accuracy with read lengths of up to 175 bp. The utility of the platform is highlighted by targeted sequencing of the human genome. We show high concordance of SNP detection on the human NA12878 HapMap cell line with data generated on the Illumina sequencing platform. In addition, we sequenced a targeted panel of cancer-associated genes in a well characterized reference standard. With multiple library preparation approaches on this sample, we were able to identify low frequency mutations at expected allele frequencies.
biorxiv genomics 100-200-users 2019nf-core Community curated bioinformatics pipelines, bioRxiv, 2019-04-16
AbstractThe standardization, portability, and reproducibility of analysis pipelines is a renowned problem within the bioinformatics community. Most pipelines are designed for execution on-premise, and the associated software dependencies are tightly coupled with the local compute environment. This leads to poor pipeline portability and reproducibility of the ensuing results - both of which are fundamental requirements for the validation of scientific findings. Here, we introduce nf-core a framework that provides a community-driven, peer-reviewed platform for the development of best practice analysis pipelines written in Nextflow. Key obstacles in pipeline development such as portability, reproducibility, scalability and unified parallelism are inherently addressed by all nf-core pipelines. We are also continually developing a suite of tools that assist in the creation and development of both new and existing pipelines. Our primary goal is to provide a platform for high-quality, reproducible bioinformatics pipelines that can be utilized across various institutions and research facilities.
biorxiv bioinformatics 100-200-users 2019Protein engineering expands the effector recognition profile of a rice NLR immune receptor, bioRxiv, 2019-04-16
AbstractPlant NLR receptors detect pathogen effectors and initiate an immune response. Since their discovery, NLRs have been the focus of protein engineering to improve disease resistance. However, this has proven challenging, in part due to their narrow response specificity. Here, we used structure-guided engineering to expand the response profile of the rice NLR Pikp to variants of the rice blast pathogen effector AVR-Pik. A mutation located within an effector binding interface of the integrated Pikp-HMA domain increased the binding affinity for AVR-Pik variants in vitro and in vivo. This translates to an expanded cell death response to AVR-Pik variants previously unrecognized by Pikp in planta. Structures of the engineered Pikp-HMA in complex with AVR-Pik variants revealed the mechanism of expanded recognition. These results provide a proof-of-concept that protein engineering can improve the utility of plant NLR receptors where direct interaction between effectors and NLRs is established, particularly via integrated domains.
biorxiv plant-biology 0-100-users 2019Scale-free Vertical Tracking Microscopy Towards Bridging Scales in Biological Oceanography, bioRxiv, 2019-04-16
AbstractUnderstanding key biophysical phenomena in the ocean often requires one to simultaneously focus on microscale entities, such as motile plankton and sedimenting particles, while maintaining the macroscale context of vertical transport in a highly stratified environment. This poses a conundrum How to measure single organisms, at microscale resolution, in the lab, while allowing them to freely move hundreds of meters in the vertical direction? We present a solution in the form of a scale-free, vertical tracking microscope based on a circular “hydrodynamic-treadmill”. Our technology allows us to transcend physiological and ecological scales, tracking organisms from marine zooplankton to single-cells over vertical scales of meters while resolving microflows and behavioral processes. We demonstrate measurements of sinking particles, including marine snow as they sediment tens of meters while capturing sub-particle-scale phenomena. We also demonstrate depth-patterned virtual-reality environments for novel behavioral analyses of microscale plankton. This technique offers a new experimental paradigm in microscale ocean biophysics by combining physiological-scale imaging with free movement in an ecological-scale patterned environment.One sentence summaryScale-free vertical tracking microscopy captures, for the first time, untethered behavioral dynamics at cellular resolution for marine plankton.
biorxiv biophysics 100-200-users 2019Rare variants contribute disproportionately to quantitative trait variation in yeast, bioRxiv, 2019-04-15
AbstractA detailed understanding of the sources of heritable variation is a central goal of modern genetics. Genome-wide association studies (GWAS) in humans1 have implicated tens of thousands of DNA sequence variants in disease risk and quantitative trait variation, but these variants fail to account for the entire heritability of diseases and traits. GWAS have by design focused on common DNA sequence variants; however, recent studies underscore the likely importance of the contribution of rare variants to heritable variation2. Further, finding the genes that underlie the GWAS signals remains a major challenge. Here, we use a unique model system to disentangle the contributions of common and rare variants to a large number of quantitative traits. We generated large crosses among 16 diverse yeast strains and identified thousands of quantitative trait loci (QTLs) that explain most of the heritable variation in 38 traits. We combined our results with sequencing data for 1,011 yeast isolates3 to decouple variant effect size estimation from allele frequency and showed that rare variants make a disproportionate contribution to trait variation as a consequence of their larger effect sizes. Evolutionary analyses revealed that this contribution is driven by rare variants that arose recently, that such variants are more likely to decrease fitness, and that negative selection has shaped the relationship between variant frequency and effect size. Finally, we leveraged the structure of the crosses to resolve hundreds of QTLs to single genes. These results refine our understanding of trait variation at the population level and suggest that studies of rare variants are a fertile ground for discovery of genetic effects.
biorxiv genetics 100-200-users 2019Generation of viral vectors specific to neuronal subtypes of targeted brain regions by Enhancer-Driven Gene Expression (EDGE), bioRxiv, 2019-04-14
SummaryUnderstanding brain function requires understanding neural circuits at the level of specificity at which they operate. While recent years have seen the development of a variety of remarkable molecular tools for the study of neural circuits, their utility is currently limited by the inability to deploy them in specific elements of native neural circuits, i.e. particular neuronal subtypes. One can obtain a degree of specificity with neuron-specific promoters, but native promoters are almost never sufficiently specific restricting this approach to transgenic animals. We recently showed that one can obtain transgenic mice with augmented anatomical specificity in targeted brain regions by identifying cis-regulatory elements (i.e. enhancers) uniquely active in those brain regions and combining them with a heterologous promoter, an approach we call EDGE (Enhancer-Driven Gene Expression). Here we extend this strategy to the generation of viral (rAAV) vectors, showing that when combined with the right minimal promoter they largely recapitulate the specificity seen in the corresponding transgenic lines in wildtype animals, even of another species. Because active enhancers can be identified in any tissue sample, this approach promises to enable the kind of circuit-specific manipulations in any species. This should not only greatly enhance our understanding of brain function, but may one day even provide novel therapeutic avenues to correct the imbalances in neural circuits underlying many disorders of the brain.
biorxiv neuroscience 0-100-users 2019