biorxiv | audiences

Common DNA sequence variation influences 3-dimensional conformation of the human genome, bioRxiv, 2019-03-30

ABSTRACTThe 3-dimensional (3D) conformation of chromatin inside the nucleus is integral to a variety of nuclear processes including transcriptional regulation, DNA replication, and DNA damage repair. Aberrations in 3D chromatin conformation have been implicated in developmental abnormalities and cancer. Despite the importance of 3D chromatin conformation to cellular function and human health, little is known about how 3D chromatin conformation varies in the human population, or whether DNA sequence variation between individuals influences 3D chromatin conformation. To address these questions, we performed Hi-C on Lymphoblastoid Cell Lines (LCLs) from 20 individuals. We identified thousands of regions across the genome where 3D chromatin conformation varies between individuals and found that this conformational variation is often accompanied by variation in gene expression, histone modifications, and transcription factor (TF) binding. Moreover, we found that DNA sequence variation influences several features of 3D chromatin conformation including loop strength, contact insulation, contact directionality and density of localciscontacts. We mapped hundreds of Quantitative Trait Loci (QTLs) associated with 3D chromatin features and found evidence that some of these same variants are associated at modest levels with other molecular phenotypes as well as complex disease risk. Our results demonstrate that common DNA sequence variants can influence 3D chromatin conformation, pointing to a more pervasive role for 3D chromatin conformation in human phenotypic variation than previously recognized.

biorxiv genomics 0-100-users 2019

ezTrack An open-source video analysis pipeline for the investigation of animal behavior, bioRxiv, 2019-03-30

AbstractTracking small animal behavior by video is one of the most common tasks in the fields of neuroscience and psychology. Although commercial software exists for the execution of this task, commercial software often presents enormous cost to the researcher, and can also entail purchasing specific hardware setups that are not only expensive but lack adaptability. Moreover, the inaccessibility of the code underlying this software renders them inflexible. Alternatively, available open source options frequently require extensive model training and can be challenging for those inexperienced with programming. Here we present an open source and platform independent set of behavior analysis pipelines using interactive Python (iPythonJupyter Notebook) that researchers with no prior programming experience can use. Two modules are described. One module can be used for the positional analysis of an individual animal across a session (i.e., location tracking), amenable to a wide range of behavioral tasks including conditioned place preference, water maze, light-dark box, open field, and elevated plus maze, to name but a few. A second module is described for the analysis of conditioned freezing behavior. For both modules, a range of interactive plots and visualizations are available to confirm that chosen parameters produce results that conform to the user’s approval. In addition, batch processing tools for the fast analysis of multiple videos is provided, and frame-by-frame output makes aligning the data with neural recording data simple. Lastly, options for cropping video frames to mitigate the influence of fiberopticelectrophysiology cables, analyzing specified portions of time in a video, and defining regions of interest, can be implemented with ease.

biorxiv neuroscience 200-500-users 2019

MARGO (Massively Automated Real-time GUI for Object-tracking), a platform for high-throughput ethology, bioRxiv, 2019-03-30

AbstractFast object tracking in real time allows convenient tracking of very large numbers of animals and closed-loop experiments that control stimuli for multiple animals in parallel. We developed MARGO, a real-time animal tracking suite for custom behavioral experiments. We demonstrated that MARGO can rapidly and accurately track large numbers of animals in parallel over very long timescales. We incorporated control of peripheral hardware, and implemented a flexible software architecture for defining new experimental routines. These features enable closed-loop delivery of stimuli to many individuals simultaneously. We highlight MARGO’s ability to coordinate tracking and hardware control with two custom behavioral assays (measuring phototaxis and optomotor response) and one optogenetic operant conditioning assay. There are currently several open source animal trackers. MARGO’s strengths are 1) robustness, 2) high throughput, 3) flexible control of hardware and 4) real-time closed-loop control of sensory and optogenetic stimuli, all of which are optimized for large-scale experimentation.

biorxiv neuroscience 100-200-users 2019

Single Nucleotide Mapping of the Locally Accessible Trait Space in Yeast Reveals Pareto Fronts that Constrain Initial Adaptation, bioRxiv, 2019-03-30

AbstractTradeoffs constrain the improvement of performance of multiple traits simultaneously. Such tradeoffs define Pareto fronts, which represent a set of optimal individuals that cannot be improved in any one trait without reducing performance in another. Surprisingly, experimental evolution often yields genotypes with improved performance in all measured traits, perhaps indicating an absence of tradeoffs at least in the short-term. Here we densely sample adaptive mutations in S. cerevisiae to ask whether first-step adaptive mutations result in tradeoffs during the growth cycle. We isolated thousands of adaptive clones evolved under carefully chosen conditions and quantified their performances in each part of the growth cycle. We too find that some first-step adaptive mutations can improve all traits to a modest extent. However, our dense sampling allowed us to identify tradeoffs and establish the existence of Pareto fronts between fermentation and respiration, and between respiration and stationary phases. Moreover, we establish that no single mutation in the ancestral genome can circumvent the detected tradeoffs. Finally, we sequenced hundreds of these adaptive clones, revealing novel targets of adaptation and defining the genetic basis of the identified tradeoffs.

biorxiv evolutionary-biology 100-200-users 2019

An approximate full-likelihood method for inferring selection and allele frequency trajectories from DNA sequence data, bioRxiv, 2019-03-29

AbstractMost current methods for detecting natural selection from DNA sequence data are limited in that they are either based on summary statistics or a composite likelihood, and as a consequence, do not make full use of the information available in DNA sequence data. We here present a new importance sampling approach for approximating the full likelihood function for the selection coefficient. The method treats the ancestral recombination graph (ARG) as a latent variable that is integrated out using previously published Markov Chain Monte Carlo (MCMC) methods. The method can be used for detecting selection, estimating selection coefficients, testing models of changes in the strength of selection, estimating the time of the start of a selective sweep, and for inferring the allele frequency trajectory of a selected or neutral allele. We perform extensive simulations to evaluate the method and show that it uniformly improves power to detect selection compared to current popular methods such as nSL and SDS, under various demographic models and can provide reliable inferences of allele frequency trajectories under many conditions. We also explore the potential of our method to detect extremely recent changes in the strength of selection. We use the method to infer the past allele frequency trajectory for a lactase persistence SNP (MCM6) in Europeans. We also study a set of 11 pigmentation-associated variants. Several genes show evidence of strong selection particularly within the last 5,000 years, including ASIP, KITLG, and TYR. However, selection on OCA2HERC2 seems to be much older and, in contrast to previous claims, we find no evidence of selection on TYRP1.Author summaryCurrent methods to study natural selection using modern population genomic data are limited in their power and flexibility. Here, we present a new method to infer natural selection that builds on recent methodological advances in estimating genome-wide genealogies. By using importance sampling we are able to efficiently estimate the likelihood function of the selection coefficient. We show our method improves power to test for selection over competing methods across a diverse range of scenarios, and also accurately infers the selection coefficient. We also demonstrate a novel capability of our model, using it to infer the allele’s frequency over time. We validate these results with a study of a lactase persistence SNP in Europeans, and also study a set of 11 pigmentation-associated variants.

biorxiv genetics 100-200-users 2019

Polygenic architecture of human neuroanatomical diversity, bioRxiv, 2019-03-29

AbstractWe analysed the genomic architecture of neuroanatomical diversity using magnetic resonance imaging and SNP data from > 20,000 individuals. Our results replicate previous findings of a strong polygenic architecture of neuroanatomical diversity. SNPs captured from 40% to 54% of the variance in the volume of different brain regions. We observed a large correlation between chromosome length and the amount of phenotypic variance captured, r∼0.64 on average, suggesting that at a global scale causal variants are homogeneously distributed across the genome. At a more local scale, SNPs within genes (∼51%) captured ∼1.5-times more genetic variance than the rest; and SNPs with low minor allele frequency (MAF) captured significantly less variance than those with higher MAF the 40% of SNPs with MAF<5% captured less than one fourth of the genetic variance. We also observed extensive pleiotropy across regions, with an average genetic correlation of rG∼0.45. Across regions, genetic correlations were in general similar to phenotypic correlations. By contrast, genetic correlations were larger than phenotypic correlations for the leftright volumes of the same region, and indistinguishable from 1. Additionally, the differences in leftright volumes were not heritable, underlining the role of environmental causes in the variability of brain asymmetry. Our analysis code is available at <jatsext-link xmlnsxlink=httpwww.w3.org1999xlink ext-link-type=uri xlinkhref=httpsgithub.comneuroanatomygenomic-architecture>httpsgithub.comneuroanatomygenomic-architecture<jatsext-link>.

biorxiv neuroscience 0-100-users 2019