A comprehensive toolkit to enable MinION sequencing in any laboratory, bioRxiv, 2018-03-27
AbstractLong-read sequencing technologies are transforming our ability to assemble highly complex genomes. Realising their full potential relies crucially on extracting high quality, high molecular weight (HMW) DNA from the organisms of interest. This is especially the case for the portable MinION sequencer which potentiates all laboratories to undertake their own genome sequencing projects, due to its low entry cost and minimal spatial footprint. One challenge of the MinION is that each group has to independently establish effective protocols for using the instrument, which can be time consuming and costly. Here we present a workflow and protocols that enabled us to establish MinION sequencing in our own laboratories, based on optimising DNA extractions from a challenging plant tissue as a case study. Following the workflow illustrated we were able to reliably and repeatedly obtain > 8.5 Gb of long read sequencing data with a mean read length of 13 kb and an N50 of 26 kb. Our protocols are open-source and can be performed in any laboratory without special equipment. We also illustrate some more elaborate workflows which can increase mean and average read lengths if this is desired. We envision that our workflow for establishing MinION sequencing, including the illustration of potential pitfalls, will be useful to others who plan to establish long-read sequencing in their own laboratories.
biorxiv genomics 500+-users 2018Label-free prediction of three-dimensional fluorescence images from transmitted light microscopy, bioRxiv, 2018-03-27
Understanding living cells as integrated systems, a challenge central to modern biology, is complicated by limitations of available imaging methods. While fluorescence microscopy can resolve subcellular structure in living cells, it is expensive, slow, and damaging to cells. Here, we present a label-free method for predicting 3D fluorescence directly from transmitted light images and demonstrate that it can be used to generate multi-structure, integrated images.
biorxiv cell-biology 100-200-users 2018Large-scale neuroimaging and genetic study reveals genetic architecture of brain white matter microstructure, bioRxiv, 2018-03-26
AbstractMicrostructural changes of white matter (WM) tracts are known to be associated with various neuropsychiatric disordersdiseases. Heritability of structural changes of WM tracts has been examined using diffusion tensor imaging (DTI) in family-based studies for different age groups. The availability of genetic and DTI data from recent large population-based studies offers opportunity to further improve our understanding of genetic contributions. Here, we analyzed the genetic architecture of WM tracts using DTI and single-nucleotide polymorphism (SNP) data of unrelated individuals in the UK Biobank (n ∼ 8000). The DTI parameters were generated using the ENIGMA-DTI pipeline. We found that DTI parameters are substantially heritable on most WM tracts. We observed a highly polygenic or omnigenic architecture of genetic influence across the genome as well as the enrichment of SNPs in active chromatin regions. Our bivariate analyses showed strong genetic correlations for several pairs of WM tracts as well as pairs of DTI parameters. We performed voxel-based analysis to illustrate the pattern of genetic effects on selected parts of the tract-based spatial statistics skeleton. Comparing the estimates from the UK Biobank to those from small population-based studies, we illustrated that sufficiently large sample size is essential for genetic architecture discovery in imaging genetics. We confirmed this finding with a simulation study.
biorxiv genetics 100-200-users 2018Ancient Fennoscandian genomes reveal origin and spread of Siberian ancestry in Europe, bioRxiv, 2018-03-22
AbstractEuropean history has been shaped by migrations of people, and their subsequent admixture. Recently, evidence from ancient DNA has brought new insights into migration events that could be linked to the advent of agriculture, and possibly to the spread of Indo-European languages. However, little is known so far about the ancient population history of north-eastern Europe, in particular about populations speaking Uralic languages, such as Finns and Saami. Here we analyse ancient genomic data from 11 individuals from Finland and Northwest Russia. We show that the specific genetic makeup of northern Europe traces back to migrations from Siberia that began at least 3,500 years ago. This ancestry was subsequently admixed into many modern populations in the region, in particular populations speaking Uralic languages today. In addition, we show that ancestors of modern Saami inhabited a larger territory during the Iron Age than today, which adds to historical and linguistic evidence for the population history of Finland.
biorxiv genomics 0-100-users 2018Marionette E. coli containing 12 highly-optimized small molecule sensors, bioRxiv, 2018-03-21
Cellular processes are carried out by many interacting genes and their study and optimization requires multiple levers by which they can be independently controlled. The most common method is via a genetically-encoded sensor that responds to a small molecule (an “inducible system”). However, these sensors are often suboptimal, exhibiting high background expression and low dynamic range. Further, using multiple sensors in one cell is limited by cross-talk and the taxing of cellular resources. Here, we have developed a directed evolution strategy to simultaneously select for less background, high dynamic range, increased sensitivity, and low crosstalk. Libraries of the regulatory protein and output promoter are built based on random and rationally-guided mutations. This is applied to generate a set of 12 high-performance sensors, which exhibit >100-fold induction with low background and cross-reactivity. These are combined to build a single “sensor array” and inserted into the genomes of E. coli MG1655 (wild-type), DH10B (cloning), and BL21 (protein expression). These “Marionette” strains allow for the independent control of gene expression using 2,4-diacetylphophloroglucinol (DAPG), cuminic acid (Cuma), 3-oxohexanoyl-homoserine lactone (OC6), vanillic acid (Van), isopropyl β-D-1-thiogalactopyranoside (IPTG), anhydrotetracycline (aTc), L-arabinose (Ara), choline chloride (Cho), naringenin (Nar), 3,4-dihydroxybenzoic acid (DHBA), sodium salicylate (Sal), and 3-hydroxytetradecanoyl-homoserine lactone (OHC14).
biorxiv synthetic-biology 0-100-users 2018Profiling of pluripotency factors in individual stem cells and early embryos, bioRxiv, 2018-03-21
SUMMARYMajor cell fate decisions are governed by sequence-specific transcription factors (TFs) that act in small cell populations within developing embryos. To understand how TFs regulate cell fate it is important to identify their genomic binding sites in these populations. However, current methods cannot profile TFs genome-wide at or near the single cell level. Here we adapt the CUT&RUN method to profile chromatin proteins in low cell numbers, mapping TF-DNA interactions in single cells and individual pre-implantation embryos for the first time. Using this method, we demonstrate that the pluripotency TF NANOG is significantly more dependent on the SWISNF family ATPase BRG1 for association with its genomic targets in vivo than in cultured cells—a finding that could not have been made using traditional approaches. Ultra-low input CUT&RUN (uliCUT&RUN) enables interrogation of TF binding from low cell numbers, with broad applicability to rare cell populations of importance in development or disease.
biorxiv genomics 0-100-users 2018