Variability of bacterial behavior in the mammalian gut captured using a growth-linked single-cell synthetic gene oscillator, bioRxiv, 2018-11-17
AbstractThe dynamics of the bacterial population that comprises the gut microbiota plays key roles in overall mammalian health. However, a detailed understanding of bacterial growth within the gut is limited by the inherent complexity and inaccessibility of the gut environment. Here, we deploy an improved synthetic genetic oscillator to investigate dynamics of bacterial colonization and growth in the mammalian gut under both healthy and disease conditions. The synthetic oscillator, when introduced into both Escherichia coli and Salmonella Typhimurium maintains regular oscillations with a constant period in generations across growth conditions. We determine the phase of oscillation from individual bacteria using image analysis of resultant colonies and thereby infer the number of cell divisions elapsed. In doing so, we demonstrate robust functionality and controllability of the oscillator circuit’s activity during bacterial growth in vitro, in a simulated murine gut microfluidic environment, and in vivo within the mouse gut. We determine different dynamics of bacterial colonization and growth in the gut under normal and inflammatory conditions. Our results show that a precise genetic oscillator can function in a complex environment and reveal single cell behavior under diverse conditions where disease may create otherwise impossible-to-quantify variability in growth across the population.
biorxiv synthetic-biology 0-100-users 2018Epigenetically reprogrammed methylation landscape drives the DNA self-assembly and serves as a universal cancer biomarker, Nature Communications, 2018-11-15
Epigenetic reprogramming in cancer genomes creates a distinct methylation landscape encompassing clustered methylation at regulatory regions separated by large intergenic tracks of hypomethylated regions. This methylation landscape that we referred to as Methylscape is displayed by most cancer types, thus may serve as a universal cancer biomarker. To-date most research has focused on the biological consequences of DNA Methylscape changes whereas its impact on DNA physicochemical properties remains unexplored. Herein, we examine the effect of levels and genomic distribution of methylcytosines on the physicochemical properties of DNA to detect the Methylscape biomarker. We find that DNA polymeric behaviour is strongly affected by differential patterning of methylcytosine, leading to fundamental differences in DNA solvation and DNA-gold affinity between cancerous and normal genomes. We exploit these Methylscape differences to develop simple, highly sensitive and selective electrochemical or colorimetric one-step assays for the detection of cancer. These assays are quick, i.e., analysis time ≤10 minutes, and require minimal sample preparation and small DNA input.
nature communications genetics 200-500-users 2018Scaling computational genomics to millions of individuals with GPUs, bioRxiv, 2018-11-14
Current genomics methods were designed to handle tens to thousands of samples, but will soon need to scale to millions to keep up with the pace of data and hypothesis generation in biomedical science. Moreover, costs associated with processing these growing datasets will become prohibitive without improving the computational efficiency and scalability of methods. Here, we show that recently developed machine-learning libraries (TensorFlow and PyTorch) facilitate implementation of genomics methods for GPUs and significantly accelerate computations. To demonstrate this, we re-implemented methods for two commonly performed computational genomics tasks QTL mapping and Bayesian non-negative matrix factorization. Our implementations ran > 200 times faster than current CPU-based versions, and these analyses are ~5-10 fold cheaper on GPUs due to the vastly shorter runtimes. We anticipate that the accessibility of these libraries, and the improvements in run-time will lead to a transition to GPU-based implementations for a wide range of computational genomics methods.
biorxiv genomics 200-500-users 2018Unbiased detection of CRISPR off-targets in vivo using DISCOVER-Seq, bioRxiv, 2018-11-14
AbstractGenome editing using nucleases such as CRISPR-Cas induces programmable DNA damage at a target genomic site but can also affect off-target sites. Here, we develop a powerful, sensitive assay for the unbiased identification of off-target sites that we term DISCOVER-Seq. This approach takes advantage of the recruitment of endogenous DNA repair factors for genome-wide identification of Cas-induced double-strand breaks. One such factor, MRE11, is recruited precisely to double-strand breaks, enabling molecular characterization of nuclease cut sites with single-base resolution. DISCOVER-Seq detects off-targets in cellular models and in vivo upon adenoviral gene editing of mouse livers, paving the way for real-time off-target discovery during therapeutic gene editing. DISCOVER-Seq is furthermore applicable to multiple types of Cas nucleases and provides an unprecedented view of events that precede repair of the affected sites.
biorxiv molecular-biology 0-100-users 2018Assembly of a pan-genome from deep sequencing of 910 humans of African descent, Nature Genetics, 2018-11-13
We used a deeply sequenced dataset of 910 individuals, all of African descent, to construct a set of DNA sequences that is present in these individuals but missing from the reference human genome. We aligned 1.19 trillion reads from the 910 individuals to the reference genome (GRCh38), collected all reads that failed to align, and assembled these reads into contiguous sequences (contigs). We then compared all contigs to one another to identify a set of unique sequences representing regions of the African pan-genome missing from the reference genome. Our analysis revealed 296,485,284 bp in 125,715 distinct contigs present in the populations of African descent, demonstrating that the African pan-genome contains ~10% more DNA than the current human reference genome. Although the functional significance of nearly all of this sequence is unknown, 387 of the novel contigs fall within 315 distinct protein-coding genes, and the rest appear to be intergenic.
nature genetics genetics 500+-users 2018Single particle cryo-EM reconstruction of 52 kDa streptavidin at 3.2 Angstrom resolution, bioRxiv, 2018-11-13
AbstractThe fast development of single particle cryo-EM has made it more feasible to obtain the 3D structure of well-behaved macromolecules with molecular weight higher than 300 kDa at ~3 Å resolution. It remains a challenge to obtain high resolution structure of molecules smaller than 100 kDa using single particle cryo-EM, mainly due to the low contrast of the molecules embedded in vitreous ice. In this work, we applied the Cs-corrector-VPP coupled cryo-EM to study 52 kDa streptavidin (SA) protein supported on a thin layer of graphene film and embedded in vitreous ice. We were able to solve both the apo-SA and biotin-bound SA at near-atomic resolution using single particle cryo-EM. We demonstrated that the method is capable to determine the structure of molecule as small as 39 kDa and potentially even smaller molecules. Furthermore, we found that using the graphene film to avoid the adsorption to the air-water interface is critical to maintain the protein’s high-resolution structural information.
biorxiv biophysics 0-100-users 2018