Genome sequence of the cluster root forming white lupin, bioRxiv, 2019-07-19
White lupin (Lupinus albus L.) is a legume that produces seeds recognized for their high protein content and good nutritional value (lowest glycemic index of all grains, high dietary fiber content, and zero gluten or starch)1–5. White lupin can form nitrogen-fixing nodules but has lost the ability to form mycorrhizal symbiosis with fungi6. Nevertheless, its root system is well adapted to poor soils it produces cluster roots, constituted of dozens of determinate lateral roots that improve soil exploration and phosphate remobilization7. As phosphate is a limited resource that comes from rock reserves8, the production of cluster roots is a trait of interest to improve fertilizers efficiency. Using long reads sequencing technologies, we provide a high-quality genome sequence of a modern variety of white lupin (2n=50, 451 Mb), as well as de novo assemblies of a landrace and a wild relative. We describe how domestication impacted soil exploration capacity through the early establishment of lateral and cluster roots. We identify the APETALA2 transcription factor LaPUCHI-1, homolog of the Arabidopsis morphogenesis coordinator9, as a potential regulator of this trait. Our high-quality genome and companion genomic and transcriptomic resources enable the development of modern breeding strategies to increase and stabilize yield and to develop new varieties with reduced allergenic properties (caused by conglutins10), which would favor the deployment of this promising culture.
biorxiv genomics 0-100-users 2019H3K4me3 is neither instructive for, nor informed by, transcription, bioRxiv, 2019-07-19
AbstractH3K4me3 is a near-universal histone modification found predominantly at the 5’ region of genes, with a well-documented association with gene activity. H3K4me3 has been ascribed roles as both an instructor of gene expression and also a downstream consequence of expression, yet neither has been convincingly proven on a genome-wide scale. Here we test these relationships using a combination of bioinformatics, modelling and experimental data from budding yeast in which the levels of H3K4me3 have been massively ablated. We find that loss of H3K4me3 has no effect on the levels of nascent transcription or transcript in the population. Moreover, we observe no change in the rates of transcription initiation, elongation, mRNA export or turnover, or in protein levels, or cell-to-cell variation of mRNA. Loss of H3K4me3 also has no effect on the large changes in gene expression patterns that follow galactose induction. Conversely, loss of RNA polymerase from the nucleus has no effect on the pattern of H3K4me3 deposition and little effect on its levels, despite much larger changes to other chromatin features. Furthermore, large genome-wide changes in transcription, both in response to environmental stress and during metabolic cycling, are not accompanied by corresponding changes in H3K4me3. Thus, despite the correlation between H3K4me3 and gene activity, neither appear to be necessary to maintain levels of the other, nor to influence their changes in response to environmental stimuli. When we compare gene classes with very different levels of H3K4me3 but highly similar transcription levels we find that H3K4me3-marked genes are those whose expression is unresponsive to environmental changes, and that their histones are less acetylated and dynamically turned-over. Constitutive genes are generally well-expressed, which may alone explain the correlation between H3K4me3 and gene expression, while the biological role of H3K4me3 may have more to do with this distinction in gene class.
biorxiv genomics 200-500-users 2019ORANGE A CRISPRCas9-based genome editing toolbox for epitope tagging of endogenous proteins in neurons, bioRxiv, 2019-07-19
ABSTRACTThe correct subcellular distribution of protein complexes establishes the complex morphology of neurons and is fundamental to their functioning. Thus, determining the dynamic distribution of proteins is essential to understand neuronal processes. Fluorescence imaging, in particular super-resolution microscopy, has become invaluable to investigate subcellular protein distribution. However, these approaches suffer from the limited ability to efficiently and reliably label endogenous proteins. We developed ORANGE an Open Resource for the Application of Neuronal Genome Editing, that mediates targeted genomic integration of fluorescent tags in neurons. This toolbox includes a knock-in library for in-depth investigation of endogenous protein distribution, and a detailed protocol explaining how knock-in can be developed for novel targets. In combination with super-resolution microscopy, ORANGE revealed the dynamic nanoscale organization of endogenous neuronal signaling molecules, synaptic scaffolding proteins, and neurotransmitter receptors. Thus, ORANGE enables quantitation of expression and distribution for virtually any protein in neurons at high resolution and will significantly further our understanding of neuronal cell biology.
biorxiv neuroscience 200-500-users 2019μDamID a microfluidic approach for imaging and sequencing protein-DNA interactions in single cells, bioRxiv, 2019-07-19
AbstractGenome regulation depends on carefully programmed protein-DNA interactions that maintain or alter gene expression states, often by influencing chromatin organization. Most studies of these interactions to date have relied on bulk methods, which in many systems cannot capture the dynamic single-cell nature of these interactions as they modulate cell states. One method allowing for sensitive single-cell mapping of protein-DNA interactions is DNA adenine methyltransferase identification (DamID), which records a protein’s DNA-binding history by methylating adenine bases in its vicinity, then selectively amplifies and sequences these methylated regions. These interaction sites can also be visualized using fluorescent proteins that bind to methyladenines. Here we combine these imaging and sequencing technologies in an integrated microfluidic platform (μDamID) that enables single-cell isolation, imaging, and sorting, followed by DamID. We apply this system to generate paired single-cell imaging and sequencing data from a human cell line, in which we map and validate interactions between DNA and nuclear lamina proteins, providing a measure of 3D chromatin organization and broad gene regulation patterns. μDamID provides the unique ability to compare paired imaging and sequencing data for each cell and between cells, enabling the joint analysis of the nuclear localization, sequence identity, and variability of protein-DNA interactions.
biorxiv bioengineering 0-100-users 2019Long live the king chromosome-level assembly of the lion (Panthera leo) using linked-read, Hi-C, and long read data, bioRxiv, 2019-07-18
AbstractThe lion (Panthera leo) is one of the most popular and iconic feline species on the planet, yet in spite of its popularity, the last century has seen massive declines for lion populations worldwide. Genomic resources for endangered species represent an important way forward for the field of conservation, enabling high-resolution studies of demography, disease, and population dynamics. Here, we present a chromosome-level assembly for the captive African lion from the Exotic Feline Rescue Center as a resource for current and subsequent genetic work of the sole social species of the Panthera clade. Our assembly is composed of 10x Genomics Chromium data, Dovetail Hi-C, and Oxford Nanopore long-read data. Synteny is highly conserved between the lion, other Panthera genomes, and the domestic cat. We find variability in the length and levels of homozygosity across the genomes of the lion sequenced here and other previous published resequence data, indicating contrasting histories of recent and ancient small population sizes andor inbreeding. Demographic analyses reveal similar histories across all individuals except the Asiatic lion, which shows a more rapid decline in population size. This high-quality genome will greatly aid in the continuing research and conservation efforts for the lion.
biorxiv genomics 100-200-users 2019Peripheral blood cell immunophenotyping reveals distinct subgroups of inflamed depression, bioRxiv, 2019-07-18
AbstractDepression has been associated with increased inflammatory proteins but changes in circulating immune cells are less well defined. We used multi-parametric flow cytometry to investigate 14 subsets of peripheral blood cells in 206 cases of major depressive disorder (MDD) and 77 age- and sex-matched controls. There were significant case-control differences, by univariate and multivariate analysis cases showed increased immune cell counts, especially neutrophils, CD4+ T cells and monocytes, and increased inflammatory proteins (C-reactive protein and interleukin-6). Within-group analysis demonstrated significant association between the severity of depressive symptoms and increased myeloid and CD4+ cell counts. MDD cases could be partitioned into two groups by forced binary clustering of cell counts the inflamed depression group (N=81 out of 206; 39%) had increased monocyte, CD4+ and neutrophil counts, increased C-reactive protein (CRP) and interleukin 6 (IL-6), and was more depressed than the uninflamed majority of cases. Relaxing the presumption of a binary classification, data-driven clustering identified four subgroups of MDD cases two of these subgroups (N=38 and N=100; 67% collectively) were associated with increased inflammatory proteins and more severe depression, but differed from each other in the relative weighting of myeloid and lymphoid cell counts. Case-control and within-group results were robust to statistical control for the potentially confounding effects of age, sex, BMI, recent infection status, and tobacco use. Peripheral blood immunophenotyping can be used to identify a candidate cellular biomarker of inflamed depression, and to further decompose that binary partition, suggesting that there is more than one mechanistic pathway underlying inflamed depression.One Sentence SummaryTwo subgroups of depressed cases (about two-thirds of all 206 cases) were identified by peripheral blood biomarker evidence of distinctive cellular immunophenotypes, biased towards the myeloid or lymphoid lineages in different subgroups, but consistently associated with increased blood concentrations of inflammatory proteins and greater severity of depressive symptoms.
biorxiv immunology 0-100-users 2019