Genome sequence of the cluster root forming white lupin, bioRxiv, 2019-07-19
White lupin (Lupinus albus L.) is a legume that produces seeds recognized for their high protein content and good nutritional value (lowest glycemic index of all grains, high dietary fiber content, and zero gluten or starch)1–5. White lupin can form nitrogen-fixing nodules but has lost the ability to form mycorrhizal symbiosis with fungi6. Nevertheless, its root system is well adapted to poor soils it produces cluster roots, constituted of dozens of determinate lateral roots that improve soil exploration and phosphate remobilization7. As phosphate is a limited resource that comes from rock reserves8, the production of cluster roots is a trait of interest to improve fertilizers efficiency. Using long reads sequencing technologies, we provide a high-quality genome sequence of a modern variety of white lupin (2n=50, 451 Mb), as well as de novo assemblies of a landrace and a wild relative. We describe how domestication impacted soil exploration capacity through the early establishment of lateral and cluster roots. We identify the APETALA2 transcription factor LaPUCHI-1, homolog of the Arabidopsis morphogenesis coordinator9, as a potential regulator of this trait. Our high-quality genome and companion genomic and transcriptomic resources enable the development of modern breeding strategies to increase and stabilize yield and to develop new varieties with reduced allergenic properties (caused by conglutins10), which would favor the deployment of this promising culture.
biorxiv genomics 0-100-users 2019μDamID a microfluidic approach for imaging and sequencing protein-DNA interactions in single cells, bioRxiv, 2019-07-19
AbstractGenome regulation depends on carefully programmed protein-DNA interactions that maintain or alter gene expression states, often by influencing chromatin organization. Most studies of these interactions to date have relied on bulk methods, which in many systems cannot capture the dynamic single-cell nature of these interactions as they modulate cell states. One method allowing for sensitive single-cell mapping of protein-DNA interactions is DNA adenine methyltransferase identification (DamID), which records a protein’s DNA-binding history by methylating adenine bases in its vicinity, then selectively amplifies and sequences these methylated regions. These interaction sites can also be visualized using fluorescent proteins that bind to methyladenines. Here we combine these imaging and sequencing technologies in an integrated microfluidic platform (μDamID) that enables single-cell isolation, imaging, and sorting, followed by DamID. We apply this system to generate paired single-cell imaging and sequencing data from a human cell line, in which we map and validate interactions between DNA and nuclear lamina proteins, providing a measure of 3D chromatin organization and broad gene regulation patterns. μDamID provides the unique ability to compare paired imaging and sequencing data for each cell and between cells, enabling the joint analysis of the nuclear localization, sequence identity, and variability of protein-DNA interactions.
biorxiv bioengineering 0-100-users 2019Peripheral blood cell immunophenotyping reveals distinct subgroups of inflamed depression, bioRxiv, 2019-07-18
AbstractDepression has been associated with increased inflammatory proteins but changes in circulating immune cells are less well defined. We used multi-parametric flow cytometry to investigate 14 subsets of peripheral blood cells in 206 cases of major depressive disorder (MDD) and 77 age- and sex-matched controls. There were significant case-control differences, by univariate and multivariate analysis cases showed increased immune cell counts, especially neutrophils, CD4+ T cells and monocytes, and increased inflammatory proteins (C-reactive protein and interleukin-6). Within-group analysis demonstrated significant association between the severity of depressive symptoms and increased myeloid and CD4+ cell counts. MDD cases could be partitioned into two groups by forced binary clustering of cell counts the inflamed depression group (N=81 out of 206; 39%) had increased monocyte, CD4+ and neutrophil counts, increased C-reactive protein (CRP) and interleukin 6 (IL-6), and was more depressed than the uninflamed majority of cases. Relaxing the presumption of a binary classification, data-driven clustering identified four subgroups of MDD cases two of these subgroups (N=38 and N=100; 67% collectively) were associated with increased inflammatory proteins and more severe depression, but differed from each other in the relative weighting of myeloid and lymphoid cell counts. Case-control and within-group results were robust to statistical control for the potentially confounding effects of age, sex, BMI, recent infection status, and tobacco use. Peripheral blood immunophenotyping can be used to identify a candidate cellular biomarker of inflamed depression, and to further decompose that binary partition, suggesting that there is more than one mechanistic pathway underlying inflamed depression.One Sentence SummaryTwo subgroups of depressed cases (about two-thirds of all 206 cases) were identified by peripheral blood biomarker evidence of distinctive cellular immunophenotypes, biased towards the myeloid or lymphoid lineages in different subgroups, but consistently associated with increased blood concentrations of inflammatory proteins and greater severity of depressive symptoms.
biorxiv immunology 0-100-users 2019DNA mismatches reveal widespread conformational penalties in protein-DNA recognition, bioRxiv, 2019-07-17
ABSTRACTTranscription-factor (TF) proteins recognize specific genomic sequences, despite an overwhelming excess of non-specific DNA, to regulate complex gene expression programs1–3. While there have been significant advances in understanding how DNA sequence and shape contribute to recognition, some fundamental aspects of protein-DNA binding remain poorly understood2,3. Many DNA-binding proteins induce changes in the DNA structure outside the intrinsic B-DNA envelope. How the energetic cost associated with distorting DNA contributes to recognition has proven difficult to study and measure experimentally because the distorted DNA structures exist as low-abundance conformations in the naked B-DNA ensemble4–10. Here, we use a novel high-throughput assay called SaMBA (Saturation Mismatch-Binding Assay) to investigate the role of DNA conformational penalties in TF-DNA recognition. The approach introduces mismatched base-pairs (i.e. mispairs) within TF binding sites to pre-induce a variety of DNA structural distortions much larger than those induced by changes in Watson-Crick sequence. Strikingly, while most mismatches either weakened TF binding (~70%) or had negligible effects (~20%), approximately 10% of mismatches increased binding and at least one mismatch was found that increased the binding affinity for each of 21 examined TFs. Mismatches also converted sites from the non-specific affinity range into specific sites, and high-affinity sites into “super-sites” stronger than any known canonical binding site. These findings reveal a complex binding landscape that cannot be explained based on DNA sequence alone. Analysis of crystal structures together with NMR and molecular dynamics simulations revealed that many of the mismatches that increase binding induce distortions similar to those induced by TF binding, thus pre-paying some of the energetic cost to deform the DNA. Our work indicates that conformational penalties are a major determinant of protein-DNA recognition, and reveals mechanisms by which mismatches can recruit TFs and thus modulate replication and repair activities in the cell11,12.
biorxiv biophysics 0-100-users 2019Gene networks with transcriptional bursting recapitulate rare transient coordinated expression states in cancer, bioRxiv, 2019-07-17
SUMMARYNon-genetic transcriptional variability at the single-cell level is a potential mechanism for therapy resistance in melanoma. Specifically, rare subpopulations of melanoma cells occupy a transient pre-resistant state characterized by coordinated high expression of several genes. Importantly, these rare cells are able to survive drug treatment and develop resistance. How might these extremely rare states arise and disappear within the population? It is unclear whether the canonical stochastic models of probabilistic transcriptional pulsing can explain this behavior, or if it requires special, hitherto unidentified molecular mechanisms. Here we use mathematical modeling to show that a minimal network comprising of transcriptional bursting and interactions between genes can give rise to rare coordinated high states. We next show that although these states occur across networks of different sizes, they depend strongly on three (out of seven) model parameters and require network connectivity to be ≤ 6. Interestingly, we find that while entry into the rare coordinated high state is initiated by a long transcriptional burst that also triggers entry of other genes, the exit from it occurs through the independent inactivation of individual genes. Finally, our model predicts that increased network connectivity can lead to transcriptionally stable states, which we verify using network inference analysis of experimental data. In sum, we demonstrate that established principles of gene regulation are sufficient to describe this new class of rare cell variability and argue for its general existence in other biological contexts.
biorxiv systems-biology 0-100-users 2019Longitudinal single cell transcriptomics reveals Krt8+ alveolar epithelial progenitors in lung regeneration, bioRxiv, 2019-07-17
Lung injury activates quiescent stem and progenitor cells to regenerate alveolar structures. The sequence and coordination of transcriptional programs during this process has largely remained elusive. Using single cell RNA-seq, we first generated a whole-organ bird’s-eye view on cellular dynamics and cell-cell communication networks during mouse lung regeneration from ∼30,000 cells at six timepoints. We discovered an injury-specific progenitor cell state characterized by Krt8 in flat epithelial cells covering alveolar surfaces. The number of these cells peaked during fibrogenesis in independent mouse models, as well as in human acute lung injury and fibrosis. Krt8+ progenitors featured a highly distinct connectome of receptor-ligand pairs with endothelial cells, fibroblasts, and macrophages. To ‘sky dive’ into epithelial differentiation dynamics, we sequenced >30,000 sorted epithelial cells at 18 timepoints and computationally derived cell state trajectories that were validated by lineage tracing genetic reporter mice. Airway stem cells within the club cell lineage and alveolar type-2 cells underwent transcriptional convergence onto the same Krt8+ progenitor cell state, which later resolved by terminal differentiation into alveolar type-1 cells. We derived distinct transcriptional regulators as key switch points in this process and show that induction of TNF-alphaNFkappaB, p53, and hypoxia driven gene expression programs precede a Sox4, Ctnnb1, and Wwtr1 driven switch towards alveolar type-1 cell fate. We show that epithelial cell plasticity can induce non-gradual transdifferentiation, involving intermediate progenitor cell states that may persist and promote disease if checkpoint signals for terminal differentiation are perturbed.
biorxiv systems-biology 0-100-users 2019