A genome-wide algal mutant library reveals a global view of genes required for eukaryotic photosynthesis, bioRxiv, 2018-11-07
Photosynthetic organisms provide food and energy for nearly all life on Earth, yet half of their protein-coding genes remain uncharacterized1,2. Characterization of these genes could be greatly accelerated by new genetic resources for unicellular organisms that complement the use of multicellular plants by enabling higher-throughput studies. Here, we generated a genome-wide, indexed library of mapped insertion mutants for the flagship unicellular alga Chlamydomonas reinhardtii (Chlamydomonas hereafter). The 62,389 mutants in the library, covering 83% of nuclear, protein-coding genes, are available to the community. Each mutant contains unique DNA barcodes, allowing the collection to be screened as a pool. We leveraged this feature to perform a genome-wide survey of genes required for photosynthesis, which identified 303 candidate genes. Characterization of one of these genes, the conserved predicted phosphatase CPL3, showed it is important for accumulation of multiple photosynthetic protein complexes. Strikingly, 21 of the 43 highest-confidence genes are novel, opening new opportunities for advances in our understanding of this biogeochemically fundamental process. This library is the first genome-wide mapped mutant resource in any unicellular photosynthetic organism, and will accelerate the characterization of thousands of genes in algae, plants and animals.
biorxiv genomics 0-100-users 2018AnnoTree visualization and exploration of a functionally annotated microbial tree of life, bioRxiv, 2018-11-06
AbstractBacterial genomics has revolutionized our understanding of the microbial tree of life; however, mapping and visualizing the distribution of functional traits across bacteria remains a challenge. Here, we introduce AnnoTree - an interactive, functionally annotated bacterial tree of life that integrates taxonomic, phylogenetic, and functional annotation data from nearly 24,000 bacterial genomes. AnnoTree enables visualization of millions of precomputed genome annotations across the bacterial phylogeny, thereby allowing users to explore gene distributions as well as patterns of gene gain and loss across bacteria. Using AnnoTree, we examined the phylogenomic distributions of 28,311 geneprotein families, and measured their phylogenetic conservation, patchiness, and lineage-specificity. Our analyses revealed widespread phylogenetic patchiness among bacterial gene families, reflecting the dynamic evolution of prokaryotic genomes. Genes involved in phage infectiondefense, mobile elements, and antibiotic resistance dominated the list of most patchy traits, as well as numerous intriguing metabolic enzymes that appear to have undergone frequent horizontal transfer. We anticipate that AnnoTree will be a valuable resource for exploring gene histories across bacteria, and will act as a catalyst for biological and evolutionary hypothesis generation.
biorxiv bioinformatics 100-200-users 2018Comparative analysis of sequencing technologies platforms for single-cell transcriptomics, bioRxiv, 2018-11-06
AbstractAll single-cell RNA-seq protocols and technologies require library preparation prior to sequencing on a platform such as Illumina. Here, we present the first report to utilize the BGISEQ-500 platform for scRNA-seq, and compare the sensitivity and accuracy to Illumina sequencing. We generate a scRNA-seq resource of 468 unique single-cells and 1,297 matched single cDNA samples, performing SMARTer and Smart-seq2 protocols on mESCs and K562 cells with RNA spike-ins. We sequence these libraries on both BGISEQ-500 and Illumina HiSeq platforms using single- and paired-end reads. The two platforms have comparable sensitivity and accuracy in terms of quantification of gene expression, and low technical variability. Our study provides a standardised scRNA-seq resource to benchmark new scRNA-seq library preparation protocols and sequencing platforms.
biorxiv genomics 0-100-users 2018Exploring neighborhoods in large metagenome assembly graphs reveals hidden sequence diversity, bioRxiv, 2018-11-06
Genomes computationally inferred from large metagenomic data sets are often incomplete and may be missing functionally important content and strain variation. We introduce an information retrieval system for large metagenomic data sets that exploits the sparsity of DNA assembly graphs to efficiently extract subgraphs surrounding an inferred genome. We apply this system to recover missing content from genome bins and show that substantial genomic sequence variation is present in a real metagenome. Our software implementation is available at <jatsext-link xmlnsxlink=httpwww.w3.org1999xlink ext-link-type=uri xlinkhref=httpsgithub.comspacegraphcats>httpsgithub.comspacegraphcats<jatsext-link> spacegraphcats under the 3-Clause BSD License.
biorxiv bioinformatics 100-200-users 2018Investigating causal relationships between sleep traits and risk of breast cancer a Mendelian randomization study, bioRxiv, 2018-11-06
AbstractObjectiveTo examine whether sleep traits have a causal effect on risk of breast cancer.DesignMultivariable regression, one- and two-sample Mendelian randomization.SettingThe UK Biobank prospective cohort study and the Breast Cancer Association Consortium (BCAC) case-control genome-wide association study.Participants156,848 women in the multivariable regression and one-sample Mendelian randomization analysis in UK Biobank (7,784 with a breast cancer diagnosis) and 122,977 breast cancer cases and 105,974 controls from BCAC in the two-sample Mendelian randomization analysis.ExposuresSelf-reported chronotype (morningevening preference), insomnia symptoms and sleep duration in multivariable regression, and genetic variants robustly associated with these sleep traits.Main outcome measuresBreast cancer (prevalent and incident cases in UK Biobank, prevalent cases only in BCAC).ResultsIn multivariable regression analysis using data on breast cancer incidence in UK Biobank, morning preference was inversely associated with breast cancer (HR 0.95, 95% CI 0.93, 0.98 per category increase) while there was little evidence for an association with sleep duration and insomnia symptoms. Using 341 single nucleotide polymorphisms (SNPs) associated with chronotype, 91 SNPs associated sleep duration and 57 SNPs associated with insomnia symptoms, one-sample MR analysis in UK Biobank provided some supportive evidence for a protective effect of morning preference on breast cancer risk (HR 0.85, 95% 0.70, 1.03 per category increase) but imprecise estimates for sleep duration and insomnia symptoms. Two-sample MR using data from BCAC supported findings for a protective effect of morning preference (OR 0.88, 95% CI 0.82, 0.93 per category increase) and adverse effect of increased sleep duration (OR 1.19, 95% CI 1.02, 1.39 per hour increase) on breast cancer (both estrogen receptor positive and negative), while there was inconsistent evidence for insomnia symptoms. Results were largely robust to sensitivity analyses accounting for horizontal pleiotropy.ConclusionsWe found consistent evidence for a protective effect of morning preference and suggestive evidence for an adverse effect of sleep duration on breast cancer risk.
biorxiv epidemiology 0-100-users 2018Structure of a bacterial ATP synthase, bioRxiv, 2018-11-06
ATP synthases produce ATP from ADP and inorganic phosphate with energy from a transmembrane proton motive force. Bacterial ATP synthases have been studied extensively because they are the simplest form of the enzyme and because of the relative ease of genetic manipulation of these complexes. We expressed the Bacillus PS3 ATP synthase in Eschericia coli, purified it, and imaged it by cryo-EM, allowing us to build atomic models of the complex in three rotational states. The position of subunit e shows how it is able to inhibit ATP hydrolysis while allowing ATP synthesis. The architecture of the membrane region shows how the simple bacterial ATP synthase is able to perform the same core functions as the equivalent, but more complicated, mitochondrial complex. The structures reveal the path of transmembrane proton translocation and provide a model for understanding decades of biochemical analysis interrogating the roles of specific residues in the enzyme.
biorxiv biochemistry 500+-users 2018