Assembly of a pan-genome from deep sequencing of 910 humans of African descent, Nature Genetics, 2018-11-13
We used a deeply sequenced dataset of 910 individuals, all of African descent, to construct a set of DNA sequences that is present in these individuals but missing from the reference human genome. We aligned 1.19 trillion reads from the 910 individuals to the reference genome (GRCh38), collected all reads that failed to align, and assembled these reads into contiguous sequences (contigs). We then compared all contigs to one another to identify a set of unique sequences representing regions of the African pan-genome missing from the reference genome. Our analysis revealed 296,485,284 bp in 125,715 distinct contigs present in the populations of African descent, demonstrating that the African pan-genome contains ~10% more DNA than the current human reference genome. Although the functional significance of nearly all of this sequence is unknown, 387 of the novel contigs fall within 315 distinct protein-coding genes, and the rest appear to be intergenic.
nature genetics genetics 500+-users 2018Structure of a bacterial ATP synthase, bioRxiv, 2018-11-06
ATP synthases produce ATP from ADP and inorganic phosphate with energy from a transmembrane proton motive force. Bacterial ATP synthases have been studied extensively because they are the simplest form of the enzyme and because of the relative ease of genetic manipulation of these complexes. We expressed the Bacillus PS3 ATP synthase in Eschericia coli, purified it, and imaged it by cryo-EM, allowing us to build atomic models of the complex in three rotational states. The position of subunit e shows how it is able to inhibit ATP hydrolysis while allowing ATP synthesis. The architecture of the membrane region shows how the simple bacterial ATP synthase is able to perform the same core functions as the equivalent, but more complicated, mitochondrial complex. The structures reveal the path of transmembrane proton translocation and provide a model for understanding decades of biochemical analysis interrogating the roles of specific residues in the enzyme.
biorxiv biochemistry 500+-users 2018Challenges and recommendations to improve installability and archival stability of omics computational tools, bioRxiv, 2018-10-26
AbstractDeveloping new software tools for analysis of large-scale biological data is a key component of advancing modern biomedical research. Scientific reproduction of published findings requires running computational tools on data generated by such studies, yet little attention is presently allocated to the installability and archival stability of computational software tools. Scientific journals require data and code sharing, but none currently require authors to guarantee the continuing functionality of newly published tools. We have estimated the archival stability of computational biology software tools by performing an empirical analysis of the internet presence for 36,702 omics software resources published from 2005 to 2017. We found that almost 28% of all resources are currently not accessible through URLs published in the paper they first appeared in. Among the 98 software tools selected for our installability test, 51% were deemed “easy to install,” and 28% of the tools failed to be installed at all due to problems in the implementation. Moreover, for papers introducing new software, we found that the number of citations significantly increased when authors provided an easy installation process. We propose for incorporation into journal policy several practical solutions for increasing the widespread installability and archival stability of published bioinformatics software.
biorxiv bioinformatics 500+-users 2018The genetics of university success, Scientific Reports, 2018-09-25
University success, which includes enrolment in and achievement at university, as well as quality of the university, have all been linked to later earnings, health and wellbeing. However, little is known about the causes and correlates of differences in university-level outcomes. Capitalizing on both quantitative and molecular genetic data, we perform the first genetically sensitive investigation of university success with a UK-representative sample of 3,000 genotyped individuals and 3,000 twin pairs. Twin analyses indicate substantial additive genetic influence on university entrance exam achievement (57%), university enrolment (51%), university quality (57%) and university achievement (46%). We find that environmental effects tend to be non-shared, although the shared environment is substantial for university enrolment. Furthermore, using multivariate twin analysis, we show moderate to high genetic correlations between university success variables (0.27–0.76). Analyses using DNA alone also support genetic influence on university success. Indeed, a genome-wide polygenic score, derived from a 2016 genome-wide association study of years of education, predicts up to 5% of the variance in each university success variable. These findings suggest young adults select and modify their educational experiences in part based on their genetic propensities and highlight the potential for DNA-based predictions of real-world outcomes, which will continue to increase in predictive power.
scientific reports genetics 500+-users 2018Increased frequency of travel in the presence of cross-immunity may act to decrease the chance of a global pandemic, bioRxiv, 2018-08-31
The high frequency of modern travel has led to concerns about a devastating pandemic since a lethal pathogen strain could spread worldwide quickly. Many historical pandemics have arisen following pathogen evolution to a more virulent form. However, some pathogen strains invoke immune responses that provide partial cross-immunity against infection with related strains. Here, we consider a mathematical model of successive outbreaks of two strains a low virulence strain outbreak followed by a high virulence strain outbreak. Under these circumstances, we investigate the impacts of varying travel rates and cross-immunity on the probability that a major epidemic of the high virulence strain occurs, and the size of that outbreak. Frequent travel between subpopulations can lead to widespread immunity to the high virulence strain, driven by exposure to the low virulence strain. As a result, major epidemics of the high virulence strain are less likely, and can potentially be smaller, with more connected subpopulations. Cross-immunity may be a factor contributing to the absence of a global pandemic as severe as the 1918 influenza pandemic in the century since.
biorxiv epidemiology 500+-users 2018Author-Reviewer Homophily in Peer Review, bioRxiv, 2018-08-29
AbstractThe fairness of scholarly peer review has been challenged by evidence of disparities in publication outcomes based on author demographic characteristics. To assess this, we conducted an exploratory analysis of peer review outcomes of 23,876 initial submissions and 7,192 full submissions that were submitted to the biosciences journal eLife between 2012 and 2017. Women and authors from nations outside of North America and Europe were underrepresented both as gatekeepers (editors and peer reviewers) and authors. We found evidence of a homophilic relationship between the demographics of the gatekeepers and authors and the outcome of peer review; that is, there were higher rates of acceptance in the case of gender and country homophily. The acceptance rate for manuscripts with male last authors was seven percent, or 3.5 percentage points, greater than for female last authors (95% CI = [0.5, 6.4]); this gender inequity was greatest, at nine percent or about 4.8 percentage points (95% CI = [0.3, 9.1]), when the team of reviewers was all male; this difference was smaller and not significantly different for mixed-gender reviewer teams. Homogeny between countries of the gatekeeper and the corresponding author was also associated with higher acceptance rates for many countries. To test for the persistence of these effects after controlling for potentially confounding variables, we conducted a logistic regression including document and author metadata. Disparities in acceptance rates associated with gender and country of affiliation and the homophilic associations remained. We conclude with a discussion of mechanisms that could contribute to this effect, directions for future research, and policy implications. Code and anonymized data have been made available at <jatsext-link xmlnsxlink=httpwww.w3.org1999xlink ext-link-type=uri xlinkhref=httpsgithub.commurraydselife-analysis>httpsgithub.commurraydselife-analysis<jatsext-link>Author summaryPeer review, the primary method by which scientific work is evaluated, is ideally a fair and equitable process in which scientific work is judged solely on its own merit. However, the integrity of peer review has been called into question based on evidence that outcomes often differ between male and female authors, and for authors in different countries. We investigated such disparities at the biosciences journal eLife by analyzing the demographics of authors and gatekeepers (editors and peer reviewers), and peer review outcomes of all submissions between 2012 and 2017. Outcomes were more favorable for male authors and those affiliated with institutions in North America and Europe; these groups were also over-represented among gatekeepers. There was evidence that peer review outcomes were influenced by homophily —a preference of gatekeepers for manuscripts from authors with shared characteristics. We discuss mechanisms that could contribute to this effect, directions for future research, and policy implications.
biorxiv scientific-communication-and-education 500+-users 2018