The network structure of cancer ecosystems, bioRxiv, 2017-12-30

Ever since Paget’s seed-and-soil and Ewing’s connectivity hypotheses to explain tumor metastasis (1,2), it has become clear that cancer progression can be envisaged as an ecological phenomenon. This connection has flourished during the past two decades (3–7), giving rise to important insights into the ecology and evolution of cancer progression, with therapeutic implications (8–10). Here, we take a metapopulation view of metastasis (i.e. the migration to and colonization of, habitat patches) and represent it as a bipartite network, distinguishing source patches, or organs that host a primary tumor, and acceptor patches, or organs colonized ultimately from the source through metastasis. Using 20,326, biomedical records obtained from literature, we show that (i) the network structure of cancer ecosystems is non-random, exhibiting a nested subset pattern as has been found both in the distribution of species across islands and island-like habitats (11–13), and in the distribution of among species interactions across different ecological networks (14–16); (ii) similar to ecological networks, there is a heterogeneous distribution of degree (i.e., number of connections associated with a source or acceptor organ); (iii) there is a significant correlation between metastatic incidence (or the frequency with which tumor cells from a source organ colonize an acceptor one) and arterial blood supply, suggesting that more irrigated organs have a higher probability of developing metastasis or being invaded; (iv) there is a positive correlation between metastatic incidence and acceptor organ degree (or number of different tumor-bearing source organs that generate metastasis in a given acceptor organ), and a negative one between acceptor organ degree and number of stem cell divisions, implying that there are preferred sink organs for metastasis and that this could be related to average acceptor organ cell longevity; (v) there is a negative association between organ cell turnover and source organ degree, implying that organs with rapid cell turnovers tend to generate more metastasis, a process akin to the phenomenon of propagule pressure in ecology (17); and (vi) the cancer ecosystem network exhibits a modular structure in both source and acceptor patches, suggesting that some of them share more connections among themselves than with the rest of the network. We show that both niche-related processes occurring at the organ level as well as spatial connectivity and propagule pressure contribute to metastaticspread and result in a non-random cancer network, which exhibits a truncated power law degree distribution, clustering and a nested subset structure. The similarity between the cancer network and ecological networks highlights the importance of ecological approaches in increasing our understanding of patterns in cancer incidence and dynamics, which may lead to new strategies to control tumor spread within the human ecosystem.

biorxiv ecology 0-100-users 2017

Reproducible Bioinformatics Project A community for reproducible bioinformatics analysis pipelines, bioRxiv, 2017-12-27

AbstractBackgroundReproducibility of a research is a key element in the modern science and it is mandatory for any industrial application. It represents the ability of replicating an experiment independently by the location and the operator. Therefore, a study can be considered reproducible only if all used data are available and the exploited computational analysis workflow is clearly described. However, today for reproducing a complex bioinformatics analysis, the raw data and a list of tools used in the workflow could be not enough to guarantee the reproducibility of the results obtained. Indeed, different releases of the same tools andor of the system libraries (exploited by such tools) might lead to sneaky reproducibility issues.ResultsTo address this challenge, we established the Reproducible Bioinformatics Project (RBP), which is a non-profit and open-source project, whose aim is to provide a schema and an infrastructure, based on docker images and R package, to provide reproducible results in Bioinformatics. One or more Docker images are then defined for a workflow (typically one for each task), while the workflow implementation is handled via R-functions embedded in a package available at github repository. Thus, a bioinformatician participating to the project has firstly to integrate herhis workflow modules into Docker image(s) exploiting an Ubuntu docker image developed ad hoc by RPB to make easier this task. Secondly, the workflow implementation must be realized in R according to an R-skeleton function made available by RPB to guarantee homogeneity and reusability among different RPB functions. Moreover shehe has to provide the R vignette explaining the package functionality together with an example dataset which can be used to improve the user confidence in the workflow utilization.ConclusionsReproducible Bioinformatics Project provides a general schema and an infrastructure to distribute robust and reproducible workflows. Thus, it guarantees to final users the ability to repeat consistently any analysis independently by the used UNIX-like architecture.

biorxiv bioinformatics 0-100-users 2017

 

Created with the audiences framework by Jedidiah Carlson

Powered by Hugo