A generative model for bipartite gene-sharing networks
Jaime Iranzo, Pedro Jódar, Eugene V. Koonin, Susanna Manrubia, José A. Cuesta
TLDR
This paper proposes a two-parameter generative model for bipartite gene-sharing networks, explaining observed gene and genome degree distributions.
Key contributions
- Introduces a mechanistic model for bipartite gene-sharing networks based on evolutionary processes.
- Explains characteristic scale-free gene and exponential genome degree distributions observed empirically.
- Derives analytical expressions validated by simulations and fits empirical data from viruses and prokaryotes.
- Suggests viral evolution is primarily driven by gene gain, with gene loss playing a minimal role.
Why it matters
This model provides a simple yet powerful framework to understand the evolutionary forces shaping viral and mobile genetic element genomes. It offers both qualitative and quantitative insights into genome plasticity, particularly highlighting the dominance of gene gain in viral evolution.
Original Abstract
Gene-sharing networks provide a powerful framework to study the evolution of viruses and mobile genetic elements. These bipartite networks, which link genes to the genomes that contain them, exhibit characteristic degree distributions: a scale-free distribution for genes and an exponential-like decay for genomes. Here, we propose a mechanistic model that explains these patterns through fundamental evolutionary processes including horizontal gene transfer, capture of new genes, emergence of new genomes, and gene loss. Using a mean-field approximation, we derive analytical expressions for the asymptotic gene and genome degree distributions, recapitulating a power-law distribution for genes and an exponential distribution for genomes. Numerical simulations validate these predictions and yield parameter values that closely fit empirical data from dsDNA viruses, RNA viruses, and prokaryotic pangenomes. This simple model with only two parameters provides a generative framework for bipartite gene-sharing networks, offering qualitative and quantitative insights into the main evolutionary forces driving genome plasticity. Setting the gene loss rate to zero, the gene and genome degree distributions of the model closely fit the empirically observed distributions. Thus, evolution of viruses appears to be dominated by gene gain, in agreement with the results of independent reconstructions of viral evolution.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.