Fitness prediction across scales

Can a phylogenetic model predict the fitness effects of mutations in extant mammals?

In this work based on genome-wide studies across mammalian species and populations, we estimated the proportion of beneficial mutations in protein coding sequences that are restoring pre-existing functions.

Our study is based on the premise that slightly deleterious mutations scattered across the genome are reaching fixation due to genetic drift. These mutations are then subsequently reverted by beneficial back-mutations, generating a balance at which genomes are constantly both “damaged” and “repaired” simultaneously at different loci.

Even though the existence of these back-mutations is predicted by the nearly neutral theory, they have been largely overlooked, and positive selection has often been interpreted as adaptation to changing environments. Despite leaving similar signatures when looking only at populations, adaptive mutations promote phenotypic diversification while beneficial back-mutations reduce diversity between species and stabilize existing systems.

We first estimated selective effects of mutations inside mammalian protein coding sequences, under a model assuming no adaptation at the phylogenetic scale (see notes on Mutation-Selection models).

From these estimates, we then tested whether the fitness effects of mutations estimated at the phylogenetic scale could predict the fitness effects of mutations in extant populations. We divided any mutations into three categories based on their predicted fitness effects: deleterious, nearly-neutral, and beneficial. For each category, the fitness effects of non-synonymous mutations in populations were estimated from the site frequency spectrum of non-synonymous and synonymous polymorphisms. The predicted fitness effects of mutations at the phylogenetic scale can then be compared to the fitness effects of mutations in populations.

Since estimates at the phylogenetic scale are based on the assumption that no adaptation has occurred, using Bayes formula, we then estimated the proportion of beneficial mutations that are not adaptive innovations among all beneficial mutations at the population scale (P[B₀|B]).

This result is robust across all 28 populations from the 6 genera studied: the fraction of beneficial mutations attributed to back-mutations consistently approaches or exceeds one-third, with no significant dependence on effective population size $N_e$. This last finding suggests that the balance between drift-induced degradation and compensatory repair is maintained across species with very different population sizes.

Taken together, these results demonstrate that a substantial fraction of positive selection what has historically been labelled as adaptive in mammalian genomes is better interpreted as part of a pervasive process of a genome constantly being both damaged and repaired simultaneously.

Last updated on Aug 27, 2023

No results found