Integrating opportunities and parametrized signatures for improved mutational processes estimation in extended sequence contexts
Ragnhild Laursen, Marta Pelizzola, Lasse Maretty, Asger Hobolth
TLDR
This paper introduces a robust method for estimating mutational signatures by integrating opportunities and parametrized signatures in extended sequence contexts.
Key contributions
- Integrates mutational opportunities for more accurate analysis.
- Enables estimation in extended sequence contexts (2-3 flanking nucleotides).
- Applies a Negative Binomial model for robust signature estimation.
- Introduces parametrized signatures, enhancing reliability.
Why it matters
This research significantly enhances the robustness and reliability of mutational signature estimation, particularly in complex extended sequence contexts. Improved accuracy in identifying these signatures is crucial for understanding underlying mutational processes and disease development.
Original Abstract
Mutational signatures describe the pattern of mutations over the different mutation types. Each mutation type is determined by a base substitution and the flanking nucleotides to the left and right of that base substitution. Due to the widespread interest in mutational signatures, several efforts have been devoted to the development of methods for robust and stable signature estimation. Here, we combine various extensions of the standard framework to estimate mutational signatures. These extensions include (a) incorporating opportunities to the analysis, (b) allowing for extended sequence contexts, (c) using the Negative Binomial model, and (d) parametrizing the signatures. We show that the combination of these four extensions gives very robust and reliable mutational signatures. In particular, we highlight the importance of including mutational opportunities and parametrizing the signatures when the mutation types describe an extended sequence context with two or three flanking nucleotides to each side of the base substitution.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.