Abstract
Somatic DNA mutations are a characteristic of cancerous cells, being usually key in the origin and development of cancer. In the last few years, somatic mutations have been studied in order to understand which processes or conditions may generate them, with the purpose of developing prevention and treatment strategies. In this work we propose a novel sparse regularised method that aims at extracting mutational signatures from somatic mutations. We developed a pipeline that extracts the dataset from raw data and performs the analysis returning the signatures and their relative usage frequencies. A thorough comparison between our method and the state of the art procedure reveals that our pipeline can be used alternatively without losing information and possibly gaining more interpretability and precision.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
Code publicly available at https://github.com/slipguru/dalila under Free BDS license.
References
Alexandrov, L.B., et al.: Signatures of mutational processes in human cancer. Nature 500(7463), 415–421 (2013)
Alexandrov, L.B., Nik-Zainal, S., Wedge, D.C., Campbell, P.J., Stratton, M.R.: Deciphering signatures of mutational processes operative in human cancer. Cell Rep. 3(1), 246–359 (2013)
Bolte, J., Sabach, S., Teboulle, M.: Proximal alternating linearized minimization for nonconvex and nonsmooth problems. Math. Program. 146(1), 459–494 (2014)
Jain, A.K.: Data clustering: 50 years beyond K-means. Pattern Recogn. Lett. 31(8), 651–666 (2010)
Lee, D., Seung, H.S.: Learning the parts of objects by non-negative matrix factorization. Nature 401(6755), 788–791 (1999)
Nik-Zainal, S., et al.: Mutational processes molding the genomes of 21 breast cancers. Cell 149(5), 979–993 (2012)
Koboldt, D.C., et al.: VarScan: variant detection in massively parallel sequencing of individual and pooled samples. Bioinformatics 25(17), 2283 (2009). LO
Rakotomamonjy, A.: Direct optimization of the dictionary learning problem. IEEE Trans. Signal Process. 61(22), 5495–5506 (2013)
Ramazzotti, D., Lal, A., Liu, K., Tibshirani, R., Sidow, A.: De Novo mutational signature discovery in tumor genomes using SparseSignatures. bioRxiv. https://www.biorxiv.org/content/early/2018/08/04/384834
Rousseeuw, P.J.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 53–65 (1987)
Tozzo, V., D’Amario, V., Barla, A.: Hey there’s DALILA: a DictionAry LearnIng LibrAry. In: ICCSW 2017 proceedings. OpenAccess Series in Informatics (OASIcs), vol. 60, pp. 1–14 (2018)
Acknowledgments
The authors are indebted to Dr. Alexandrov for valuable feedback on the pipeline analysis.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Tozzo, V., Barla, A. (2019). Cancer Mutational Signatures Identification with Sparse Dictionary Learning. In: Bartoletti, M., et al. Computational Intelligence Methods for Bioinformatics and Biostatistics. CIBB 2017. Lecture Notes in Computer Science(), vol 10834. Springer, Cham. https://doi.org/10.1007/978-3-030-14160-8_4
Download citation
DOI: https://doi.org/10.1007/978-3-030-14160-8_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-14159-2
Online ISBN: 978-3-030-14160-8
eBook Packages: Computer ScienceComputer Science (R0)