Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3584371.3612970acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
research-article
Open access

Generalized Matching Distance: Tumor Phylogeny Comparison Beyond the Infinite Sites Assumption

Published: 04 October 2023 Publication History

Abstract

As the field of tumor phylogenomics matures, numerous methods have been developed to infer tumor phylogenies from many types of sequencing data. The tumor phylogenies being inferred have transitioned from abiding strictly to the Infinite Sites Assumption (ISA), which says that mutations are gained once and never lost, to more relaxed and more biologically accurate models such as Camin-Sokal or k-Dollo models which allow mutations to be either gained or lost multiple times, respectively. As the tumor phylogenies being inferred have become more attuned to the underlying biology of cancer, methods of comparing, or computing distances, between these phylogenies have not yet caught up. In order to address this discrepancy, we propose the Generalized Matching Distance (GMD) Problem which allows for ISA distance measures to be applied to non-ISA phylogenies after a particular type of transformation. We provide a simple, but effective solution for exactly solving the GMD Problem which is often efficient enough for many tumor phylogenies. We also provide a heuristic approach to solving the GMD Problem for instances where our exact solution is not appropriate. In our simulated experiments, we show that by using our approach to solve the GMD Problem we can effectively use ISA tumor distance measures to compare phylogenies with parallel mutations (those that are gained multiple times). Additionally, we show that our heuristic approach works well on a subset of phylogenies under the Camin-Sokal and k-Dollo models. Finally, we apply our method for solving the GMD to three tumor phylogenies generated from a colorectal cancer patient. The data for our experiments and the code for using GMD is available at: https://bitbucket.org/oesperlab/gmd/src/master/

References

[1]
Avesh Kumar Agrawal and Hamim Zafar. 2022. FiMO: Inferring the Temporal Order of Mutations on Clonal Phylogeny under Finite-sites Models. bioRxiv (2022), 2022--01.
[2]
Nabil Amirouchene-Angelozzi, Charles Swanton, and Alberto Bardelli. 2017. Tumor Evolution as a Therapeutic TargetThe Impact of Tumor Evolution in Precision Medicine. Cancer discovery 7, 8 (2017), 805--817.
[3]
Niko Beerenwinkel, Roland F Schwarz, Moritz Gerstung, and Florian Markowetz. 2015. Cancer evolution: mathematical models and computational inference. Systematic biology 64, 1 (2015), e1--e25.
[4]
Joseph H Camin and Robert R Sokal. 1965. A method for deducing branching sequences in phylogeny. Evolution (1965), 311--326.
[5]
Simone Ciccolella, Giulia Bernardini, Luca Denti, Paola Bonizzoni, Marco Previtali, and Gianluca Della Vedova. 2021. Triplet-based similarity score for fully multilabeled trees with poly-occurring labels. Bioinformatics 37, 2 (2021), 178--184.
[6]
Zach DiNardo, Kiran Tomlinson, Anna Ritz, and Layla Oesper. 2020. Distance measures for tumor evolutionary trees. Bioinformatics 36, 7 (2020), 2090--2097.
[7]
Louis Dollo. 1893. The laws of evolution. Bull. Soc. Bel. Geol. Paleontol 7 (1893), 164--166.
[8]
Mohammed El-Kebir, Layla Oesper, Hannah Acheson-Field, and Benjamin J Raphael. 2015. Reconstruction of clonal trees and tumor composition from multi-sample sequencing data. Bioinformatics 31, 12 (2015), i62--i70.
[9]
Kiya Govek, Camden Sikes, and Layla Oesper. 2018. A consensus approach to infer tumor evolutionary histories. In Proceedings of the 2018 Acm international conference on bioinformatics, computational biology, and health informatics. 63--72.
[10]
Kiya Govek, Camden Sikes, Yangqiaoyu Zhou, and Layla Oesper. 2020. Graphyc: Using consensus to infer tumor evolution. IEEE/ACM Transactions on Computational Biology and Bioinformatics 19, 1 (2020), 465--478.
[11]
Philippe Gui and Trever G Bivona. 2022. Evolution of metastasis: New tools and insights. Trends in Cancer 8, 2 (2022), 98--109.
[12]
Katharina Jahn, Niko Beerenwinkel, and Louxin Zhang. 2021. The Bourque distances for mutation trees of cancers. Algorithms for Molecular Biology 16, 1 (2021), 1--15.
[13]
Nikolai Karpov, Salem Malikic, Md Rahman, S Cenk Sahinalp, et al. 2018. A multi-labeled tree edit distance for comparing" clonal trees" of tumor progression. In 18th International Workshop on Algorithms in Bioinformatics (WABI 2018). Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik.
[14]
Nikolai Karpov, Salem Malikic, Md Rahman, S Cenk Sahinalp, et al. 2019. A multi-labeled tree dissimilarity measure for comparing "clonal trees" of tumor progression. Algorithms for Molecular Biology 14, 1 (2019), 1--18.
[15]
Tom L Kaufmann, Marina Petkovic, Thomas BK Watkins, Emma C Colliver, Sofya Laskina, Nisha Thapa, Darlan C Minussi, Nicholas Navin, Charles Swanton, Peter Van Loo, et al. 2022. MEDICC2: whole-genome doubling aware copy-number phylogenies for cancer evolution. Genome biology 23, 1 (2022), 241.
[16]
Motoo Kimura. 1969. The number of heterozygous nucleotide sites maintained in a finite population due to steady flux of mutations. Genetics 61, 4 (1969), 893.
[17]
Harold W Kuhn. 1955. The Hungarian method for the assignment problem. Naval research logistics quarterly 2, 1--2 (1955), 83--97.
[18]
Jack Kuipers, Katharina Jahn, Benjamin J Raphael, and Niko Beerenwinkel. 2017. Single-cell sequencing data reveal widespread recurrence and loss of mutational hits in the life histories of tumors. Genome research 27, 11 (2017), 1885--1894.
[19]
Marco L Leung, Alexander Davis, Ruli Gao, Anna Casasent, Yong Wang, Emi Sei, Eduardo Vilar, Dipen Maru, Scott Kopetz, and Nicholas E Navin. 2017. Single-cell DNA sequencing reveals a late-dissemination model in metastatic colorectal cancer. Genome research 27, 8 (2017), 1287--1299.
[20]
Nicholas McGranahan and Charles Swanton. 2017. Clonal heterogeneity and tumor evolution: past, present, and the future. Cell 168, 4 (2017), 613--628.
[21]
Peter C Nowell. 1976. The Clonal Evolution of Tumor Cell Populations: Acquired genetic lability permits stepwise selection of variant sublines and underlies tumor progression. Science 194, 4260 (1976), 23--28.
[22]
Katherine L Pogrebniak and Christina Curtis. 2018. Harnessing tumor evolution to circumvent resistance. Trends in Genetics 34, 8 (2018), 639--651.
[23]
David F Robinson and Leslie R Foulds. 1981. Comparison of phylogenetic trees. Mathematical biosciences 53, 1--2 (1981), 131--147.
[24]
Gryte Satas, Simone Zaccaria, Geoffrey Mon, and Benjamin J Raphael. 2020. SCARLET: single-cell tumor phylogeny inference with copy-number constrained mutation losses. Cell systems 10, 4 (2020), 323--332.
[25]
Russell Schwartz and Alejandro A Schäffer. 2017. The evolution of tumour phylogenetics: principles and practice. Nature Reviews Genetics 18, 4 (2017), 213--229.
[26]
Takeaki Uno. 1997. Algorithms for enumerating all perfect, maximum and maximal matchings in bipartite graphs. In Algorithms and Computation: 8th International Symposium, ISAAC'97 Singapore, December 17--19, 1997 Proceedings 8. Springer, 92--101.
[27]
Hamim Zafar, Nicholas Navin, Ken Chen, and Luay Nakhleh. 2019. SiCloneFit: Bayesian inference of population structure, genotype, and phylogeny of tumor clones from single-cell genome sequencing data. Genome research 29, 11 (2019), 1847--1859.
[28]
Hamim Zafar, Nicholas Navin, Luay Nakhleh, and Ken Chen. 2018. Computational approaches for inferring tumor evolution from single-cell genomic data. Current Opinion in Systems Biology 7 (2018), 16--25.

Index Terms

  1. Generalized Matching Distance: Tumor Phylogeny Comparison Beyond the Infinite Sites Assumption

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      BCB '23: Proceedings of the 14th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics
      September 2023
      626 pages
      ISBN:9798400701269
      DOI:10.1145/3584371
      This work is licensed under a Creative Commons Attribution International 4.0 License.

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 04 October 2023

      Check for updates

      Author Tags

      1. cancer
      2. phylogeny
      3. infinite sites assumption
      4. distance measure

      Qualifiers

      • Research-article

      Funding Sources

      Conference

      BCB '23
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 254 of 885 submissions, 29%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 307
        Total Downloads
      • Downloads (Last 12 months)277
      • Downloads (Last 6 weeks)41
      Reflects downloads up to 19 Nov 2024

      Other Metrics

      Citations

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media