TY - JOUR
T1 - A New Paradigm for Identifying Reconciliation-Scenario Altering Mutations Conferring Environmental Adaptation
AU - Zoller, Roni
AU - Zehavi, Meirav
AU - Ziv-Ukelson, Michal
N1 - Funding Information:
This work was supported by ISF Grant Nos. 1176/18 and 939/18 and by the Lynn and William Frankel Center for Computer Science.
Publisher Copyright:
© Copyright 2020, Mary Ann Liebert, Inc., publishers 2020.
PY - 2020/11/1
Y1 - 2020/11/1
N2 - An important goal in microbial computational genomics is to identify crucial events in the evolution of a gene that severely alter the duplication, loss, and mobilization patterns of the gene within the genomes in which it disseminates. In this article, we formalize this microbiological goal as a new pattern-matching problem in the domain of gene tree and species tree reconciliation, denoted "Reconciliation-Scenario Altering Mutation (RSAM) Discovery."We propose an O(m⋅n⋅k) time algorithm to solve this new problem, where m and n are the number of vertices of the input gene tree and species tree, respectively, and k is a user-specified parameter that bounds from above the number of optimal solutions of interest. The algorithm first constructs a hypergraph representing the k highest scoring reconciliation scenarios between the given gene tree and species tree, and then interrogates this hypergraph for subtrees matching a prespecified RSAM pattern. Our algorithm is optimal in the sense that the number of hypernodes in the hypergraph can be lower bounded by ω(m⋅n⋅k). We implement the new algorithm as a tool, called RSAM-finder, and demonstrate its application to the identification of RSAMs in toxins and drug resistance elements across a data set spanning hundreds of species.
AB - An important goal in microbial computational genomics is to identify crucial events in the evolution of a gene that severely alter the duplication, loss, and mobilization patterns of the gene within the genomes in which it disseminates. In this article, we formalize this microbiological goal as a new pattern-matching problem in the domain of gene tree and species tree reconciliation, denoted "Reconciliation-Scenario Altering Mutation (RSAM) Discovery."We propose an O(m⋅n⋅k) time algorithm to solve this new problem, where m and n are the number of vertices of the input gene tree and species tree, respectively, and k is a user-specified parameter that bounds from above the number of optimal solutions of interest. The algorithm first constructs a hypergraph representing the k highest scoring reconciliation scenarios between the given gene tree and species tree, and then interrogates this hypergraph for subtrees matching a prespecified RSAM pattern. Our algorithm is optimal in the sense that the number of hypernodes in the hypergraph can be lower bounded by ω(m⋅n⋅k). We implement the new algorithm as a tool, called RSAM-finder, and demonstrate its application to the identification of RSAMs in toxins and drug resistance elements across a data set spanning hundreds of species.
KW - PHYLOGENETIC TREES
KW - algorithms
KW - dynamic programming
UR - http://www.scopus.com/inward/record.url?scp=85095861615&partnerID=8YFLogxK
U2 - 10.1089/cmb.2019.0472
DO - 10.1089/cmb.2019.0472
M3 - Article
AN - SCOPUS:85095861615
VL - 27
SP - 1561
EP - 1580
JO - Journal of Computational Biology
JF - Journal of Computational Biology
SN - 1066-5277
IS - 11
ER -