TY - GEN
T1 - A new paradigm for identifying reconciliation-scenario altering mutations conferring environmental adaptation
AU - Zoller, Roni
AU - Zehavi, Meirav
AU - Ziv-Ukelson, Michal
N1 - Funding Information:
Funding Supported by ISF grants no. 1176/18 and no. 939/18. This work was supported by the Lynn and William Frankel Center for Computer Science.
Publisher Copyright:
© Roni Zoller, Meirav Zehavi, and Michal Ziv-Ukelson; licensed under Creative Commons License CC-BY
PY - 2019/9/1
Y1 - 2019/9/1
N2 - An important goal in microbial computational genomics is to identify crucial events in the evolution of a gene that severely alter the duplication, loss and mobilization patterns of the gene within the genomes in which it disseminates. In this paper, we formalize this microbiological goal as a new pattern-matching problem in the domain of Gene tree and Species tree reconciliation, denoted “Reconciliation-Scenario Altering Mutation (RSAM) Discovery”. We propose an O(m · n · k) time algorithm to solve this new problem, where m and n are the number of vertices of the input Gene tree and Species tree, respectively, and k is a user-specified parameter that bounds from above the number of optimal solutions of interest. The algorithm first constructs a hypergraph representing the k highest scoring reconciliation scenarios between the given Gene tree and Species tree, and then interrogates this hypergraph for subtrees matching a pre-specified RSAM Pattern. Our algorithm is optimal in the sense that the number of hypernodes in the hypergraph can be lower bounded by Ω(m · n · k). We implement the new algorithm as a tool, denoted RSAM-finder, and demonstrate its application to the identification of RSAMs in toxins and drug resistance elements across a dataset spanning hundreds of species.
AB - An important goal in microbial computational genomics is to identify crucial events in the evolution of a gene that severely alter the duplication, loss and mobilization patterns of the gene within the genomes in which it disseminates. In this paper, we formalize this microbiological goal as a new pattern-matching problem in the domain of Gene tree and Species tree reconciliation, denoted “Reconciliation-Scenario Altering Mutation (RSAM) Discovery”. We propose an O(m · n · k) time algorithm to solve this new problem, where m and n are the number of vertices of the input Gene tree and Species tree, respectively, and k is a user-specified parameter that bounds from above the number of optimal solutions of interest. The algorithm first constructs a hypergraph representing the k highest scoring reconciliation scenarios between the given Gene tree and Species tree, and then interrogates this hypergraph for subtrees matching a pre-specified RSAM Pattern. Our algorithm is optimal in the sense that the number of hypernodes in the hypergraph can be lower bounded by Ω(m · n · k). We implement the new algorithm as a tool, denoted RSAM-finder, and demonstrate its application to the identification of RSAMs in toxins and drug resistance elements across a dataset spanning hundreds of species.
KW - Gene tree
KW - Reconciliation
KW - Species tree
UR - http://www.scopus.com/inward/record.url?scp=85072632339&partnerID=8YFLogxK
U2 - 10.4230/LIPIcs.WABI.2019.9
DO - 10.4230/LIPIcs.WABI.2019.9
M3 - Conference contribution
AN - SCOPUS:85072632339
T3 - Leibniz International Proceedings in Informatics, LIPIcs
SP - 9:1-9:13
BT - 19th International Workshop on Algorithms in Bioinformatics, WABI 2019
A2 - Huber, Katharina T.
A2 - Gusfield, Dan
PB - Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
T2 - 19th International Workshop on Algorithms in Bioinformatics, WABI 2019
Y2 - 8 September 2019 through 10 September 2019
ER -