Parallel Maximal Common Subgraphs with Labels for Molecular Biology

  • Wilfried Agbeto
  • , Camille Coti
  • , Vladimir Reinharz

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Advances in graph algorithmics have allowed in-depth study of many natural objects from molecular biology or chemistry to social networks. Particularly in molecular biology and cheminformatics, understanding complex structures by identifying conserved sub-structures is a key milestone towards the artificial design of novel components with specific functions. Given a dataset of structures, we are interested in identifying all maximum common connected partial subgraphs between each pair of graphs, a task notoriously NP-Hard. In this work, we present parallel algorithms over shared and distributed memory to enumerate all maximal connected common sub-graphs between pairs of arbitrary multi-directed graphs with labels on their edges. We offer an implementation of these methods and evaluate their performance on the non-redundant dataset of all known RNA 3D structures. We show that we can compute the exact results in a reasonable time for each pairwise comparison while taking into account a much more diverse set of interactions—resulting in much denser graphs—resulting in an order of magnitude more conserved modules. All code is available at https://gitlab.info.uqam.ca/cbe/pasigraph and results in the branch results.

Original languageEnglish
Title of host publicationParallel Processing and Applied Mathematics - 15th International Conference, PPAM 2024, Revised Selected Papers
EditorsRoman Wyrzykowski, Jack Dongarra, Ewa Deelman, Konrad Karczewski
PublisherSpringer Science and Business Media Deutschland GmbH
Pages131-145
Number of pages15
ISBN (Print)9783031856969
DOIs
StatePublished - 1 Jan 2025
Externally publishedYes
Event15th International Conference on Parallel Processing and Applied Mathematics, PPAM 2024 - Ostrava, Czech Republic
Duration: 8 Sep 202411 Sep 2024

Publication series

NameLecture Notes in Computer Science
Volume15579
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference15th International Conference on Parallel Processing and Applied Mathematics, PPAM 2024
Country/TerritoryCzech Republic
CityOstrava
Period8/09/2411/09/24

Keywords

  • Common subgraphs
  • Molecular structure
  • Parallel algorithms

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Parallel Maximal Common Subgraphs with Labels for Molecular Biology'. Together they form a unique fingerprint.

Cite this