A Biclique Approach to Reference-Anchored Gene Blocks and Its Applications to Genomic Islands

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

We formalize a new problem variant in gene-block discovery, denoted Reference-Anchored Gene Blocks (RAGB), given a query sequence Q of length n, representing the gene array of a DNA element, a window size bound d on the length of a substring of interest in Q, and a set of target gene sequences . Our objective is to identify gene blocks in that are centered in a subset q of co-localized genes from Q, and contain genomes from in which the corresponding orthologs of the genes from q are also co-localized. We cast RAGB as a variant of a (colored) biclique problem in bipartite graphs, and analyze its parameterized complexity, as well as the parameterized complexity of other related problems. We give an time algorithm for the uncolored variant of our biclique problem, where m is the number of areas of interest that are parsed from the target sequences, and n and d are as defined earlier. Our algorithm can be adapted to compute all maximal bicliques in the graph within the same time complexity, and to handle edge weights with a slight increase to its time complexity. For the colored version of the problem, our algorithm has a time complexity of . We implement the algorithm and exemplify its application to the data mining of proteobacterial gene blocks that are centered in predicted proteobacterial genomic islands, leading to the identification of putatively mobilized clusters of virulence, pathogenicity, and resistance genes.

Original languageEnglish
Pages (from-to)214-235
Number of pages22
JournalJournal of Computational Biology
Volume25
Issue number2
DOIs
StatePublished - 1 Feb 2018

Keywords

  • Bicliques
  • Bipartite graphs
  • Gene blocks
  • Genomic islands
  • Parameterized complexity

ASJC Scopus subject areas

  • Modeling and Simulation
  • Molecular Biology
  • Genetics
  • Computational Mathematics
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'A Biclique Approach to Reference-Anchored Gene Blocks and Its Applications to Genomic Islands'. Together they form a unique fingerprint.

Cite this