Discovery of multi-operon colinear syntenic blocks in microbial genomes

Dina Svetlitsky, Tal Dagan, Michal Ziv-Ukelson

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

Motivation: An important task in comparative genomics is to detect functional units by analyzing gene-context patterns. Colinear syntenic blocks (CSBs) are groups of genes that are consistently encoded in the same neighborhood and in the same order across a wide range of taxa. Such CSBs are likely essential for the regulation of gene expression in prokaryotes. Recent results indicate that colinearity can be conserved across multiple operons, thus motivating the discovery of multi-operon CSBs. This computational task raises scalability challenges in large datasets. Results: We propose an efficient algorithm for the discovery of cross-strand multi-operon CSBs in large genomic datasets. The proposed algorithm uses match-point arithmetic, which is scalable for large datasets of microbial genomes in terms of running time and space requirements. The algorithm is implemented and incorporated into a tool with a graphical user interface, called CSBFinder-S. We applied CSBFinder-S to data mine 1485 prokaryotic genomes and analyzed the identified cross-strand CSBs. Our results indicate that most of the syntenic blocks are exclusively colinear. Additional results indicate that transcriptional regulation by overlapping transcriptional genes is abundant in bacteria. We demonstrate the utility of CSBFinder-S to identify common function of the gene-pair PulEF in multiple contexts, including Type 2 Secretion System, Type 4 Pilus System and DNA uptake machinery.

Original languageEnglish
Pages (from-to)I21-I29
JournalBioinformatics
Volume36
DOIs
StatePublished - 1 Jan 2020

Fingerprint

Dive into the research topics of 'Discovery of multi-operon colinear syntenic blocks in microbial genomes'. Together they form a unique fingerprint.

Cite this