TY - JOUR
T1 - GeneCaRNA
T2 - A Comprehensive Gene-centric Database of Human Non-coding RNAs in the GeneCards Suite
AU - Barshir, Ruth
AU - Fishilevich, Simon
AU - Iny-Stein, Tsippi
AU - Zelig, Ofer
AU - Mazor, Yaron
AU - Guan-Golan, Yaron
AU - Safran, Marilyn
AU - Lancet, Doron
N1 - Publisher Copyright: © 2021 The Authors
PY - 2021/5/28
Y1 - 2021/5/28
N2 - Non-coding RNA (ncRNA) genes assume increasing biological importance, with growing associations with diseases. Many ncRNA sources are transcript-centric, but for non-coding variant analysis and disease decipherment it is essential to transform this information into a comprehensive set of genome-mapped ncRNA genes. We present GeneCaRNA, a new all-inclusive gene-centric ncRNA database within the GeneCards Suite. GeneCaRNA information is integrated from four community-backed data structures: the major transcript database RNAcentral with its 20 encompassed databases, and the ncRNA entries of three major gene resources HGNC, Ensembl and NCBI Gene. GeneCaRNA presents 219,587 ncRNA gene pages, a 7-fold increase from those available in our three gene mining sources. Each ncRNA gene has wide-ranging annotation, mined from >100 worldwide sources, providing a powerful GeneCards-leveraged search. The latter empowers VarElect, our disease-gene interpretation tool, allowing one to systematically decipher ncRNA variants. The combined power of GeneCaRNA with GeneHancer, our regulatory elements database, facilitates wide-ranging scrutiny of the non-coding terra incognita of gene networks and whole genome analyses.
KW - RNAcentral
KW - comprehensive ncRNA compendium
KW - disease-gene interpretation
KW - non-coding universe
KW - whole genome sequencing
UR - http://www.scopus.com/inward/record.url?scp=85103046508&partnerID=8YFLogxK
U2 - 10.1016/j.jmb.2021.166913
DO - 10.1016/j.jmb.2021.166913
M3 - Article
C2 - 33676929
AN - SCOPUS:85103046508
SN - 0022-2836
VL - 433
JO - Journal of Molecular Biology
JF - Journal of Molecular Biology
IS - 11
M1 - 166913
ER -