TY - JOUR
T1 - Expanding and Enriching the LncRNA Gene–Disease Landscape Using the GeneCaRNA Database
AU - Aggarwal, Shalini
AU - Rosenblum, Chana
AU - Gould, Marshall
AU - Ziman, Shahar
AU - Barshir, Ruth
AU - Zelig, Ofer
AU - Guan-Golan, Yaron
AU - Iny-Stein, Tsippi
AU - Safran, Marilyn
AU - Pietrokovski, Shmuel
AU - Lancet, Doron
N1 - Publisher Copyright:
© 2024 by the authors.
PY - 2024/6/1
Y1 - 2024/6/1
N2 - The GeneCaRNA human gene database is a member of the GeneCards Suite. It presents ~280,000 human non-coding RNA genes, identified algorithmically from ~690,000 RNAcentral transcripts. This expands by ~tenfold the ncRNA gene count relative to other sources. GeneCaRNA thus contains ~120,000 long non-coding RNAs (LncRNAs, >200 bases long), including ~100,000 novel genes. The latter have sparse functional information, a vast terra incognita for future research. LncRNA genes are uniformly represented on all nuclear chromosomes, with 10 genes on mitochondrial DNA. Data obtained from MalaCards, another GeneCards Suite member, finds 1547 genes associated with 1 to 50 diseases. About 15% of the associations portray experimental evidence, with cancers tending to be multigenic. Preliminary text mining within GeneCaRNA discovers interactions of lncRNA transcripts with target gene products, with 25% being ncRNAs and 75% proteins. GeneCaRNA has a biological pathways section, which at present shows 131 pathways for 38 lncRNA genes, a basis for future expansion. Finally, our GeneHancer database provides regulatory elements for ~110,000 lncRNA genes, offering pointers for co-regulated genes and genetic linkages from enhancers to diseases. We anticipate that the broad vista provided by GeneCaRNA will serve as an essential guide for further lncRNA research in disease decipherment.
AB - The GeneCaRNA human gene database is a member of the GeneCards Suite. It presents ~280,000 human non-coding RNA genes, identified algorithmically from ~690,000 RNAcentral transcripts. This expands by ~tenfold the ncRNA gene count relative to other sources. GeneCaRNA thus contains ~120,000 long non-coding RNAs (LncRNAs, >200 bases long), including ~100,000 novel genes. The latter have sparse functional information, a vast terra incognita for future research. LncRNA genes are uniformly represented on all nuclear chromosomes, with 10 genes on mitochondrial DNA. Data obtained from MalaCards, another GeneCards Suite member, finds 1547 genes associated with 1 to 50 diseases. About 15% of the associations portray experimental evidence, with cancers tending to be multigenic. Preliminary text mining within GeneCaRNA discovers interactions of lncRNA transcripts with target gene products, with 25% being ncRNAs and 75% proteins. GeneCaRNA has a biological pathways section, which at present shows 131 pathways for 38 lncRNA genes, a basis for future expansion. Finally, our GeneHancer database provides regulatory elements for ~110,000 lncRNA genes, offering pointers for co-regulated genes and genetic linkages from enhancers to diseases. We anticipate that the broad vista provided by GeneCaRNA will serve as an essential guide for further lncRNA research in disease decipherment.
KW - GeneCaRNA
KW - GeneCards
KW - human noncoding RNA
KW - lncRNA
KW - long non-coding RNA
KW - ncRNA
KW - non-coding RNA
KW - subclassification
UR - https://www.scopus.com/pages/publications/85197302149
U2 - 10.3390/biomedicines12061305
DO - 10.3390/biomedicines12061305
M3 - Article
C2 - 38927512
AN - SCOPUS:85197302149
SN - 2227-9059
VL - 12
JO - Biomedicines
JF - Biomedicines
IS - 6
M1 - 1305
ER -