TY - JOUR
T1 - Computational modeling of mRNA degradation dynamics using deep neural networks
AU - Yaish, Ofir
AU - Orenstein, Yaron
N1 - Funding Information:
The authors gratefully acknowledge the support of Intel Corporation for giving access to the IntelVRAI DevCloud platform used for part of this work. They gratefully acknowledge the support of NVIDIA Corporation with the donation of the Titan V GPU used for this research.
Funding Information:
This research was partially supported by the Israeli Council for Higher Education (CHE) via Data Science Research Center, Ben-Gurion University of the Negev, Israel.
Publisher Copyright:
© 2022 Oxford University Press. All rights reserved.
PY - 2022/2/15
Y1 - 2022/2/15
N2 - Motivation: messenger RNA (mRNA) degradation plays critical roles in post-transcriptional gene regulation. A major component of mRNA degradation is determined by 30-UTR elements. Hence, researchers are interested in studying mRNA dynamics as a function of 30-UTR elements. A recent study measured the mRNA degradation dynamics of tens of thousands of 30-UTR sequences using a massively parallel reporter assay. However, the computational approach used to model mRNA degradation was based on a simplifying assumption of a linear degradation rate. Consequently, the underlying mechanism of 30-UTR elements is still not fully understood. Results: Here, we developed deep neural networks to predict mRNA degradation dynamics and interpreted the networks to identify regulatory elements in the 30-UTR and their positional effect. Given an input of a 110 nt-long 30-UTR sequence and an initial mRNA level, the model predicts mRNA levels of eight consecutive time points. Our deep neural networks significantly improved prediction performance of mRNA degradation dynamics compared with extant methods for the task. Moreover, we demonstrated that models predicting the dynamics of two identical 30-UTR sequences, differing by their poly(A) tail, performed better than single-task models. On the interpretability front, by using Integrated Gradients, our convolutional neural networks (CNNs) models identified known and novel cis-regulatory sequence elements of mRNA degradation. By applying a novel systematic evaluation of model interpretability, we demonstrated that the recurrent neural network models are inferior to the CNN models in terms of interpretability and that random initialization ensemble improves both prediction and interoperability performance. Moreover, using a mutagenesis analysis, we newly discovered the positional effect of various 30-UTR elements.
AB - Motivation: messenger RNA (mRNA) degradation plays critical roles in post-transcriptional gene regulation. A major component of mRNA degradation is determined by 30-UTR elements. Hence, researchers are interested in studying mRNA dynamics as a function of 30-UTR elements. A recent study measured the mRNA degradation dynamics of tens of thousands of 30-UTR sequences using a massively parallel reporter assay. However, the computational approach used to model mRNA degradation was based on a simplifying assumption of a linear degradation rate. Consequently, the underlying mechanism of 30-UTR elements is still not fully understood. Results: Here, we developed deep neural networks to predict mRNA degradation dynamics and interpreted the networks to identify regulatory elements in the 30-UTR and their positional effect. Given an input of a 110 nt-long 30-UTR sequence and an initial mRNA level, the model predicts mRNA levels of eight consecutive time points. Our deep neural networks significantly improved prediction performance of mRNA degradation dynamics compared with extant methods for the task. Moreover, we demonstrated that models predicting the dynamics of two identical 30-UTR sequences, differing by their poly(A) tail, performed better than single-task models. On the interpretability front, by using Integrated Gradients, our convolutional neural networks (CNNs) models identified known and novel cis-regulatory sequence elements of mRNA degradation. By applying a novel systematic evaluation of model interpretability, we demonstrated that the recurrent neural network models are inferior to the CNN models in terms of interpretability and that random initialization ensemble improves both prediction and interoperability performance. Moreover, using a mutagenesis analysis, we newly discovered the positional effect of various 30-UTR elements.
UR - http://www.scopus.com/inward/record.url?scp=85130458013&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btab800
DO - 10.1093/bioinformatics/btab800
M3 - Article
C2 - 34849591
AN - SCOPUS:85130458013
SN - 1367-4803
VL - 38
SP - 1087
EP - 1101
JO - Bioinformatics
JF - Bioinformatics
IS - 4
ER -