TY - GEN
T1 - Basecalling by Statistical Profiling and Hardware-Accelerated Convolutional Neural Network
AU - Kra, Yehuda
AU - Rudin, Yehuda
AU - Fish, Alex
AU - Teman, Adam
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024/1/1
Y1 - 2024/1/1
N2 - Oxford Nanopore Technologies (ONT) genome sequencing technology enables the decoding of DNA and RNA sequences by monitoring electrical current fluctuations as nucleic acids pass through a protein nanopore. This work focuses on basecalling, which is the process of decoding these signals to detect a specific sequence. We explore both analytical and machine learning methods based on statistical distribution profiles of read currents per short sub-sequences, referred to as k-mers. Initially, we apply t-statistics to categorize each k-mer according to a predictive statistical model. Additionally, we investigate the use of a Convolutional Neural Network (CNN) for basecalling, where the input is an image representing the statistical profile of the raw data. This CNN model is deployed on a hardware acceleration platform to optimize energy and performance efficiency. Our findings exhibit promising accuracy, paving the way for cost-effective Nanopore-based sequencing applications.
AB - Oxford Nanopore Technologies (ONT) genome sequencing technology enables the decoding of DNA and RNA sequences by monitoring electrical current fluctuations as nucleic acids pass through a protein nanopore. This work focuses on basecalling, which is the process of decoding these signals to detect a specific sequence. We explore both analytical and machine learning methods based on statistical distribution profiles of read currents per short sub-sequences, referred to as k-mers. Initially, we apply t-statistics to categorize each k-mer according to a predictive statistical model. Additionally, we investigate the use of a Convolutional Neural Network (CNN) for basecalling, where the input is an image representing the statistical profile of the raw data. This CNN model is deployed on a hardware acceleration platform to optimize energy and performance efficiency. Our findings exhibit promising accuracy, paving the way for cost-effective Nanopore-based sequencing applications.
UR - http://www.scopus.com/inward/record.url?scp=85199260135&partnerID=8YFLogxK
U2 - 10.1109/PRIME61930.2024.10559692
DO - 10.1109/PRIME61930.2024.10559692
M3 - Conference contribution
AN - SCOPUS:85199260135
T3 - 2024 19th Conference on Ph.D Research in Microelectronics and Electronics, PRIME 2024
BT - 2024 19th Conference on Ph.D Research in Microelectronics and Electronics, PRIME 2024
PB - Institute of Electrical and Electronics Engineers
T2 - 19th Conference on Ph.D Research in Microelectronics and Electronics, PRIME 2024
Y2 - 9 June 2024 through 12 June 2024
ER -