posted on 2024-06-14, 13:33authored bySneha Mittal, Milan Kumar Jena, Biswarup Pathak
RNA sequence information holds immense potential as a
drug target
for diagnosing various RNA-mediated diseases and viral/bacterial infections.
Massively parallel complementary DNA (c-DNA) sequencing helps but
results in a loss of valuable information about RNA modifications,
which are often associated with cancer evolution. Herein, we report
machine learning (ML)-assisted high throughput RNA sequencing with
inherent RNA modification detection using a single-molecule, long-read,
and label-free quantum tunneling approach. The ML tools achieve classification
accuracy as high as 100% in decoding RNA and 98% for decoding both
RNA and RNA modifications simultaneously. The relationships between
input features and target values have been well examined through Shapley
additive explanations. Furthermore, transmission and sensitivity readouts
enable the recognition of RNA and its modifications with good selectivity
and sensitivity. Our approach represents a starting point for ML-assisted
direct RNA sequencing that can potentially decode RNA and its epigenetic
modifications at the molecular level.