Deep Learning-based Predictive Model of mRNA Vaccine Deterioration: An Analysis of the Stanford COVID-19 mRNA Vaccine Dataset
Main Article Content
Abstract
The emergence of SARS-CoV-2, the virus responsible for the COVID-19 pandemic, has resulted in a global health crisis leading to widespread illness, death, and daily life disruptions. Having a vaccine for COVID-19 is crucial to controlling the spread of the virus which will help to end the pandemic and restore normalcy to society. Messenger RNA (mRNA) molecules vaccine has led the way as the swift vaccine candidate for COVID-19, but it faces key probable restrictions including spontaneous deterioration. To address mRNA degradation issues, Stanford University academics and the Eterna community sponsored a Kaggle competition.This study aims to build a deep learning (DL) model which will predict deterioration rates at each base of the mRNA molecule. A sequence DL model based on a bidirectional gated recurrent unit (GRU) is implemented. The model is applied to the Stanford COVID-19 mRNA vaccine dataset to predict the mRNA sequences deterioration by predicting five reactivity values for every base in the sequence, namely reactivity values, deterioration rates at high pH, at high temperature, at high pH with Magnesium, and at high temperature with Magnesium. The Stanford COVID-19 mRNA vaccine dataset is split into the training set, validation set, and test set. The bidirectional GRU model minimizes the mean column wise root mean squared error (MCRMSE) of deterioration rates at each base of the mRNA sequence molecule with a value of 0.32086 for the test set which outperformed the winning models with a margin of (0.02112). This study would help other researchers better understand how to forecast mRNA sequence molecule properties to develop a stable COVID-19 vaccine.
Received 02/02/2023,
Revised 04/08/2023,
Accepted 06/08/2023,
Published 31/08/2023
Article Details
This work is licensed under a Creative Commons Attribution 4.0 International License.
How to Cite
References
Cucinotta D, Vanelli M. WHO declares COVID-19 a pandemic. Acta Bio Medica: Atenei Parmensis. 2020; 91(1): 157. https://dx.doi.org/10.23750/abm.v91i1.9397
Sulayman N. Comparative study for automated coronavirus detection in CT images with transfer learning. Damascus Uni J Eng Sci, 2022; 38(5): 245-255. http://journal.damascusuniversity.edu.sy/index.php/engj/article/download/2248/1775
Mohammed SK, Taha MM, Taha EM, Mohammad MN. Cluster analysis of biochemical markers as predictor of COVID-19 severity. Baghdad Sci J. 2022; 19(6 (Suppl.)): 1423-1429 https://dx.doi.org/10.21123/bsj.2022.4242
Zaki SM, Jaber MM, Kashmoola MA. Diagnosing COVID-19 Infection in Chest X-Ray Images Using Neural Network. Baghdad Sci J. 2022; 19(6): 1356-1361. http://dx.doi.org/10.21123/bsj.2022.5965
Mohammed TS, Sultan AI, Alheeti KM, Aljebory KM, Sultan HI, Al-Ani MS. COVID-19 Diagnosis Using Spectral and Statistical Analysis of Cough Recordings Based on the Combination of SVD and DWT. Baghdad Sci J. 2022; 0372-. https://doi.org/10.21123/bsj.2022.6516
WHO Coronavirus (COVID-19) Dashboard, https://covid19.who.int/ (accessed on 11 August 2022)
Corbett KS, Edwards DK, Leist SR, Abiona OM, Boyoglu-Barnum S, Gillespie RA, Himansu S, Schäfer A, Ziwawo CT, DiPiazza AT, Dinnon KH. SARS-CoV-2 mRNA vaccine design enabled by prototype pathogen preparedness. Nature. 2020; 586(7830): 567-71. https://doi.org/10.1038/s41586-020-2622-0
Kremsner PG, Mann P, Kroidl A, Leroux-Roels I, Schindler C, Gabor JJ, et al. Safety and immunogenicity of an mRNA-lipid nanoparticle vaccine candidate against SARS-CoV-2: A phase 1 randomized clinical trial. Wien Klin Wochenschr. 2021; 133(17-18): 931-41. https://doi.org/10.1007/s00508-021-01922-y
Ribonucleic Acid (RNA). https://www.genome.gov/genetics-glossary/RNA-Ribonucleic-Acid (Accessed on 11 August 2022)
Lyngsø RB, Pedersen CN. RNA pseudoknot prediction in energy-based models. J Comput Biol. 2000; 7(3-4): 409-27. https://doi.org/10.1089/106652700750050862
Pardi N, Hogan MJ, Porter FW, Weissman D. mRNA vaccines—a new era in vaccinology. Nat Rev Drug Discov. 2018; 17(4): 261-79. https://doi.org/10.1038/nrd.2017.243
Bouarara HA. N-Gram-Codon and Recurrent Neural Network (RNN) to Update Pfizer-BioNTech mRNA Vaccine. Int J Comput Intell Syst. 2022; 14(1): 1-24. http://doi.org/10.4018/IJSSCI.305838
Ing SH, Abdullah AA, Harun NH, Kanaya S. COVID-19 mRNA Vaccine Degradation Prediction Using LR and LGBM Algorithms. J Phys Conf Ser. 2021; 1997(1): 012005. IOP Publishing. http://doi.org/10.1088/1742-6596/1997/1/012005
Muneer A, Fati SM, Akbar NA, Agustriawan D, Wahyudi ST. iVaccine-Deep: Prediction of COVID-19 mRNA vaccine degradation using deep learning. J King Saud Univ- Comput. Inf Sci. 2022; 34(9): 7419-32. https://doi.org/10.1016/j.jksuci.2021.10.001
Krishna UV, Premjith B, Soman KP. A Comparative Study of Pre-trained Gene Embeddings for COVID-19 mRNA Vaccine Degradation Prediction. Proc 7th Int Conf Math Comput : pp. 301-308. Singapore: Springer Singapore. https://doi.org/10.1007/978-981-16-6890-6_22
Imran SA, Islam MT, Shahnaz C, Islam MT, Imam OT, Haque M. COVID-19 mRNA Vaccine Degradation Prediction using Regularized LSTM Model. Int Women Eng Conf. 2020; (pp. 328-331). IEEE. https://doi.org/10.1109/WIECON-ECE52138.2020.9398044
Openvaccine: Covid-19 mrna vaccine degradation prediction, Sep. 2020; https://www.kaggle.com/ competitions/stanford-covid-vaccine/data
Danaee P, Rouches M, Wiley M, Deng D, Huang L, Hendrix D. bpRNA: large-scale automated annotation and analysis of RNA secondary structure. Nucleic Acids Res. 2018; 46(11): 5381-94. https://doi.org/10.1093/nar/gky285
Qaid TS, Mazaar H, Alqahtani MS, Raweh AA, Alakwaa W. Deep sequence modelling for predicting COVID-19 mRNA vaccine degradation. Peer J Comput Sci. 2021; 7: e597. https://doi.org10.7717/peerj-cs.597
Cho K, Van Merriënboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, et al. Learning phrase representations using RNN encoder-decoder for statistical machine translation. Proc Conf Empir Methods Nat Lang Proc. 2014; 1406.1078. https://doi.org/10.48550/arXiv.1406.1078
Cho K, Van Merriënboer B, Bahdanau D, Bengio Y. On the properties of neural machine translation: Encoder-decoder approaches. Proceedings of SSST-8, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation. 2014; 1409.1259. https://doi.org/10.48550/arXiv.1409.1259
Schoenmaker L, Witzigmann D, Kulkarni JA, Verbeke R, Kersten G, Jiskoot W, et al. mRNA-lipid nanoparticle COVID-19 vaccines: Structure and stability. Int J Pharm X. 2021; 601:120586. https://doi.org/10.1016/j.ijpharm.2021.120586