The COVID-19 Enigma: AI Models as the Saviour
The virus continues to outsmart the world through a complex interplay of infection, mortality, immunity and reinfection. The disease has posed a threat to mankind through its diverse clinical presentation, controversial evidence for treatment, fast-tracked vaccine development and unclear systemic implications. Artificial intelligence as the saviour... (https://www.cmu.edu/iii/about/news/2020/covid-ai-predictions.html).
The AI approach: Genomic Surveillance
Genomic surveillance means tracking pathogens (bacteria, viruses or any other disease-causing organisms) using genomic sequencing. Examining a pathogen's sequence over time makes it possible to track its evolution and to note any change that affects its biological properties. Biological sequences contain a wealth of information that can be exploited for genomic surveillance.
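As a minimal illustration of noting changes in a sequence, the sketch below compares a sample against a reference and lists the positions where they differ. It assumes the two sequences are already aligned and of equal length; the function name `point_differences` and the toy sequences are hypothetical and are not taken from any surveillance pipeline.

```python
def point_differences(reference: str, sample: str) -> list[tuple[int, str, str]]:
    """Return (1-based position, reference base, sample base) for each mismatch."""
    if len(reference) != len(sample):
        # Assumption of this sketch: inputs are pre-aligned, so lengths match.
        raise ValueError("sequences must be pre-aligned and of equal length")
    return [
        (i + 1, ref_base, sample_base)
        for i, (ref_base, sample_base) in enumerate(zip(reference, sample))
        if ref_base != sample_base
    ]


# Toy sequences for illustration only, not real viral data.
reference = "ATGTTTGTTTTT"
sample = "ATGTTCGTTTTT"
print(point_differences(reference, sample))
```

Real surveillance works on full genomes and needs a proper alignment step first; this only shows the basic idea of comparing a new sequence with a known one.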
The Discovery: Strainflow
An epidemiological early warning system to predict new caseloads in various countries. The spike protein latent space representation learned by the Strainflow model could be used as a proxy to capture the spatiotemporal diversity of emerging SARS-CoV-2 strains across different countries. (https://www.frontiersin.org/articles/10.3389/fgene.2022.858252/full)
What is Strainflow?
A "one of a kind" early warning system for the emergence of new variants of concern and case surges. It is a supervised and causally predictive model that uses unsupervised latent space features of SARS-CoV-2 genome sequences. Strainflow was trained and validated on 0.9 million sequences collected until June 2021, and counting…
Strainflow captured the rise in cases 2 months ahead of the Delta and Omicron surges in most countries, including predicting a surge in India as early as the beginning of November 2021.
The Novelty
An approach for analyzing emerging strains based on the latent space of the nucleotide sequences that code for the spike protein. Nucleotide sequences were chosen instead of protein sequences to capture and track variations that may not have immediate functional consequences.
How does it work?
- Data-driven, de-novo approach.
- Uses complex mixtures for predictions.
- A smart approach that bypasses the need for expert understanding of the effects of individual mutations.
- Eases the difficult task of attending simultaneously to many pieces of information, such as multiple codons.
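To make the idea of treating a sequence as many codons at once more concrete, here is a minimal sketch that splits a coding sequence into codons and counts which codons appear near each other. Counts like these are the kind of input an embedding model could learn a latent space from. This is an illustrative assumption, not the Strainflow implementation; the helper names `to_codons` and `context_pairs`, the `window` parameter and the toy sequence are all hypothetical.

```python
from collections import Counter


def to_codons(seq: str) -> list[str]:
    """Split a coding sequence into non-overlapping 3-base codons.

    Any trailing partial codon (1 or 2 leftover bases) is dropped.
    """
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def context_pairs(codons: list[str], window: int = 2) -> Counter:
    """Count (codon, neighbour) pairs for neighbours within `window` positions."""
    pairs = Counter()
    for i, codon in enumerate(codons):
        lo = max(0, i - window)
        hi = min(len(codons), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                pairs[(codon, codons[j])] += 1
    return pairs


# Toy sequence for illustration only, not real spike gene data.
codons = to_codons("ATGTTTGTTTTTCT")
print(codons)
print(context_pairs(codons, window=1))
```

Working at the codon level keeps synonymous changes visible, which fits the choice of nucleotide sequences over protein sequences described above.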
The Effectiveness
- An accurate sense of the sharpness of an infection surge.
- Predicting the probability of a wave with a two-month lead time.
- Providing enough time for the healthcare systems to be prepared.
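A two-month lead time can be pictured as pairing each month's signal with the caseload observed two months later, so that a model learns from the past signal to anticipate future cases. The sketch below shows only that pairing step, under the assumption of monthly data; the function name `lagged_pairs` and all numbers are hypothetical toy values, not Strainflow outputs.

```python
def lagged_pairs(signal: list[float], cases: list[float], lead: int = 2) -> list[tuple[float, float]]:
    """Pair signal[t] with cases[t + lead] for every month t where both exist."""
    if len(signal) != len(cases):
        raise ValueError("signal and cases must cover the same months")
    if lead < 0:
        raise ValueError("lead must be non-negative")
    # Drop the last `lead` signal values and the first `lead` case values
    # so that each remaining signal lines up with the caseload `lead` months on.
    return list(zip(signal[:len(signal) - lead], cases[lead:]))


# Toy monthly values for illustration only.
monthly_signal = [0.1, 0.4, 0.9, 0.3]
monthly_cases = [100, 120, 400, 900]
print(lagged_pairs(monthly_signal, monthly_cases, lead=2))
```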
Strainflow Dashboard
An interactive, publicly available web application built on the Strainflow model and updated on a monthly basis: http://strainflow.tavlab.iiitd.edu.in/