Virus community spread mathematical modeling

One of basic model is SIR model which has a good potential for describing epidemics. In this model $S$ denotes susceptible people, which are people that can be infected. The variable $I$ denotes the infected people and $R$ denotes the recovered ones. There are some dynamic equations that interprets the interaction of these three sets. Also, based on the virus behaviours, you can subdivide these sets and give them some sort of weights.

There is a pandemic parameter $R_0$ which plays a crucial rule in this model. For example, $1-\frac{1}{R_0}$ denotes the percent of population that must be quarantined or vaccinated. For example for COVID-19, this number belong to the interval $[0.5,0.75]$. In this model you can take many other conditions and see the effects.

You can search about $SIR$ model and find many valuable things. Specially in plus.math.org page.


this wikipedia article Mathematical modelling of infectious disease may be a good starting point; epidemiology mathematical models is a combination of terms that does the magic with e.g. google.

Introduction to Mathematical Models of the Epidemiology & Control of Infectious Diseases is a recent publication.

COVID-19: Models, Mathematics, and Myths is a recent discussion of mathematical models for COVID-19 disease spread


There is actually a very simple model that works reasonably well over time periods with little change. No model can work across all time periods because various communities (or cities) change their behaviour drastically at certain time points. But for each interval during which the community's behaviour is roughly constant, the following model seems to hold quite well for sufficiently large communities:

$x = a+\frac{n}{2}·\Big(\tanh\big((t-c)·\frac{r}{2}\big)+1\Big)$, where $x$ is the total number of cases and $t$ is time, for some constants $a,n,c,r$.

What do these constants mean? There are two main kinds of drastic behaviour change that marks the start of the current interval.

The first kind introduces new infected people into the community (imported cases) after it has already stabilized, in which case $a$ is the total number of previous cases and $n$ is the total number of new cases that will appear under current conditions (i.e. if the current interval has no endpoint). The second kind changes the connectivity graph (e.g. lockdown) or the susceptibility of people to the virus (e.g. wearing mask), in which case $a = 0$ and $n$ is the total number of cases that will result under current conditions.

$c$ is simply the inflection point at which the rate of growth is maximum in the current interval.

$r$ is the rate parameter for exponential growth from (roughly) the same starting amount. Specifically, both $n·\exp((t-c)·r)$ and $\frac{n}{2}·\Big(\tanh\big((t-c)·\frac{r}{2}\big)+1\Big)$ are asymptotically the same as $t→-∞$, but diverge from one another as $t→∞$.

Why this relation? Well, consider the following differential equation:

$\frac{dy}{dt} = s·y·(n-y)$ where $y=x-a$.

It assumes that the rate of increase in cases is proportional to both the current number $y$ of cases that have been allowed to infect others and the number of cases that would be but have not yet been infected. These are obviously motivated by the fact that the current conditions will lead to some final total and the virus infection rate will be proportional to the number of infected-but-not-isolated cases and the number of to-be-infected cases, and the fact that the infected-but-not-isolated cases is proportional to the allowed-to-infect-others cases.

Solving it gives $\ln(y)-\ln(n-y) = s·t + k$ for some constants $s,k$ and hence we obtain $y = n·\Big(1-\frac1{\exp(s·t+k)+1}\Big) = \frac{n}{2}·\Big(\tanh\big(\frac{1}{2}(s·t+k)\big)+1\Big)$ as desired.