|
@@ -7,48 +7,45 @@
|
|
|
% summary of the content in this chapter
|
|
|
% Version: 01.01.2012
|
|
|
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
|
|
|
-\chapter{Experiments 10}
|
|
|
+\chapter{Experiments}
|
|
|
\label{chap:evaluation}
|
|
|
-In the preceding chapters, we explained the methods (see~\Cref{chap:methods})
|
|
|
-based the theoretical background, that we established in~\Cref{chap:background}.
|
|
|
-In this chapter present the setups and results from the experiments and
|
|
|
-simulations, we ran. First, we discuss the experiments dedicated to identify
|
|
|
-the epidemiological parameters of $\beta$ and $\alpha$ in synthetic and
|
|
|
-real-world data. Second, we examine the reproduction number in synthetic and
|
|
|
-real-world data of Germany. Each section, is divided into a description of the
|
|
|
-experimental setup and the results.
|
|
|
+In ~\Cref{chap:methods}, we explain the methods based the theoretical
|
|
|
+background, that we established in~\Cref{chap:background}. In this chapter, we
|
|
|
+present the setups and results from the experiments and simulations. First, we
|
|
|
+discuss the experiments dedicated to identify the epidemiological transition
|
|
|
+rates of $\beta$ and $\alpha$ in synthetic and real-world data. Second, we
|
|
|
+examine the reproduction number in synthetic and real-world data of Germany.
|
|
|
|
|
|
% -------------------------------------------------------------------
|
|
|
|
|
|
-\section{Identifying the Transition Rates on Real-World and Synthetic Data 5}
|
|
|
+\section{Identifying the Transition Rates}
|
|
|
\label{sec:sir}
|
|
|
In this section, we aim to identify the transmission rate $\beta$ and the
|
|
|
recovery rate $\alpha$ from either synthetic or preprocessed real-world data.
|
|
|
The methodology that we employ to identify the transition rates is described
|
|
|
in~\Cref{sec:pinn:sir}. Meanwhile, the methods we utilize to preprocess the
|
|
|
-real-world data are detailed in~\Cref{sec:preprocessing:rq}.
|
|
|
+real-world data are detailed in~\Cref{sec:preprocessing:rq}. In the first part
|
|
|
+we present the setup of our experiments, then we provide the results including a
|
|
|
+discussion.\\
|
|
|
|
|
|
% -------------------------------------------------------------------
|
|
|
|
|
|
-\subsection{Setup 1}
|
|
|
+\subsection{Setup}
|
|
|
\label{sec:sir:setup}
|
|
|
|
|
|
-In this subsection, we present the configurations for the training of our
|
|
|
-PINNs, which are designed to identify the transition parameters. This
|
|
|
-encompasses the specific parameters for the preprocessing and the configuration
|
|
|
-of the PINN themselves.\\
|
|
|
-
|
|
|
-In order to validate our method, we first generate a dataset of synthetic data.
|
|
|
+\paragraph{Synthetic Data:}In order to validate our method, we first generate a dataset of synthetic data.
|
|
|
We achieve this by solving~\Cref{eq:modSIR} for a given set of parameters.
|
|
|
The parameters are set to $\alpha = \nicefrac{1}{3}$ and $\beta = \nicefrac{1}{2}$.
|
|
|
The size of the population is $N = \expnumber{7.6}{6}$ and the initial amount of
|
|
|
-infectious individuals of is $I_0 = 10$. We conduct the simulation over 150
|
|
|
-days, resulting in a dataset of the form of~\Cref{fig:synthetic_SIR}.\\ In
|
|
|
-order to process the real-world RKI data, it is necessary to preprocess the raw
|
|
|
-data for each state and Germany separately. This is achieved by utilizing a
|
|
|
-recovery queue with a recovery period of 14 days. With regard to population
|
|
|
-size of each state, we set it to the respective value counted at the end of
|
|
|
-2019\footnote{\url{https://de.statista.com/statistik/kategorien/kategorie/8/themen/63/branche/demographie/\#overview}}.
|
|
|
+infectious individuals is $I_0 = 10$. We conduct the simulation over 150
|
|
|
+days, resulting in a dataset of the form of~\Cref{fig:synthetic_SIR}.\\
|
|
|
+
|
|
|
+\paragraph{Real-World Data:}In order to process the real-world RKI data, it is
|
|
|
+necessary to preprocess the raw data for each state and Germany separately.
|
|
|
+This is achieved by utilizing a recovery queue with a recovery period of 14
|
|
|
+days. With regard to population size of each state, we set it to the respective
|
|
|
+value counted at the end of
|
|
|
+2019\footnote{{\tiny \url{https://de.statista.com/statistik/kategorien/kategorie/8/themen/63/branche/demographie/\#overview}}}.
|
|
|
The initial number of infectious individuals is set to the number of infected
|
|
|
people on March 09. 2020 from the dataset. The data we extract spans from
|
|
|
March 09. 2020 to June 22. 2023, encompassing a period of 1200 days and
|
|
@@ -104,129 +101,128 @@ active and severe.
|
|
|
\label{fig:datasets_sir}
|
|
|
\end{figure}
|
|
|
|
|
|
-The PINN that we utilize comprises of seven hidden layers with twenty neurons
|
|
|
-each, and an activation function of ReLU. We employ the Adam optimizer and the
|
|
|
-polynomial scheduler of the PyTorch library, for training, with a base learning rate
|
|
|
-of $\expnumber{1}{-3}$. We train the model for 10000 epochs to extract the
|
|
|
-parameters. For each set of parameters, we conduct five iterations to
|
|
|
-demonstrate stability of the values. The configuration is similar to the
|
|
|
-configuration, that Shaier \etal ~\cite{Shaier2021} use for their work aside
|
|
|
-from the learning rate and the scheduler choice.\\
|
|
|
-
|
|
|
-The following section presents the results of the simulations conducted with the
|
|
|
-setups that we describe in this section.
|
|
|
+\paragraph{Training Parameters:}The PINN that we utilize comprises of seven
|
|
|
+hidden layers with twenty neurons each, and an activation function of ReLU. We
|
|
|
+follow the hyperparameter setting in~\cite{Shaier2021} but change the base
|
|
|
+learning rate to $\expnumber{1}{-3}$. And employ a polynomial scheduler
|
|
|
+implementation from the PyTorch library~\cite{Paszke2019} instead. We train the
|
|
|
+model for 10000 epochs to extract the parameters. For each set of parameters, we
|
|
|
+conduct five iterations to demonstrate stability of the values. For measuring the
|
|
|
+accuracy, we calculate the error $e$, using the 2-Norm. Let $G$ be the set of
|
|
|
+compartment training data the SIR model with $\boldsymbol{g}\in G$ and $\hat{\boldsymbol{g}}$ be the
|
|
|
+corresponding model prediction, then,
|
|
|
+\begin{equation}
|
|
|
+ e_{G} = \frac{1}{|G|}\sum_{g\in G}^{}\frac{\Big\|\hat{\boldsymbol{g}} - \boldsymbol{g}\Big\|_2}{\Big\|\boldsymbol{g}\Big\|_2},
|
|
|
+\end{equation}
|
|
|
+is the average error across all three groups.
|
|
|
|
|
|
% -------------------------------------------------------------------
|
|
|
|
|
|
-\subsection{Results 4}
|
|
|
+\subsection{Results}
|
|
|
\label{sec:sir:results}
|
|
|
|
|
|
-\begin{figure}[t]
|
|
|
- \centering
|
|
|
- \includegraphics[width=0.7\textwidth]{reproducability.pdf}
|
|
|
- \caption{Visualization of all 5 predictions for the synthetic dataset,
|
|
|
- compared to the true values of $\alpha = \nicefrac{1}{3}$ and $\beta = \nicefrac{1}{2}$}
|
|
|
- \label{fig:reprod}
|
|
|
-\end{figure}
|
|
|
-
|
|
|
-In this section, we present the results, that we obtain from the conducted
|
|
|
-experiments, that we describe in the preceding section. We begin by examining
|
|
|
-the results for the synthetic dataset, focusing the accuracy and
|
|
|
-reproducibility. We then proceed to present and discuss the results for the
|
|
|
-German states and Germany.\\
|
|
|
+In this section, we start by examining the results for the synthetic dataset,
|
|
|
+focusing the accuracy and reproducibility. We then proceed to present and
|
|
|
+discuss the results for the German states and Germany.\\
|
|
|
|
|
|
The results of the experiment regarding the synthetic data can be seen
|
|
|
-in~\Cref{table:alpha_beta_synth} and in~\Cref{fig:reprod}.~\Cref{fig:reprod}
|
|
|
-depicts the values of $\beta$ and $\alpha$ for each iteration in comparison to the true
|
|
|
-values of $\beta=\nicefrac{1}{2}$ and $\alpha=\nicefrac{1}{3}$. In~\Cref{table:alpha_beta_synth}
|
|
|
-we present the mean $\mu$ and standard deviation $\sigma$ of both values across
|
|
|
-all five iterations.\\
|
|
|
-
|
|
|
+in~\Cref{table:alpha_beta_synth}. The error and the standard variation for both
|
|
|
+parameters are negligible small. Taking the mean of the parameters across the
|
|
|
+five iterations yields more accurate results.\\
|
|
|
\begin{table}[h]
|
|
|
\begin{center}
|
|
|
- \begin{tabular}{ccc ccc}
|
|
|
+ \caption{Simulation results for the synthetic data. The true values and
|
|
|
+ the respective mean parameter is given.}
|
|
|
+ \label{table:alpha_beta_synth}
|
|
|
+ \begin{tabular}{ccccccccc}
|
|
|
\toprule
|
|
|
- \multicolumn{3}{c}{$\alpha$} & \multicolumn{3}{c}{$\beta$} \\
|
|
|
- \cmidrule{1-3}\cmidrule{4-6}
|
|
|
- true & $\mu$ & $\sigma$ & true & $\mu$ & $\sigma$ \\
|
|
|
+ \multicolumn{2}{c}{$\alpha$} & \phantom{0} & \multicolumn{2}{c}{$\beta$} \\
|
|
|
+ \cmidrule{1-2}\cmidrule{4-5}
|
|
|
+ true & $\mu$ & \phantom{0} & true & $\mu$ & \phantom{0} & $e_{SIR}$ \\
|
|
|
\midrule
|
|
|
- 0.3333 & 0.3334 & 0.0011 & 0.5000 & 0.5000 & 0.0017 \\
|
|
|
+ 0.333 & 0.333{\tiny$\pm 0.001$} & \phantom{0} & 0.500 & 0.500{\tiny$\pm 0.002$} & \phantom{0} & 0.004 \\
|
|
|
\bottomrule
|
|
|
\end{tabular}
|
|
|
- \caption{The mean $\mu$ and standard deviation $\sigma$ across the 5
|
|
|
- independent iterations of training our PINNs with the synthetic dataset.}
|
|
|
- \label{table:alpha_beta_synth}
|
|
|
\end{center}
|
|
|
\end{table}
|
|
|
-
|
|
|
The results demonstrate that the model is capable of approximating the correct
|
|
|
parameters for the small, synthetic dataset in each of the five iterations.
|
|
|
-While the predicted value is not precisely accurate, the standard deviation is
|
|
|
-sufficiently small, and taking the mean of multiple iterations produces an
|
|
|
-almost perfect result.\\
|
|
|
+The mean of the predicted values results in values with a sufficiently small
|
|
|
+error. Thus, we argue that our selected method is well suited to analyze real
|
|
|
+world pandemic data collected in Germany.\\
|
|
|
|
|
|
In~\Cref{table:state_mean_std} we present the results of the training for the
|
|
|
real-world data. The results are presented from top to bottom, in the order of
|
|
|
the community identification number, with the last entry being Germany. Both
|
|
|
the mean $\mu$ and the standard deviation $\sigma$ are calculated across all
|
|
|
-five iterations of our experiment. We can observe that the values of
|
|
|
-\emph{Hamburg} have the highest standard deviation, while \emph{Mecklenburg Vorpommern}
|
|
|
-has the lowest $\sigma$.\\
|
|
|
+five iterations of our experiment. We can observe that the error $e_{SIR}$ is
|
|
|
+the highest for \emph{Saxony} and the lowest for \emph{Lower Saxony}.
|
|
|
+Furthermore, we include the distance $\Delta\beta_{\text{Germany}} = \beta_{\text{state}} - \beta_{\text{Germany}}$
|
|
|
+and the percentage of people that have a basic immunity through vaccination
|
|
|
+$\nu$ for each state provided by the Robert Koch Institute\footnote{{\tiny\url{https://impfdashboard.de/}}}.\\
|
|
|
|
|
|
\begin{table}[h]
|
|
|
\begin{center}
|
|
|
+ \caption{Mean and standard deviation, error $e_{SIR}$ and the distance
|
|
|
+ $\Delta\beta_{\text{Germany}} = \beta_{\text{state}} - \beta_{\text{Germany}}$
|
|
|
+ across the 5 iterations, that we conducted for each German state and Germany
|
|
|
+ as the whole country. Furthermore we include the vaccination percentage
|
|
|
+ $\nu$ provided from the RKI.}
|
|
|
+ \label{table:state_mean_std}
|
|
|
\begin{tabular}{lccccc}
|
|
|
\toprule
|
|
|
- & \multicolumn{2}{c}{$\alpha$} & \multicolumn{2}{c}{$\beta$} & \\
|
|
|
- \cmidrule{2-3}\cmidrule{4-5}
|
|
|
- state name & $\mu$ & $\sigma$ & $\mu$ & $\sigma$ & $e_{\text{synth}}$ \\
|
|
|
+ state name & $\alpha$ & $\beta$ & $e_{SIR}$ & $\Delta\beta_{\text{Germany}}$ & $\nu$ [\%] \\
|
|
|
\midrule
|
|
|
- Schleswig Holstein & 0.0771 & 0.0010 & 0.0966 & 0.0013 & 0.0849 \\
|
|
|
- Hamburg & 0.0847 & 0.0035 & 0.1077 & 0.0037 & 0.0948 \\
|
|
|
- Lower Saxony & 0.0735 & 0.0014 & 0.0962 & 0.0018 & 0.0774 \\
|
|
|
- Bremen & 0.0588 & 0.0018 & 0.0795 & 0.0025 & 0.0933 \\
|
|
|
- North Rhine-Westphalia & 0.0780 & 0.0009 & 0.1001 & 0.0011 & 0.0777 \\
|
|
|
- Hesse & 0.0653 & 0.0016 & 0.0854 & 0.0020 & 0.1017 \\
|
|
|
- Rhineland-Palatinate & 0.0808 & 0.0016 & 0.1036 & 0.0018 & 0.0895 \\
|
|
|
- Baden-Württemberg & 0.0862 & 0.0014 & 0.1132 & 0.0016 & 0.0796 \\\addlinespace
|
|
|
- Bavaria & 0.0809 & 0.0021 & 0.1106 & 0.0027 & 0.0952 \\
|
|
|
- Saarland & 0.0746 & 0.0021 & 0.0996 & 0.0024 & 0.1080 \\
|
|
|
- Berlin & 0.0901 & 0.0008 & 0.1125 & 0.0008 & 0.0667 \\
|
|
|
- Brandenburg & 0.0861 & 0.0008 & 0.1091 & 0.0010 & 0.0724 \\
|
|
|
- Mecklenburg-Vorpommern & 0.0910 & 0.0007 & 0.1167 & 0.0008 & 0.0540 \\
|
|
|
- Saxony & 0.0797 & 0.0017 & 0.1073 & 0.0022 & 0.1109 \\
|
|
|
- Saxony-Anhalt & 0.0932 & 0.0019 & 0.1207 & 0.0027 & 0.0785 \\
|
|
|
- Thuringia & 0.0952 & 0.0011 & 0.1248 & 0.0016 & 0.0837 \\\addlinespace
|
|
|
- Germany & 0.0803 & 0.0012 & 0.1044 & 0.0014 & 0.0804 \\
|
|
|
+ Schleswig Holstein & 0.076{\tiny$\pm0.001$} & 0.095{\tiny$\pm 0.001$} & 0.085 & -0.013 & 79.5 \\
|
|
|
+ Hamburg & 0.082{\tiny$\pm0.001$} & 0.104{\tiny$\pm 0.001$} & 0.095 & -0.004 & 84.5 \\
|
|
|
+ Lower Saxony & 0.075{\tiny$\pm0.002$} & 0.097{\tiny$\pm 0.002$} & 0.077 & -0.011 & 77.6 \\
|
|
|
+ Bremen & 0.058{\tiny$\pm0.002$} & 0.078{\tiny$\pm 0.002$} & 0.093 & -0.030 & 88.3 \\
|
|
|
+ NRW & 0.079{\tiny$\pm0.001$} & 0.101{\tiny$\pm 0.001$} & 0.078 & -0.007 & 79.5 \\
|
|
|
+ Hesse & 0.065{\tiny$\pm0.001$} & 0.085{\tiny$\pm 0.001$} & 0.102 & -0.023 & 75.8 \\
|
|
|
+ Rhineland-Palatinate & 0.085{\tiny$\pm0.004$} & 0.108{\tiny$\pm 0.004$} & 0.090 & 0.001 & 75.6 \\
|
|
|
+ Baden-Württemberg & 0.091{\tiny$\pm0.002$} & 0.118{\tiny$\pm 0.003$} & 0.080 & 0.010 & 74.5 \\
|
|
|
+ Bavaria & 0.085{\tiny$\pm0.004$} & 0.116{\tiny$\pm 0.005$} & 0.095 & 0.008 & 75.1 \\
|
|
|
+ Saarland & 0.075{\tiny$\pm0.002$} & 0.099{\tiny$\pm 0.003$} & 0.108 & -0.009 & 82.4 \\
|
|
|
+ Berlin & 0.087{\tiny$\pm0.001$} & 0.109{\tiny$\pm 0.001$} & 0.067 & 0.001 & 78.1 \\
|
|
|
+ Brandenburg & 0.087{\tiny$\pm0.003$} & 0.110{\tiny$\pm 0.003$} & 0.072 & 0.002 & 68.1 \\
|
|
|
+ MV & 0.089{\tiny$\pm0.002$} & 0.114{\tiny$\pm 0.002$} & 0.054 & 0.006 & 74.7 \\
|
|
|
+ Saxony & 0.075{\tiny$\pm0.002$} & 0.099{\tiny$\pm 0.002$} & 0.111 & -0.009 & 65.1 \\
|
|
|
+ Saxony-Anhalt & 0.092{\tiny$\pm0.003$} & 0.119{\tiny$\pm 0.005$} & 0.079 & 0.011 & 74.1 \\
|
|
|
+ Thuringia & 0.091{\tiny$\pm0.002$} & 0.119{\tiny$\pm 0.003$} & 0.084 & 0.011 & 70.3 \\
|
|
|
+ \midrule
|
|
|
+ Germany & 0.083{\tiny$\pm0.001$} & 0.108{\tiny$\pm 0.002$} & 0.080 & 0.000 & 76.4 \\
|
|
|
\bottomrule
|
|
|
\end{tabular}
|
|
|
- \caption{Mean and standard deviation across the 5 iterations, that we
|
|
|
- conducted for each German state and Germany as the whole country.}
|
|
|
- \label{table:state_mean_std}
|
|
|
+
|
|
|
\end{center}
|
|
|
\end{table}
|
|
|
|
|
|
\begin{figure}[t]
|
|
|
\centering
|
|
|
\includegraphics[width=\textwidth]{mean_std_alpha_beta_res.pdf}
|
|
|
- \caption{Visualization of the mean $\mu$ and standard deviation $\sigma$ of
|
|
|
- the transition rates $\alpha$ and $\beta$ for each state compared to the
|
|
|
- mean values of $\alpha$ and $\beta$ for Germany.}
|
|
|
+ \caption{Visualization of the mean and standard deviation of the transition
|
|
|
+ rates $\alpha$ and $\beta$ for each state compared to the mean values of
|
|
|
+ $\alpha$ and $\beta$ for Germany.}
|
|
|
\label{fig:alpha_beta_mean_std}
|
|
|
\end{figure}
|
|
|
|
|
|
In~\Cref{fig:alpha_beta_mean_std}, we present a visual representation of the
|
|
|
means and standard deviations in comparison to the national values. It is
|
|
|
noteworthy that the states of Saxony-Anhalt and Thuringia have the highest
|
|
|
-transmission rates of all states, while Bremen and Hessen have the lowest
|
|
|
+transmission rates of all states, while Bremen and Hesse have the lowest
|
|
|
values for $\beta$. The transmission rates of Hamburg, Baden Württemberg,
|
|
|
Bavaria, and all eastern states lay above the national rate of transmission.
|
|
|
Similarly, the recovery rate yields comparable outcomes. For the recovery rate,
|
|
|
the same states that exhibit a transmission rate exceeding the national value,
|
|
|
have a higher recovery rate than the national standard, with the exception of
|
|
|
-Saxony.It is noteworthy that the recovery rates of all states exhibit a
|
|
|
+Saxony. It is noteworthy that the recovery rates of all states exhibit a
|
|
|
tendency to align with the recovery rate of $\alpha=\nicefrac{1}{14}$, which is
|
|
|
-equivalent to a recovery period of 14 days.\\
|
|
|
+equivalent to a recovery period of $D=\nicefrac{1}{\alpha}=14$ days. When
|
|
|
+calculating the correlation coefficient between the predicted transmission rate
|
|
|
+and the vaccination ratio, we get a value of $-0.5134$. The strong negative
|
|
|
+correlation indicates that the transmission rate is high when the vaccination
|
|
|
+ratio is low, and vice versa. This shows that the impact of the vaccines can be
|
|
|
+witnessed in our results. \\
|
|
|
|
|
|
It is evident that there is a correlation between the values of $\alpha$ and
|
|
|
$\beta$ for each state. States with a high transmission rate tend to have a
|
|
@@ -242,45 +238,41 @@ to the $R$ compartment 14 days after they were infected.\\
|
|
|
|
|
|
This issue can be addressed by reducing the SIR model, thereby eliminating the
|
|
|
significance of the $R$ compartment size. In the following section, we present
|
|
|
-our experiments for the reduced SIR model with time-independent parameters.
|
|
|
+our experiments for the reduced SIR model with time-dependent parameters.
|
|
|
|
|
|
% -------------------------------------------------------------------
|
|
|
|
|
|
-\section{Reduced SIR Model 5}
|
|
|
+\section{Identifying the Reproduction Number}
|
|
|
\label{sec:rsir}
|
|
|
In this section we describe the experiments we conduct to identify the
|
|
|
time-dependent reproduction number for both synthetic and real-world data.
|
|
|
Similar to the previous section, we first describe the setup of our experiments
|
|
|
-and afterwards present the results. The methods we employ for the preprocessing
|
|
|
-are described in~\Cref{sec:preprocessing:rq} and for the PINN, that we use,
|
|
|
-are described in~\Cref{sec:pinn:rsir}.
|
|
|
+and afterwards present the results and a discussion. The methods we employ for
|
|
|
+the preprocessing are described in~\Cref{sec:preprocessing:rq} and for the PINN,
|
|
|
+that we use, are described in~\Cref{sec:pinn:rsir}.
|
|
|
|
|
|
% -------------------------------------------------------------------
|
|
|
|
|
|
-\subsection{Setup 1}
|
|
|
+\subsection{Setup}
|
|
|
\label{sec:rsir:setup}
|
|
|
-This section outlines the selection of parameters and configuration for data
|
|
|
-generation, preprocessing, and the neural networks. We employ these setups to
|
|
|
-train the PINNs to identify the reproduction number on both synthetic and
|
|
|
-real-world data.\\
|
|
|
-
|
|
|
-For the purposes of validation, we create a synthetic dataset, by setting the parameter
|
|
|
+\paragraph{Synthetic Data:}For the purposes of validation, we create a synthetic dataset, by setting the parameter
|
|
|
of $\alpha$ and the reproduction value each to a specific values, and solving~\Cref{eq:reduced_sir_ODE}
|
|
|
-for a given time interval. We set $\alpha=\nicefrac{1}{3}$ and $\Rt$ to the
|
|
|
-values as can be seen in~\Cref{fig:Rt_dataset} as well as the population
|
|
|
-size $N=\expnumber{7.6}{6}$ and the initial amount of infected people to
|
|
|
-$I_0=10$. Furthermore, we set our simulated time span to 150 days. We use this
|
|
|
-dataset to demonstrate, that our method is working on a simple and minimal
|
|
|
-dataset.\\ To obtain a dataset of the infectious group, consisting of the
|
|
|
+for a given time interval. As in the synthetic data for the aforementioned
|
|
|
+experiments, we set $\alpha=\nicefrac{1}{3}$ and $\Rt$ to the values as can be
|
|
|
+seen in~\Cref{fig:Rt_dataset} as well as the population size
|
|
|
+$N=\expnumber{7.6}{6}$ and the initial amount of infected people to $I_0=10$.
|
|
|
+Furthermore, we set our simulated time span to 150 days. We use this dataset to
|
|
|
+demonstrate, that our method is working on a simple and minimal dataset.\\
|
|
|
+\paragraph{Real-World Data:}To obtain a dataset of the infectious group, consisting of the
|
|
|
real-world data, we processed the data of the dataset
|
|
|
-\emph{COVID-19-Todesfälle in Deutschland} to extract the number of infections
|
|
|
+\emph{COVID-19-Todesfälle in Deutschland}~\cite{GHDead} to extract the number of infections
|
|
|
in Germany as a whole. For the German states, we use the data of \emph{SARS-CoV-2
|
|
|
- Infektionen in Deutschland}. In the preprocessing stage, we employ a constant
|
|
|
+ Infektionen in Deutschland}~\cite{GHInf}. In the preprocessing stage, we employ a constant
|
|
|
rate for $\alpha$ to move individuals into the removed compartment. For each
|
|
|
state we generate two datasets with a different recovery rate. First, we choose
|
|
|
-$\alpha = \nicefrac{1}{14}$, which aligns with the time of recovery\footnote{\url{https://github.com/robert-koch-institut/SARS-CoV-2-Infektionen_in_Deutschland.git}}.
|
|
|
+$\alpha = \nicefrac{1}{14}$, which aligns with the time of recovery~\cite{GHInf}.
|
|
|
Second, we use $\alpha=\nicefrac{1}{5}$, as 5 days into the infection is the
|
|
|
-point at which the infectiousness is at its peak\footnote{\url{https://www.infektionsschutz.de/coronavirus/fragen-und-antworten/ansteckung-uebertragung-und-krankheitsverlauf/}}.
|
|
|
+point at which the infectiousness is at its peak~\cite{COVInfo}.
|
|
|
As in~\Cref{sec:sir}, we set the population size $N$ of each state and Germany
|
|
|
to the corresponding size at the end of 2019. Furthermore, for the same reason
|
|
|
we restrict the data points to an interval of 1200 days, beginning on March 09.
|
|
@@ -309,29 +301,29 @@ we restrict the data points to an interval of 1200 days, beginning on March 09.
|
|
|
\label{fig:Rt_dataset}
|
|
|
\end{figure}
|
|
|
|
|
|
-In order to achieve the desired output, the selected neural network
|
|
|
-architecture comprises of four hidden layers, each containing 100 neurons. The
|
|
|
-activation function is the tangens hyperbolicus function. For the real-world
|
|
|
-data, we weight the data loss by a factor of $\expnumber{1}{6}$, to the total
|
|
|
-loss. The model is trained using a base learning rate of $\expnumber{1}{-3}$,
|
|
|
-with the same scheduler and optimizer as we describe in~\Cref{sec:sir:setup}.
|
|
|
-We train the model for 20000 epochs. To reduce the standard deviation, each
|
|
|
-experiment is conducted 15 times.\\
|
|
|
+\paragraph{Training Parameters:}In order to achieve the desired output, the
|
|
|
+selected neural network architecture comprises of four hidden layers, each
|
|
|
+containing 100 neurons. The activation function is the tangens hyperbolicus
|
|
|
+function. For both the federal state and Germany, the physics loss is weighted
|
|
|
+by a factor of $\expnumber{1}{-6}$, whereas the data loss belonging to Germany
|
|
|
+is also weighted with a high factor of $\expnumber{1}{4}$, relative to the total
|
|
|
+loss. We found this approach to yield the best results. The model is trained
|
|
|
+using a base learning rate of $\expnumber{1}{-3}$, with the same scheduler and
|
|
|
+optimizer as we describe in~\Cref{sec:sir:setup}. We train the model for the
|
|
|
+states 20000 epochs and start the physics training after 10000 epochs, while we
|
|
|
+train for Germany for 25000 and start the physics training after 15000 epochs.
|
|
|
+To reduce the standard deviation, each experiment is conducted 15 times. For
|
|
|
+evaluation, we use the error $e_G$ as we do in the subsequent section.\\
|
|
|
|
|
|
% -------------------------------------------------------------------
|
|
|
|
|
|
-\subsection{Results 4}
|
|
|
+\subsection{Results}
|
|
|
\label{sec:rsir:results}
|
|
|
|
|
|
-In this section we provide the results for our experiments considering the
|
|
|
-reduced SIR model and the reproduction number $\Rt$. First, we present
|
|
|
-our findings for the synthetic dataset. Then, we provide and discuss the
|
|
|
-results for the real-world data.\\
|
|
|
-
|
|
|
\Cref{fig:synth_results} illustrates the results of our experiments conducted on
|
|
|
the synthetic dataset, which can be seen in~\Cref{fig:Rt_dataset}. It is evident
|
|
|
that the model is capable of learning the infection data across all data points.
|
|
|
-The error for this is, $e_{\text{synth}} = 0.0016$, which is of a negligible
|
|
|
+The error for this is, $e_I = 0.0016$, which is of a negligible
|
|
|
magnitude.\\
|
|
|
|
|
|
\begin{figure}[h]
|
|
@@ -354,8 +346,7 @@ An examination of the predictions for the representation value $\Rt$ reveals
|
|
|
that here as well, the model is capable of accurately delineating the value at
|
|
|
each time point. However, during the first 30 days, the standard deviation is
|
|
|
exhibits an upward trend, while during the final 120 days, the predictions
|
|
|
-demonstrate remarkable precision. The overall prediction of $\Rt$ has an error
|
|
|
-of $e_{\Rt} = 0.0521$.\\
|
|
|
+demonstrate remarkable precision.\\
|
|
|
|
|
|
In~\Cref{fig:state_results}, we present the graphs of $\Rt$ for the state with
|
|
|
the highest value of $\beta$, namely Thuringia, and for the state with the lowest
|
|
@@ -390,8 +381,9 @@ is shorter, but the peak value is higher.\\
|
|
|
\label{fig:state_results}
|
|
|
\caption{Visualization of the prediction of the training and the graphs of
|
|
|
$\Rt$ for Thuringia (left) and Bremen (right) with both
|
|
|
- $\alpha = \nicefrac{1}{14}$ and $\alpha = \nicefrac{1}{5}$. Events like
|
|
|
- the peak of an influential variant are marked horizontally.}
|
|
|
+ $\alpha = \nicefrac{1}{14}$ and $\alpha = \nicefrac{1}{5}$. Events~\cite{COVIDChronik} like
|
|
|
+ the peak of an influential variant or the start of the vaccination of the public are marked horizontally. Further
|
|
|
+ visualizations can be found in~\Cref{chap:appendix}.}
|
|
|
\end{figure}
|
|
|
|
|
|
\Cref{table:state_error} presents data regarding the discrepancy between the
|
|
@@ -406,6 +398,41 @@ to the normalization factor $C$. The model is unable to learn effectively if the
|
|
|
values of the data loss $\mathcal{L}_{\text{data}}$ are too large or too small
|
|
|
at the beginning.\\
|
|
|
|
|
|
+\begin{table}[t]
|
|
|
+ \begin{center}
|
|
|
+ \caption{This table displays all average values of the error $e_{\text{I}}$
|
|
|
+ for all German states and Germany. The average is formed across all
|
|
|
+ 10 iteration.}
|
|
|
+ \label{table:state_error}
|
|
|
+ \begin{tabular}{lccccccc}
|
|
|
+ \toprule
|
|
|
+ & \multicolumn{2}{c}{$e_I$} & \phantom{0} & \multicolumn{2}{c}{days with $\Rt>1$} & \multicolumn{2}{c}{peak $\Rt$} \\
|
|
|
+ \cmidrule{2-3}\cmidrule{5-6}\cmidrule{7-8}
|
|
|
+ state name & $\alpha=\frac{1}{14}$ & $\alpha=\frac{1}{5}$ & \phantom{0} & $\alpha=\frac{1}{14}$ & $\alpha=\frac{1}{5}$ & $\alpha=\frac{1}{14}$ & $\alpha=\frac{1}{5}$ \\
|
|
|
+ \midrule
|
|
|
+ Schleswig Holstein & 0.228 & 0.258 & \phantom{0} & 467.5 & 458.5 & 1.475 & 1.166 \\
|
|
|
+ Hamburg & 0.265 & 0.330 & \phantom{0} & 424.3 & 409.8 & 1.500 & 1.297 \\
|
|
|
+ Lower Saxony & 0.224 & 0.340 & \phantom{0} & 413.1 & 430.3 & 1.662 & 1.223 \\
|
|
|
+ Bremen & 0.246 & 0.380 & \phantom{0} & 468.6 & 539.1 & 1.582 & 1.179 \\
|
|
|
+ NRW & 0.185 & 0.252 & \phantom{0} & 486.3 & 602.0 & 1.573 & 1.205 \\
|
|
|
+ Hesse & 0.302 & 0.346 & \phantom{0} & 553.0 & 511.2 & 1.409 & 1.157 \\
|
|
|
+ Rhineland-Palatinate & 0.256 & 0.277 & \phantom{0} & 484.7 & 404.7 & 1.534 & 1.175 \\
|
|
|
+ Baden-Württemberg & 0.198 & 0.284 & \phantom{0} & 469.2 & 590.0 & 1.457 & 1.180 \\
|
|
|
+ Bavaria & 0.225 & 0.318 & \phantom{0} & 490.5 & 486.1 & 1.428 & 1.199 \\
|
|
|
+ Saarland & 0.284 & 0.408 & \phantom{0} & 500.2 & 564.7 & 1.515 & 1.180 \\
|
|
|
+ Berlin & 0.201 & 0.240 & \phantom{0} & 591.9 & 514.4 & 1.721 & 1.262 \\
|
|
|
+ Brandenburg & 0.237 & 0.242 & \phantom{0} & 555.9 & 596.3 & 1.447 & 1.159 \\
|
|
|
+ MV & 0.170 & 0.257 & \phantom{0} & 537.5 & 544.3 & 1.563 & 1.135 \\
|
|
|
+ Saxony & 0.292 & 0.256 & \phantom{0} & 722.3 & 695.4 & 1.790 & 1.407 \\
|
|
|
+ Saxony-Anhalt & 0.213 & 0.268 & \phantom{0} & 572.0 & 631.9 & 1.387 & 1.165 \\
|
|
|
+ Thuringia & 0.180 & 0.222 & \phantom{0} & 732.1 & 730.6 & 1.586 & 1.249 \\
|
|
|
+ \midrule
|
|
|
+ Germany & 0.284 & 0.239 & \phantom{0} & 587.7 & 430.7 & 1.561 & 1.219 \\
|
|
|
+ \bottomrule
|
|
|
+ \end{tabular}
|
|
|
+ \end{center}
|
|
|
+\end{table}
|
|
|
+
|
|
|
As illustrated in~\Cref{fig:state_results}, the training data is overlaid with the
|
|
|
corresponding prediction of the model. We can observe that the prediction, though
|
|
|
an exact reconstruction, accurately captures the general trajectory of the
|
|
@@ -414,8 +441,8 @@ peaks, exhibiting a tendency to ignore smaller changes. This suggests that the
|
|
|
prediction of the model is capable show the rough outline of the progression of COVID-19. In the
|
|
|
beginning, the majority of predictions below $\Rt=1$, indicating an outbreak.
|
|
|
As we observed in the synthetic data, the model exhibits a higher standard
|
|
|
-deviation at the boundaries. In the graphs, we mark the
|
|
|
-peaks of the most severe COVID-19 variants in Germany. While the peaks of the
|
|
|
+deviation at the boundaries. In the graphs, we mark the peaks of the most severe
|
|
|
+COVID-19 variants in Germany~\cite{COVIDChronik}. While the peaks of the
|
|
|
Alpha and Delta variants are clearly visible in the data, the model does not
|
|
|
learn these, and thus they are not reflected in the results. The peak of the
|
|
|
Omicron variant represents the culmination of the COVID-19 pandemic in Germany
|
|
@@ -424,46 +451,9 @@ value of the reproduction number across all states. This phenomenon can be expla
|
|
|
number of individuals infected by one infectious person reaching its peak. In
|
|
|
some states the peaks of other Omicron variants after the maximum peak are visible (see Thuringia).\\
|
|
|
|
|
|
-The experiments demonstrate, that our model encounteres difficulties in learning the data for the
|
|
|
+The experiments demonstrate, that our model encounters difficulties in learning the data for the
|
|
|
states and Germany and consequently in predicting the reproduction values for each dataset.
|
|
|
Nonetheless, the predictions illustrate the general trends of the most impactful
|
|
|
events of the COVID-19 pandemic.\\
|
|
|
|
|
|
-\begin{table}[t]
|
|
|
- \begin{center}
|
|
|
- \begin{tabular}{lcc}
|
|
|
- \toprule
|
|
|
- & \multicolumn{2}{c}{$e_I$} \\
|
|
|
- \cmidrule{2-3}
|
|
|
- state name & $\alpha=\nicefrac{1}{14}$ & $\alpha=\nicefrac{1}{5}$ \\
|
|
|
- \midrule
|
|
|
- Schleswig Holstein & 0.2005 & 0.2514 \\
|
|
|
- Hamburg & 0.3045 & 0.3357 \\
|
|
|
- Lower Saxony & 0.2140 & 0.3082 \\
|
|
|
- Bremen & 0.2370 & 0.3838 \\
|
|
|
- North Rhine-Westphalia & 0.1718 & 0.2460 \\
|
|
|
- Hesse & 0.2736 & 0.3172 \\
|
|
|
- Rhineland-Palatinate & 0.2442 & 0.2674 \\
|
|
|
- Baden-Württemberg & 0.1984 & 0.2958 \\\addlinespace
|
|
|
- Bavaria & 0.1928 & 0.2825 \\
|
|
|
- Saarland & 0.2554 & 0.4676 \\
|
|
|
- Berlin & 0.1885 & 0.2948 \\
|
|
|
- Brandenburg & 0.2023 & 0.2571 \\
|
|
|
- Mecklenburg-Vorpommern & 0.1518 & 0.3272 \\
|
|
|
- Saxony & 0.3382 & 0.2807 \\
|
|
|
- Saxony-Anhalt & 0.1959 & 0.2564 \\
|
|
|
- Thuringia & 0.1401 & 0.2221 \\\addlinespace
|
|
|
- Germany & 0.3371 & 0.2533 \\
|
|
|
- \bottomrule
|
|
|
- \end{tabular}
|
|
|
- \caption{This table displays all average values of the error $e_{\text{synth}}$
|
|
|
- for all German states and Germany. The average is formed across all
|
|
|
- 10 iteration.}
|
|
|
- \label{table:state_error}
|
|
|
- \end{center}
|
|
|
-\end{table}
|
|
|
-
|
|
|
-
|
|
|
-
|
|
|
-
|
|
|
% -------------------------------------------------------------------
|