chap04.tex 28 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414415416417418419420421422423424425426
  1. % %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  2. % Author: Phillip Rothenbeck
  3. % Title: Investigating the Evolution of the COVID-19 Pandemic in Germany Using Physics-Informed Neural Networks
  4. % File: chap04/chap04.tex
  5. % Part: Experiments
  6. % Description:
  7. % summary of the content in this chapter
  8. % Version: 01.01.2012
  9. % %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  10. \chapter{Experiments}
  11. \label{chap:evaluation}
  12. In ~\Cref{chap:methods}, we explain the methods based on the theoretical
  13. background, that we established in~\Cref{chap:background}. In this chapter, we
  14. present the setups and results from the experiments and simulations. First, we
  15. discuss the experiments dedicated to identify the epidemiological transition
  16. rates of $\alpha$ and $\beta$ in synthetic and real-world data. Second, we
  17. examine the reproduction number $\Rt$ in synthetic and real-world data of Germany.
  18. % -------------------------------------------------------------------
  19. \section{Identifying the Transmission and Recovery Rates}
  20. \label{sec:sir}
  21. In this section, we aim to identify the transmission rate $\beta$ and the
  22. recovery rate $\alpha$ from either synthetic or preprocessed real-world data.
  23. The methodology that we employ to identify these epidemiological parameters is described
  24. in~\Cref{sec:pinn:sir}. Meanwhile, the methods we utilize to preprocess the
  25. real-world data are detailed in~\Cref{sec:preprocessing:rq}. In the first part,
  26. we present the setup of our experiments, then we provide the results including a
  27. discussion.\\
  28. % -------------------------------------------------------------------
  29. \subsection{Setup}
  30. \label{sec:sir:setup}
  31. \paragraph{Synthetic Data:}In order to validate our method, we first generate a dataset of synthetic data.
  32. We achieve this by solving~\Cref{eq:modSIR} for a given set of parameters.
  33. The parameters are set to $\alpha = \nicefrac{1}{3}$ and $\beta = \nicefrac{1}{2}$.
  34. The size of the population is $N = \expnumber{7.6}{6}$ and the initial amount of
  35. infectious individuals is $I_0 = 10$. We conduct the simulation over 150
  36. days, resulting in a dataset of the form of~\Cref{fig:datasets_sir}.\\
  37. \paragraph{Real-World Data:}In order to process the real-world RKI data, it is
  38. necessary to preprocess the raw data for each state from the infection
  39. dataset~\cite{GHInf} and for Germany from the death case dataset~\cite{GHDead} separately.
  40. This is achieved by utilizing a recovery queue with a recovery period of 14
  41. days. With regard to population size of each state, we set it to the respective
  42. value counted at the end of
  43. 2019\footnote{{\url{https://datacommons.org/?hl=de} Last accessed: 2024-07-20}}.
  44. The initial number of infectious individuals is set to the number of infected
  45. people on 2020-03-09 from the dataset. The data we extract spans from
  46. 2020-03-09 to 2023-06-22, encompassing a period of 1200 days and
  47. representing the time span during which the COVID-19 disease was the most
  48. active and severe.
  49. \begin{figure}[t]
  50. \centering
  51. \includegraphics[width=\textwidth]{in_text_SIR.pdf}
  52. \caption{Synthetic and real-world training data. The synthetic data is
  53. generated with $\alpha=\nicefrac{1}{3}$ and $\beta=\nicefrac{1}{2}$
  54. and~\Cref{eq:modSIR}. The Germany data is taken from the death case
  55. data set. Exemplatory we show illustrations of the datasets of
  56. Schleswig-Holstein, Berlin, and Thuringia. Mind that this visualization
  57. does not have standardized y-axes. For all other states
  58. with standardized y-axes see~\Cref{sec:sir_datasets}.}
  59. \label{fig:datasets_sir}
  60. \end{figure}
  61. \paragraph{Training Parameters:}The PINN that we utilize comprises of seven
  62. hidden layers with twenty neurons each, and an activation function of ReLU~\cite{Fukushima1969}.
  63. We follow the hyperparameter setting in~\cite{Shaier2021} but change the base
  64. learning rate to $\expnumber{1}{-3}$. And employ a polynomial scheduler
  65. implementation from the PyTorch library~\cite{Paszke2019} instead. We train the
  66. model for 10000 iterations to extract the parameters. For each set of parameters, we
  67. conduct five runs to demonstrate stability of the values. For measuring the
  68. accuracy, we calculate the \emph{Relative L2 Error} $e$. Let $G$ be the set of
  69. compartment training data the SIR model with $\boldsymbol{g}\in G$ and $\hat{\boldsymbol{g}}$ be the
  70. corresponding model prediction, then,
  71. \begin{equation}\label{eq:error}
  72. e_{G} = \frac{1}{|G|}\sum_{g\in G}^{}\frac{\Big\|\hat{\boldsymbol{g}} - \boldsymbol{g}\Big\|_2}{\Big\|\boldsymbol{g}\Big\|_2},
  73. \end{equation}
  74. is the average error across all three compartments.
  75. % -------------------------------------------------------------------
  76. \subsection{Results and Discussion}
  77. \label{sec:sir:results}
  78. In this section, we start by examining the results for the synthetic dataset,
  79. focusing on the accuracy and reproducibility. We then proceed to present and
  80. discuss the results for the German states and Germany.\\
  81. The results of the experiment regarding the synthetic data can be seen
  82. in~\Cref{table:alpha_beta_synth}. The error and the standard variation for both
  83. parameters are negligible small. Taking the mean of the parameters across the
  84. five iterations yields more accurate results. The results demonstrate that the
  85. model is capable of approximating the correct parameters for the small,
  86. synthetic dataset in each of the five iterations. The mean of the predicted
  87. values results in values with a sufficiently small error. Thus, we argue that
  88. our selected method is well suited to analyze real-world pandemic data
  89. collected in Germany.\\
  90. \begin{table}[t]
  91. \begin{center}
  92. \caption{Simulation results for the synthetic data. The true values and
  93. the respective mean parameter and standard deviation is given. We
  94. calculate the error $e_{\text{SIR}}$ with~\Cref{eq:error}.}.
  95. \label{table:alpha_beta_synth}
  96. \begin{tabular}{ccccccccc}
  97. \toprule
  98. \multicolumn{2}{c}{$\alpha$} & \phantom{0} & \multicolumn{2}{c}{$\beta$} \\
  99. \cmidrule{1-2}\cmidrule{4-5}
  100. true & $\mu$ & \phantom{0} & true & $\mu$ & \phantom{0} & $e_{\text{SIR}}$ \\
  101. \midrule
  102. 0.333 & 0.333{\tiny$\pm 0.001$} & \phantom{0} & 0.500 & 0.500{\tiny$\pm 0.002$} & \phantom{0} & 0.004 \\
  103. \bottomrule
  104. \end{tabular}
  105. \end{center}
  106. \end{table}
  107. In~\Cref{table:state_mean_std} we present the results of the training for the
  108. real-world data. The results are presented from top to bottom, in the order of
  109. the community identification number, with the last entry being Germany. Both
  110. the mean $\mu$ and the standard deviation $\sigma$ are calculated across all
  111. five iterations of our experiment. We can observe that the error $e_{\text{SIR}}$ is
  112. the highest for \emph{Saxony} and the lowest for \emph{Lower Saxony}.
  113. Furthermore, we include the distance $\Delta\beta_{\text{Germany}} = \beta_{\text{state}} - \beta_{\text{Germany}}$
  114. and the percentage of people that have a basic immunity through vaccination
  115. $\nu$ for each state provided by the Robert Koch Institute~\cite{FMH}.\\
  116. \begin{table}[t]
  117. \begin{center}
  118. \caption{Mean and standard deviation, the error $e_{\text{SIR}}$ which we
  119. calculate with~\Cref{eq:error} and the distance
  120. $\Delta\beta_{\text{Germany}} = \beta_{\text{state}} - \beta_{\text{Germany}}$
  121. across the 5 iterations, that we conducted for each German state (MWP=Mecklenburg-Western Pomerania, NRW=North Rhine-Westphalia) and Germany
  122. as the whole country. Furthermore, we include the vaccination percentage
  123. $\nu$ provided from the German Federal Ministry for Health~\cite{FMH}.}
  124. \label{table:state_mean_std}
  125. \begin{tabular}{lccccc}
  126. \toprule
  127. state name & $\alpha$ & $\beta$ & $e_{\text{SIR}}$ & $\Delta\beta_{\text{Germany}}$ & $\nu$ [\%] \\
  128. \midrule
  129. Schleswig-Holstein & 0.076{\tiny$\pm0.001$} & 0.095{\tiny$\pm 0.001$} & 0.085 & -0.013 & 79.5 \\
  130. Hamburg & 0.082{\tiny$\pm0.001$} & 0.104{\tiny$\pm 0.001$} & 0.095 & -0.004 & 84.5 \\
  131. Lower Saxony & 0.075{\tiny$\pm0.002$} & 0.097{\tiny$\pm 0.002$} & 0.077 & -0.011 & 77.6 \\
  132. Bremen & 0.058{\tiny$\pm0.002$} & 0.078{\tiny$\pm 0.002$} & 0.093 & -0.030 & 88.3 \\
  133. NRW & 0.079{\tiny$\pm0.001$} & 0.101{\tiny$\pm 0.001$} & 0.078 & -0.007 & 79.5 \\
  134. Hesse & 0.065{\tiny$\pm0.001$} & 0.085{\tiny$\pm 0.001$} & 0.102 & -0.023 & 75.8 \\
  135. Rhineland-Palatinate & 0.085{\tiny$\pm0.004$} & 0.108{\tiny$\pm 0.004$} & 0.090 & 0.001 & 75.6 \\
  136. Baden-Württemberg & 0.091{\tiny$\pm0.002$} & 0.118{\tiny$\pm 0.003$} & 0.080 & 0.010 & 74.5 \\
  137. Bavaria & 0.085{\tiny$\pm0.004$} & 0.116{\tiny$\pm 0.005$} & 0.095 & 0.008 & 75.1 \\
  138. Saarland & 0.075{\tiny$\pm0.002$} & 0.099{\tiny$\pm 0.003$} & 0.108 & -0.009 & 82.4 \\
  139. Berlin & 0.087{\tiny$\pm0.001$} & 0.109{\tiny$\pm 0.001$} & 0.067 & 0.001 & 78.1 \\
  140. Brandenburg & 0.087{\tiny$\pm0.003$} & 0.110{\tiny$\pm 0.003$} & 0.072 & 0.002 & 68.1 \\
  141. MWP & 0.089{\tiny$\pm0.002$} & 0.114{\tiny$\pm 0.002$} & 0.054 & 0.006 & 74.7 \\
  142. Saxony & 0.075{\tiny$\pm0.002$} & 0.099{\tiny$\pm 0.002$} & 0.111 & -0.009 & 65.1 \\
  143. Saxony-Anhalt & 0.092{\tiny$\pm0.003$} & 0.119{\tiny$\pm 0.005$} & 0.079 & 0.011 & 74.1 \\
  144. Thuringia & 0.091{\tiny$\pm0.002$} & 0.119{\tiny$\pm 0.003$} & 0.084 & 0.011 & 70.3 \\
  145. \midrule
  146. Germany & 0.083{\tiny$\pm0.001$} & 0.108{\tiny$\pm 0.002$} & 0.080 & 0.000 & 76.4 \\
  147. \bottomrule
  148. \end{tabular}
  149. \end{center}
  150. \end{table}
  151. \begin{figure}[t]
  152. \centering
  153. \includegraphics[width=\textwidth]{mean_std_alpha_beta_res.pdf}
  154. \caption{Visualization of the mean and standard deviation of the transition
  155. rates $\alpha$ and $\beta$ for each state (MWP=Mecklenburg-Western Pomerania) compared to the mean values of
  156. $\alpha$ and $\beta$ for Germany.}
  157. \label{fig:alpha_beta_mean_std}
  158. \end{figure}
  159. In~\Cref{fig:alpha_beta_mean_std}, we present a visual representation of the
  160. means and standard deviations in comparison to the national values. It is
  161. noteworthy that the states of Saxony-Anhalt and Thuringia have the highest
  162. transmission rates of all states, while Bremen and Hesse have the lowest
  163. values for $\beta$. The transmission rates of Hamburg, Baden-Württemberg,
  164. Bavaria, and all eastern states lay above the national rate of transmission.
  165. Similarly, the recovery rate yields comparable outcomes. For the recovery rate,
  166. the same states that exhibit a transmission rate exceeding the national value,
  167. have a higher recovery rate than the national standard, with the exception of
  168. Saxony. It is noteworthy that the recovery rates of all states exhibit a
  169. tendency to align with the recovery rate of $\alpha=\nicefrac{1}{14}$, which is
  170. equivalent to a recovery period of 14 days. When
  171. calculating the correlation coefficient between the predicted transmission rate
  172. and the vaccination ratio, we get a value of $-0.5134$. The strong negative
  173. correlation indicates that the transmission rate is high when the vaccination
  174. ratio is low, and vice versa. This shows that the impact of the vaccines can be
  175. witnessed in our results. \\
  176. It is evident that there is a correlation between the values of $\alpha$ and
  177. $\beta$ for each state. States with a high transmission rate tend to have a
  178. high recovery rate, and vice versa. The correlation between $\alpha$ and
  179. $\beta$ can be explained by the implicate definition of $\alpha$ using a
  180. recovery queue with a constant recovery period of 14 days. This might result in
  181. the PINN not learning $\alpha$ as a standalone parameter but rather as a
  182. function of the transmission rate $\beta$. This phenomenon occurs because the
  183. transmission rate determines the number of individuals that get infected per
  184. day, and the recovery queue moves a proportional number of people to the
  185. removed compartment. Consequently, a number of people defined by $\beta$ move
  186. to the $R$ compartment 14 days after they were infected. Furthermore,
  187. in~\Cref{sec:pandemicModel:rsir}, we discussed the reproduction number $\Rt$,
  188. which describes the number of individuals infected by one infectious individual.
  189. This can be another reason for the observed correlation, as $\Rt$ depends on
  190. both $\alpha$ and $\beta$ (see~\Cref{eq:repr_num}), which illustrates that both
  191. parameters are influenced by changes to the reproductivity of the disease.\\
  192. This issue can be addressed by reducing the SIR model, thereby eliminating the
  193. significance of the $R$ compartment size. In the following section, we present
  194. our experiments for the reduced SIR model with time-dependent parameters.
  195. % -------------------------------------------------------------------
  196. \section{Identifying the Reproduction Number}
  197. \label{sec:rsir}
  198. In this section we describe the experiments we conduct to identify the
  199. time-dependent reproduction number for both synthetic and real-world data.
  200. Similar to the previous section, we first describe the setup of our experiments
  201. and afterwards present the results and a discussion. The methods we employ for
  202. the preprocessing are described in~\Cref{sec:preprocessing:rq} and for the PINN,
  203. that we use, are described in~\Cref{sec:pinn:rsir}.
  204. % -------------------------------------------------------------------
  205. \subsection{Setup}
  206. \label{sec:rsir:setup}
  207. \paragraph{Synthetic Data:}For the purposes of validation, we create a synthetic dataset, by setting the parameter
  208. of $\alpha$ and the reproduction value each to a specific values, and solving~\Cref{eq:reduced_sir_ODE}
  209. for a given time interval. As in the synthetic data for the aforementioned
  210. experiments, we set $\alpha=\nicefrac{1}{3}$ and $\Rt$ to the values as can be
  211. seen in~\Cref{fig:Rt_dataset} as well as the population size
  212. $N=\expnumber{7.6}{6}$ and the initial amount of infected people to $I_0=10$.
  213. Furthermore, we set our simulated time span to 150 days. We use this dataset to
  214. demonstrate, that our method is working on a simple and minimal dataset.\\
  215. \paragraph{Real-World Data:}To obtain a dataset of the infectious group, consisting of the
  216. real-world data, we processed the data of the dataset
  217. \emph{COVID-19-Todesfälle in Deutschland}~\cite{GHDead} to extract the number of infections
  218. in Germany as a whole. For the German states, we use the data of \emph{SARS-CoV-2
  219. Infektionen in Deutschland}~\cite{GHInf}. In the preprocessing stage, we employ a constant
  220. rate for $\alpha$ to move individuals into the removed compartment. For each
  221. state we generate two datasets with a different recovery rate. First, we choose
  222. $\alpha = \nicefrac{1}{14}$, which aligns with the time of recovery~\cite{GHInf}.
  223. Second, we use $\alpha=\nicefrac{1}{5}$, as 5 days into the infection is the
  224. point at which the infectiousness is at its peak~\cite{COVInfo}.
  225. As in~\Cref{sec:sir}, we set the population size $N$ of each state and Germany
  226. to the corresponding size at the end of 2019. Furthermore, for the same reason
  227. we restrict the data points to an interval of 1200 days, beginning on 2020-03-09.
  228. 2020.\\
  229. \begin{figure}[t]
  230. \centering
  231. \begin{subfigure}{0.3\textwidth}
  232. \centering
  233. \includegraphics[width=\textwidth]{I_synth.pdf}
  234. \end{subfigure}
  235. \quad
  236. \begin{subfigure}{0.3\textwidth}
  237. \centering
  238. \includegraphics[width=\textwidth]{I_synth_r_t.pdf}
  239. \end{subfigure}
  240. \vskip\baselineskip
  241. \begin{subfigure}{0.67\textwidth}
  242. \centering
  243. \includegraphics[width=\textwidth]{datasets_states/Germany_datasets.pdf}
  244. \end{subfigure}
  245. \caption{The upper two graphics show the curve of the size of the
  246. infectious group (left) and the corresponding true reproduction value
  247. $\Rt$ (right) for the synthetic data. The lower graphic exemplary
  248. illustrates the different curves for Germany.}
  249. \label{fig:Rt_dataset}
  250. \end{figure}
  251. \paragraph{Training Parameters:}In order to achieve the desired output, the
  252. selected neural network architecture comprises of four hidden layers, each
  253. containing 100 neurons. The activation function is the tangens hyperbolicus
  254. function. For both the federal state and Germany, the physics loss is weighted
  255. by a factor of $\expnumber{1}{-6}$, whereas the data loss belonging to Germany
  256. is also weighted with a high factor of $\expnumber{1}{4}$, relative to the total
  257. loss. We found this approach to yield the best results. The model is trained
  258. using a base learning rate of $\expnumber{1}{-3}$, with the same scheduler and
  259. optimizer as we describe in~\Cref{sec:sir:setup}. We train the model for the federal
  260. states 20000 epochs and start the physics training after 10000 epochs, while we
  261. train for Germany for 25000 and start the physics training after 15000 epochs.
  262. To ensure the reliability of the results, we conduct ten trials of each experiment. For
  263. evaluation, we use the error $e_G$ as we do in the subsequent section.\\
  264. % -------------------------------------------------------------------
  265. \subsection{Results and Discussion}
  266. \label{sec:rsir:results}
  267. \begin{figure}[t]
  268. \centering
  269. \begin{subfigure}{0.45\textwidth}
  270. \includegraphics[width=\textwidth]{synthetic_I_prediction.pdf}
  271. \end{subfigure}
  272. \quad
  273. \begin{subfigure}{0.45\textwidth}
  274. \includegraphics[width=\textwidth]{synthetic_R_t_statistics.pdf}
  275. \end{subfigure}
  276. \caption{Results for the reproduction rate $\Rt$ on synthetic data. The
  277. left graphic show the prediction of the model regarding the $I$ group. The
  278. right graphic presents the predicted $\Rt$ against the true value, with the
  279. standard deviation.}
  280. \label{fig:r_t_synth_res}
  281. \end{figure}
  282. \Cref{fig:r_t_synth_res} illustrates the results of our experiments conducted on
  283. the synthetic dataset, which can be seen in~\Cref{fig:Rt_dataset}. It is evident
  284. that the model is capable of learning the infection data across all data points.
  285. The error for this is, $e_I = 0.0016$, which is of a negligible
  286. magnitude. An examination of the predictions for the reproduction number $\Rt$
  287. reveals that here as well, the model is capable of accurately delineating the
  288. value at each time point. However, during the first 30 days, the standard
  289. deviation exhibits an upward trend, while during the final 120 days, the
  290. predictions demonstrate remarkable precision.\\
  291. In~\Cref{fig:state_results}, we present the graphs of $\Rt$ for the state with
  292. the highest value of $\beta$, namely Thuringia, and for the state with the lowest
  293. $\beta$, namely Bremen. Further visualizations of the results
  294. can be found in~\Cref{chap:appendix}. In all datasets, the graphs with $\alpha =
  295. \nicefrac{1}{5}$ are of a smaller size than those with
  296. $\alpha = \nicefrac{1}{14}$. This is due to the fact that the individuals are
  297. being moved to the removed compartment at a faster rate. Resulting, it can be
  298. observed that the value of $\Rt$ is constantly remaining closer to the threshold
  299. of $\Rt=1$, while the reproduction number for datasets with $\alpha = \nicefrac{1}{14}$
  300. reaches values of up to 1.6. In states with higher values of $\beta$, the period
  301. during which the value of $\Rt$ is above the threshold of one 1 is longer, but
  302. the peak is lower. In states with a lower transmission rate, the period above 1
  303. is shorter, but the peak value is higher.\\
  304. \begin{figure}[t]
  305. \centering
  306. \begin{subfigure}{0.45\textwidth}
  307. \includegraphics[width=\textwidth]{r_t_cluster_intext.pdf}
  308. \end{subfigure}
  309. \begin{subfigure}{0.45\textwidth}
  310. \includegraphics[width=\textwidth]{I_cluster_intext.pdf}
  311. \end{subfigure}
  312. \label{fig:state_results}
  313. \caption{Visualization of the prediction of the training and the graphs of
  314. $\Rt$ for Thuringia (upper) and Bremen (lower) with both
  315. $\alpha = \nicefrac{1}{14}$ and $\alpha = \nicefrac{1}{5}$. Events~\cite{COVIDChronik} like
  316. the peak of an influential variant or the start of the vaccination of the public are marked horizontally. Further
  317. visualizations can be found in~\Cref{sec:r_t_results}.}
  318. \end{figure}
  319. \Cref{table:state_error} presents data regarding the discrepancy between the
  320. predicted and actual values from the dataset for compartment $I$. It is evident,
  321. that the error for all experiments falls within a range of values that is not
  322. negligible and will have an influence on the resulting reproduction values that
  323. are learned while fitting the data. A comparison of the results for the various
  324. values of $\alpha$ reveals that the errors associated with $\alpha = \nicefrac{1}{14}$
  325. are consistently smaller than for $\alpha = \nicefrac{1}{5}$, with the exception of Saxony and Germany. This can be
  326. attributed to the differing sizes of infection counts, particularly in relation
  327. to the normalization factor $C$. The model is unable to learn effectively if the
  328. values of the data loss $\mathcal{L}_{\text{data}}$ are too large or too small
  329. at the beginning.\\
  330. \begin{table}[t]
  331. \begin{center}
  332. \caption{For both $\alpha=\nicefrac{1}{14}$ and $\alpha=\nicefrac{1}{5}$
  333. this table presents the error $e_{\text{I}}$, calculated with~\Cref{eq:error},
  334. the average number of days with $\Rt > 1$, and
  335. the average peak values of $\Rt$ for all German states
  336. (MWP=Mecklenburg-Western Pomerania, NRW=North Rhine-Westphalia) and
  337. Germany. The average is formed across all
  338. 10 iteration.}
  339. \label{table:state_error}
  340. \begin{tabular}{lccccccc}
  341. \toprule
  342. & \multicolumn{2}{c}{$e_I$} & \phantom{0} & \multicolumn{2}{c}{days with $\Rt>1$} & \multicolumn{2}{c}{peak $\Rt$} \\
  343. \cmidrule{2-3}\cmidrule{5-6}\cmidrule{7-8}
  344. state name & $\alpha=\frac{1}{14}$ & $\alpha=\frac{1}{5}$ & \phantom{0} & $\alpha=\frac{1}{14}$ & $\alpha=\frac{1}{5}$ & $\alpha=\frac{1}{14}$ & $\alpha=\frac{1}{5}$ \\
  345. \midrule
  346. Schleswig-Holstein & 0.228 & 0.258 & \phantom{0} & 467.5 & 458.5 & 1.475 & 1.166 \\
  347. Hamburg & 0.265 & 0.330 & \phantom{0} & 424.3 & 409.8 & 1.500 & 1.297 \\
  348. Lower Saxony & 0.224 & 0.340 & \phantom{0} & 413.1 & 430.3 & 1.662 & 1.223 \\
  349. Bremen & 0.246 & 0.380 & \phantom{0} & 468.6 & 539.1 & 1.582 & 1.179 \\
  350. NRW & 0.185 & 0.252 & \phantom{0} & 486.3 & 602.0 & 1.573 & 1.205 \\
  351. Hesse & 0.302 & 0.346 & \phantom{0} & 553.0 & 511.2 & 1.409 & 1.157 \\
  352. Rhineland-Palatinate & 0.256 & 0.277 & \phantom{0} & 484.7 & 404.7 & 1.534 & 1.175 \\
  353. Baden-Württemberg & 0.198 & 0.284 & \phantom{0} & 469.2 & 590.0 & 1.457 & 1.180 \\
  354. Bavaria & 0.225 & 0.318 & \phantom{0} & 490.5 & 486.1 & 1.428 & 1.199 \\
  355. Saarland & 0.284 & 0.408 & \phantom{0} & 500.2 & 564.7 & 1.515 & 1.180 \\
  356. Berlin & 0.201 & 0.240 & \phantom{0} & 591.9 & 514.4 & 1.721 & 1.262 \\
  357. Brandenburg & 0.237 & 0.242 & \phantom{0} & 555.9 & 596.3 & 1.447 & 1.159 \\
  358. MWP & 0.170 & 0.257 & \phantom{0} & 537.5 & 544.3 & 1.563 & 1.135 \\
  359. Saxony & 0.292 & 0.256 & \phantom{0} & 722.3 & 695.4 & 1.790 & 1.407 \\
  360. Saxony-Anhalt & 0.213 & 0.268 & \phantom{0} & 572.0 & 631.9 & 1.387 & 1.165 \\
  361. Thuringia & 0.180 & 0.222 & \phantom{0} & 732.1 & 730.6 & 1.586 & 1.249 \\
  362. \midrule
  363. Germany & 0.284 & 0.239 & \phantom{0} & 587.7 & 430.7 & 1.561 & 1.219 \\
  364. \bottomrule
  365. \end{tabular}
  366. \end{center}
  367. \end{table}
  368. As illustrated in~\Cref{fig:state_results}, the training data is overlaid with the
  369. corresponding prediction of the model. We can observe that the prediction, though
  370. an exact reconstruction, accurately captures the general trajectory of the
  371. pandemic. The model's prediction demonstrates an ability to capture larger
  372. peaks, exhibiting a tendency to ignore smaller changes. This suggests that the
  373. prediction of the model is capable show the rough outline of the progression of COVID-19. In the
  374. beginning, the majority of predictions below $\Rt=1$, indicating an outbreak.
  375. As we observed in the synthetic data, the model exhibits a higher standard
  376. deviation at the boundaries. In the graphs, we mark the peaks of the most severe
  377. COVID-19 variants in Germany~\cite{COVIDChronik}. While the peaks of the
  378. Alpha and Delta variants are clearly visible in the data, the model does not
  379. learn these, and thus they are not reflected in the results. The peak of the
  380. Omicron variant represents the culmination of the COVID-19 pandemic in Germany
  381. and can be identified as the most prominent peak in the dataset. Immediately preceding this peak, we observe the highest
  382. value of the reproduction number across all states. This phenomenon can be explained, by
  383. number of individuals infected by one infectious person reaching its peak. In
  384. some states the peaks of other Omicron variants after the maximum peak are visible (see Thuringia).\\
  385. The experiments demonstrate, that our model encounters difficulties in learning the data for the
  386. states and Germany and consequently in predicting the reproduction values for each dataset.
  387. Nonetheless, the predictions illustrate the general trends of the most impactful
  388. events of the COVID-19 pandemic.\\
  389. % -------------------------------------------------------------------