chap04.tex 28 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414415416417418
  1. % %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  2. % Author: Phillip Rothenbeck
  3. % Title: Investigating the Evolution of the COVID-19 Pandemic in Germany Using Physics-Informed Neural Networks
  4. % File: chap04/chap04.tex
  5. % Part: Experiments
  6. % Description:
  7. % summary of the content in this chapter
  8. % Version: 01.01.2012
  9. % %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  10. \chapter{Experiments}
  11. \label{chap:evaluation}
  12. In ~\Cref{chap:methods}, we explain the methods based the theoretical
  13. background, that we established in~\Cref{chap:background}. In this chapter, we
  14. present the setups and results from the experiments and simulations. First, we
  15. discuss the experiments dedicated to identify the epidemiological transition
  16. rates of $\beta$ and $\alpha$ in synthetic and real-world data. Second, we
  17. examine the reproduction number in synthetic and real-world data of Germany.
  18. % -------------------------------------------------------------------
  19. \section{Identifying the Transmission and Recovery Rates}
  20. \label{sec:sir}
  21. In this section, we aim to identify the transmission rate $\beta$ and the
  22. recovery rate $\alpha$ from either synthetic or preprocessed real-world data.
  23. The methodology that we employ to identify the epidemiological parameters is described
  24. in~\Cref{sec:pinn:sir}. Meanwhile, the methods we utilize to preprocess the
  25. real-world data are detailed in~\Cref{sec:preprocessing:rq}. In the first part
  26. we present the setup of our experiments, then we provide the results including a
  27. discussion.\\
  28. % -------------------------------------------------------------------
  29. \subsection{Setup}
  30. \label{sec:sir:setup}
  31. \paragraph{Synthetic Data:}In order to validate our method, we first generate a dataset of synthetic data.
  32. We achieve this by solving~\Cref{eq:modSIR} for a given set of parameters.
  33. The parameters are set to $\alpha = \nicefrac{1}{3}$ and $\beta = \nicefrac{1}{2}$.
  34. The size of the population is $N = \expnumber{7.6}{6}$ and the initial amount of
  35. infectious individuals is $I_0 = 10$. We conduct the simulation over 150
  36. days, resulting in a dataset of the form of~\Cref{fig:synthetic_SIR}.\\
  37. \paragraph{Real-World Data:}In order to process the real-world RKI data, it is
  38. necessary to preprocess the raw data for each state and Germany separately.
  39. This is achieved by utilizing a recovery queue with a recovery period of 14
  40. days. With regard to population size of each state, we set it to the respective
  41. value counted at the end of
  42. 2019\footnote{{\tiny \url{https://de.statista.com/statistik/kategorien/kategorie/8/themen/63/branche/demographie/\#overview}}}.
  43. The initial number of infectious individuals is set to the number of infected
  44. people on March 09. 2020 from the dataset. The data we extract spans from
  45. March 09. 2020 to June 22. 2023, encompassing a period of 1200 days and
  46. representing the time span during which the COVID-19 disease was the most
  47. active and severe.
  48. \begin{figure}[h]
  49. \centering
  50. \includegraphics[width=\textwidth]{in_text_SIR.pdf}
  51. \caption{Synthetic and real-world training data. The synthetic data is
  52. generated with $\alpha=\nicefrac{1}{3}$ and $\beta=\nicefrac{1}{2}$
  53. and~\Cref{eq:modSIR}. The Germany data is taken from the death case
  54. data set. Exemplatory we show illustrations of the datasets of Schleswig
  55. Holstein, Berlin, and Thuringia. For the other states see~\Cref{chap:appendix} }
  56. \label{fig:datasets_sir}
  57. \end{figure}
  58. \paragraph{Training Parameters:}The PINN that we utilize comprises of seven
  59. hidden layers with twenty neurons each, and an activation function of ReLU. We
  60. follow the hyperparameter setting in~\cite{Shaier2021} but change the base
  61. learning rate to $\expnumber{1}{-3}$. And employ a polynomial scheduler
  62. implementation from the PyTorch library~\cite{Paszke2019} instead. We train the
  63. model for 10000 iterations to extract the parameters. For each set of parameters, we
  64. conduct five runs to demonstrate stability of the values. For measuring the
  65. accuracy, we calculate the \emph{Relative L2 Error} $e$. Let $G$ be the set of
  66. compartment training data the SIR model with $\boldsymbol{g}\in G$ and $\hat{\boldsymbol{g}}$ be the
  67. corresponding model prediction, then,
  68. \begin{equation}
  69. e_{G} = \frac{1}{|G|}\sum_{g\in G}^{}\frac{\Big\|\hat{\boldsymbol{g}} - \boldsymbol{g}\Big\|_2}{\Big\|\boldsymbol{g}\Big\|_2},
  70. \end{equation}
  71. is the average error across all three compartments.
  72. % -------------------------------------------------------------------
  73. \subsection{Results}
  74. \label{sec:sir:results}
  75. In this section, we start by examining the results for the synthetic dataset,
  76. focusing the accuracy and reproducibility. We then proceed to present and
  77. discuss the results for the German states and Germany.\\
  78. The results of the experiment regarding the synthetic data can be seen
  79. in~\Cref{table:alpha_beta_synth}. The error and the standard variation for both
  80. parameters are negligible small. Taking the mean of the parameters across the
  81. five iterations yields more accurate results.\\
  82. \begin{table}[h]
  83. \begin{center}
  84. \caption{Simulation results for the synthetic data. The true values and
  85. the respective mean parameter is given.}
  86. \label{table:alpha_beta_synth}
  87. \begin{tabular}{ccccccccc}
  88. \toprule
  89. \multicolumn{2}{c}{$\alpha$} & \phantom{0} & \multicolumn{2}{c}{$\beta$} \\
  90. \cmidrule{1-2}\cmidrule{4-5}
  91. true & $\mu$ & \phantom{0} & true & $\mu$ & \phantom{0} & $e_{SIR}$ \\
  92. \midrule
  93. 0.333 & 0.333{\tiny$\pm 0.001$} & \phantom{0} & 0.500 & 0.500{\tiny$\pm 0.002$} & \phantom{0} & 0.004 \\
  94. \bottomrule
  95. \end{tabular}
  96. \end{center}
  97. \end{table}
  98. The results demonstrate that the model is capable of approximating the correct
  99. parameters for the small, synthetic dataset in each of the five iterations.
  100. The mean of the predicted values results in values with a sufficiently small
  101. error. Thus, we argue that our selected method is well suited to analyze real
  102. world pandemic data collected in Germany.\\
  103. In~\Cref{table:state_mean_std} we present the results of the training for the
  104. real-world data. The results are presented from top to bottom, in the order of
  105. the community identification number, with the last entry being Germany. Both
  106. the mean $\mu$ and the standard deviation $\sigma$ are calculated across all
  107. five iterations of our experiment. We can observe that the error $e_{SIR}$ is
  108. the highest for \emph{Saxony} and the lowest for \emph{Lower Saxony}.
  109. Furthermore, we include the distance $\Delta\beta_{\text{Germany}} = \beta_{\text{state}} - \beta_{\text{Germany}}$
  110. and the percentage of people that have a basic immunity through vaccination
  111. $\nu$ for each state provided by the Robert Koch Institute~\cite{FMH}.\\
  112. \begin{table}[h]
  113. \begin{center}
  114. \caption{Mean and standard deviation, error $e_{SIR}$ and the distance
  115. $\Delta\beta_{\text{Germany}} = \beta_{\text{state}} - \beta_{\text{Germany}}$
  116. across the 5 iterations, that we conducted for each German state and Germany
  117. as the whole country. Furthermore we include the vaccination percentage
  118. $\nu$ provided from the RKIe~\cite{FMH}.}
  119. \label{table:state_mean_std}
  120. \begin{tabular}{lccccc}
  121. \toprule
  122. state name & $\alpha$ & $\beta$ & $e_{SIR}$ & $\Delta\beta_{\text{Germany}}$ & $\nu$ [\%] \\
  123. \midrule
  124. Schleswig Holstein & 0.076{\tiny$\pm0.001$} & 0.095{\tiny$\pm 0.001$} & 0.085 & -0.013 & 79.5 \\
  125. Hamburg & 0.082{\tiny$\pm0.001$} & 0.104{\tiny$\pm 0.001$} & 0.095 & -0.004 & 84.5 \\
  126. Lower Saxony & 0.075{\tiny$\pm0.002$} & 0.097{\tiny$\pm 0.002$} & 0.077 & -0.011 & 77.6 \\
  127. Bremen & 0.058{\tiny$\pm0.002$} & 0.078{\tiny$\pm 0.002$} & 0.093 & -0.030 & 88.3 \\
  128. NRW & 0.079{\tiny$\pm0.001$} & 0.101{\tiny$\pm 0.001$} & 0.078 & -0.007 & 79.5 \\
  129. Hesse & 0.065{\tiny$\pm0.001$} & 0.085{\tiny$\pm 0.001$} & 0.102 & -0.023 & 75.8 \\
  130. Rhineland-Palatinate & 0.085{\tiny$\pm0.004$} & 0.108{\tiny$\pm 0.004$} & 0.090 & 0.001 & 75.6 \\
  131. Baden-Württemberg & 0.091{\tiny$\pm0.002$} & 0.118{\tiny$\pm 0.003$} & 0.080 & 0.010 & 74.5 \\
  132. Bavaria & 0.085{\tiny$\pm0.004$} & 0.116{\tiny$\pm 0.005$} & 0.095 & 0.008 & 75.1 \\
  133. Saarland & 0.075{\tiny$\pm0.002$} & 0.099{\tiny$\pm 0.003$} & 0.108 & -0.009 & 82.4 \\
  134. Berlin & 0.087{\tiny$\pm0.001$} & 0.109{\tiny$\pm 0.001$} & 0.067 & 0.001 & 78.1 \\
  135. Brandenburg & 0.087{\tiny$\pm0.003$} & 0.110{\tiny$\pm 0.003$} & 0.072 & 0.002 & 68.1 \\
  136. MV & 0.089{\tiny$\pm0.002$} & 0.114{\tiny$\pm 0.002$} & 0.054 & 0.006 & 74.7 \\
  137. Saxony & 0.075{\tiny$\pm0.002$} & 0.099{\tiny$\pm 0.002$} & 0.111 & -0.009 & 65.1 \\
  138. Saxony-Anhalt & 0.092{\tiny$\pm0.003$} & 0.119{\tiny$\pm 0.005$} & 0.079 & 0.011 & 74.1 \\
  139. Thuringia & 0.091{\tiny$\pm0.002$} & 0.119{\tiny$\pm 0.003$} & 0.084 & 0.011 & 70.3 \\
  140. \midrule
  141. Germany & 0.083{\tiny$\pm0.001$} & 0.108{\tiny$\pm 0.002$} & 0.080 & 0.000 & 76.4 \\
  142. \bottomrule
  143. \end{tabular}
  144. \end{center}
  145. \end{table}
  146. \begin{figure}[t]
  147. \centering
  148. \includegraphics[width=\textwidth]{mean_std_alpha_beta_res.pdf}
  149. \caption{Visualization of the mean and standard deviation of the transition
  150. rates $\alpha$ and $\beta$ for each state compared to the mean values of
  151. $\alpha$ and $\beta$ for Germany.}
  152. \label{fig:alpha_beta_mean_std}
  153. \end{figure}
  154. In~\Cref{fig:alpha_beta_mean_std}, we present a visual representation of the
  155. means and standard deviations in comparison to the national values. It is
  156. noteworthy that the states of Saxony-Anhalt and Thuringia have the highest
  157. transmission rates of all states, while Bremen and Hesse have the lowest
  158. values for $\beta$. The transmission rates of Hamburg, Baden Württemberg,
  159. Bavaria, and all eastern states lay above the national rate of transmission.
  160. Similarly, the recovery rate yields comparable outcomes. For the recovery rate,
  161. the same states that exhibit a transmission rate exceeding the national value,
  162. have a higher recovery rate than the national standard, with the exception of
  163. Saxony. It is noteworthy that the recovery rates of all states exhibit a
  164. tendency to align with the recovery rate of $\alpha=\nicefrac{1}{14}$, which is
  165. equivalent to a recovery period of 14 days. When
  166. calculating the correlation coefficient between the predicted transmission rate
  167. and the vaccination ratio, we get a value of $-0.5134$. The strong negative
  168. correlation indicates that the transmission rate is high when the vaccination
  169. ratio is low, and vice versa. This shows that the impact of the vaccines can be
  170. witnessed in our results. \\
  171. It is evident that there is a correlation between the values of $\alpha$ and
  172. $\beta$ for each state. States with a high transmission rate tend to have a
  173. high recovery rate, and vice versa. The correlation between $\alpha$ and
  174. $\beta$ can be explained by the implicate definition of $\alpha$ using a
  175. recovery queue with a constant recovery period of 14 days. This might result to
  176. the PINN not learning $\alpha$ as a standalone parameter but rather as a
  177. function of the transmission rate $\beta$. This phenomenon occurs because the
  178. transmission rate determines the number of individuals that get infected per
  179. day, and the recovery queue moves a proportional number of people to the
  180. removed compartment. Consequently, a number of people defined by $\beta$ move
  181. to the $R$ compartment 14 days after they were infected. Furthermore,
  182. in~\Cref{sec:pandemicModel:rsir}, we discussed the reproduction number $\Rt$,
  183. which describes the number of individuals infected by one infectious individual.
  184. This can be another reason for the observed correlation, as $\Rt$ depends on
  185. both $\alpha$ and $\beta$ (see~\Cref{eq:repr_num}), which illustrates that both
  186. parameters are influenced by changes to the reproductivity of the disease.\\
  187. This issue can be addressed by reducing the SIR model, thereby eliminating the
  188. significance of the $R$ compartment size. In the following section, we present
  189. our experiments for the reduced SIR model with time-dependent parameters.
  190. % -------------------------------------------------------------------
  191. \section{Identifying the Reproduction Number}
  192. \label{sec:rsir}
  193. In this section we describe the experiments we conduct to identify the
  194. time-dependent reproduction number for both synthetic and real-world data.
  195. Similar to the previous section, we first describe the setup of our experiments
  196. and afterwards present the results and a discussion. The methods we employ for
  197. the preprocessing are described in~\Cref{sec:preprocessing:rq} and for the PINN,
  198. that we use, are described in~\Cref{sec:pinn:rsir}.
  199. % -------------------------------------------------------------------
  200. \subsection{Setup}
  201. \label{sec:rsir:setup}
  202. \paragraph{Synthetic Data:}For the purposes of validation, we create a synthetic dataset, by setting the parameter
  203. of $\alpha$ and the reproduction value each to a specific values, and solving~\Cref{eq:reduced_sir_ODE}
  204. for a given time interval. As in the synthetic data for the aforementioned
  205. experiments, we set $\alpha=\nicefrac{1}{3}$ and $\Rt$ to the values as can be
  206. seen in~\Cref{fig:Rt_dataset} as well as the population size
  207. $N=\expnumber{7.6}{6}$ and the initial amount of infected people to $I_0=10$.
  208. Furthermore, we set our simulated time span to 150 days. We use this dataset to
  209. demonstrate, that our method is working on a simple and minimal dataset.\\
  210. \paragraph{Real-World Data:}To obtain a dataset of the infectious group, consisting of the
  211. real-world data, we processed the data of the dataset
  212. \emph{COVID-19-Todesfälle in Deutschland}~\cite{GHDead} to extract the number of infections
  213. in Germany as a whole. For the German states, we use the data of \emph{SARS-CoV-2
  214. Infektionen in Deutschland}~\cite{GHInf}. In the preprocessing stage, we employ a constant
  215. rate for $\alpha$ to move individuals into the removed compartment. For each
  216. state we generate two datasets with a different recovery rate. First, we choose
  217. $\alpha = \nicefrac{1}{14}$, which aligns with the time of recovery~\cite{GHInf}.
  218. Second, we use $\alpha=\nicefrac{1}{5}$, as 5 days into the infection is the
  219. point at which the infectiousness is at its peak~\cite{COVInfo}.
  220. As in~\Cref{sec:sir}, we set the population size $N$ of each state and Germany
  221. to the corresponding size at the end of 2019. Furthermore, for the same reason
  222. we restrict the data points to an interval of 1200 days, beginning on March 09.
  223. 2020.\\
  224. \begin{figure}[t]
  225. \centering
  226. \begin{subfigure}{0.3\textwidth}
  227. \centering
  228. \includegraphics[width=\textwidth]{I_synth.pdf}
  229. \end{subfigure}
  230. \quad
  231. \begin{subfigure}{0.3\textwidth}
  232. \centering
  233. \includegraphics[width=\textwidth]{I_synth_r_t.pdf}
  234. \end{subfigure}
  235. \vskip\baselineskip
  236. \begin{subfigure}{0.67\textwidth}
  237. \centering
  238. \includegraphics[width=\textwidth]{datasets_states/Germany_datasets.pdf}
  239. \end{subfigure}
  240. \caption{The upper two graphics show the curve of the size of the
  241. infectious group (left) and the corresponding true reproduction value
  242. $\Rt$ (right) for the synthetic data. The lower graphic exemplary
  243. illustrates the different curves for Germany.}
  244. \label{fig:Rt_dataset}
  245. \end{figure}
  246. \paragraph{Training Parameters:}In order to achieve the desired output, the
  247. selected neural network architecture comprises of four hidden layers, each
  248. containing 100 neurons. The activation function is the tangens hyperbolicus
  249. function. For both the federal state and Germany, the physics loss is weighted
  250. by a factor of $\expnumber{1}{-6}$, whereas the data loss belonging to Germany
  251. is also weighted with a high factor of $\expnumber{1}{4}$, relative to the total
  252. loss. We found this approach to yield the best results. The model is trained
  253. using a base learning rate of $\expnumber{1}{-3}$, with the same scheduler and
  254. optimizer as we describe in~\Cref{sec:sir:setup}. We train the model for the
  255. states 20000 epochs and start the physics training after 10000 epochs, while we
  256. train for Germany for 25000 and start the physics training after 15000 epochs.
  257. To ensure the reliability of the results, we conduct ten trials of each experiment. For
  258. evaluation, we use the error $e_G$ as we do in the subsequent section.\\
  259. % -------------------------------------------------------------------
  260. \subsection{Results}
  261. \label{sec:rsir:results}
  262. \Cref{fig:synth_results} illustrates the results of our experiments conducted on
  263. the synthetic dataset, which can be seen in~\Cref{fig:Rt_dataset}. It is evident
  264. that the model is capable of learning the infection data across all data points.
  265. The error for this is, $e_I = 0.0016$, which is of a negligible
  266. magnitude.\\
  267. \begin{figure}[h]
  268. \centering
  269. \begin{subfigure}{0.45\textwidth}
  270. \includegraphics[width=\textwidth]{synthetic_I_prediction.pdf}
  271. \end{subfigure}
  272. \quad
  273. \begin{subfigure}{0.45\textwidth}
  274. \includegraphics[width=\textwidth]{synthetic_R_t_statistics.pdf}
  275. \end{subfigure}
  276. \label{fig:synth_results}
  277. \caption{Results for the reproduction rate $\Rt$ on synthetic data. The
  278. left graphic show the prediction of the model regarding the $I$ group. The
  279. right graphic presents the predicted $\Rt$ against the true value, with the
  280. standard deviation.}
  281. \end{figure}
  282. An examination of the predictions for the representation value $\Rt$ reveals
  283. that here as well, the model is capable of accurately delineating the value at
  284. each time point. However, during the first 30 days, the standard deviation is
  285. exhibits an upward trend, while during the final 120 days, the predictions
  286. demonstrate remarkable precision.\\
  287. In~\Cref{fig:state_results}, we present the graphs of $\Rt$ for the state with
  288. the highest value of $\beta$, namely Thuringia, and for the state with the
  289. $\beta$, namely Bremen. Further visualizations of the results
  290. can be found in~\Cref{chap:appendix}. In all datasets, the graphs with $\alpha =
  291. \nicefrac{1}{5}$ are of a smaller size than those with
  292. $\alpha = \nicefrac{1}{14}$. This is due to the fact that the individuals are
  293. being moved to the removed compartment at a faster rate. Resulting, it can be
  294. observed that the value of $\Rt$ is constantly remaining closer to the threshold
  295. of $\Rt=1$, while the reproduction number for datasets with $\alpha = \nicefrac{1}{14}$
  296. reaches values of up to 1.6. In states with higher values of $\beta$, the period
  297. during which the value of $\Rt$ is above the threshold of one 1 is longer, but
  298. the peak is lower. In states with a lower transmission rate, the period above 1
  299. is shorter, but the peak value is higher.\\
  300. \begin{figure}[t]
  301. \centering
  302. \begin{subfigure}{0.45\textwidth}
  303. \includegraphics[width=\textwidth]{r_t_cluster_intext.pdf}
  304. \end{subfigure}
  305. \begin{subfigure}{0.45\textwidth}
  306. \includegraphics[width=\textwidth]{I_cluster_intext.pdf}
  307. \end{subfigure}
  308. \label{fig:state_results}
  309. \caption{Visualization of the prediction of the training and the graphs of
  310. $\Rt$ for Thuringia (upper) and Bremen (lower) with both
  311. $\alpha = \nicefrac{1}{14}$ and $\alpha = \nicefrac{1}{5}$. Events~\cite{COVIDChronik} like
  312. the peak of an influential variant or the start of the vaccination of the public are marked horizontally. Further
  313. visualizations can be found in~\Cref{chap:appendix}.}
  314. \end{figure}
  315. \Cref{table:state_error} presents data regarding the discrepancy between the
  316. predicted and actual values from the dataset for compartment $I$. It is evident,
  317. that the error for all experiments falls within a range of values that is not
  318. negligible and will have an influence on the resulting reproduction values that
  319. are learned while fitting the data. A comparison of the results for the various
  320. values of $\alpha$ reveals that the errors associated with $\alpha = \nicefrac{1}{14}$
  321. are consistently smaller, with the exception of Saxony and Germany. This can be
  322. attributed to the differing sizes of infection counts, particularly in relation
  323. to the normalization factor $C$. The model is unable to learn effectively if the
  324. values of the data loss $\mathcal{L}_{\text{data}}$ are too large or too small
  325. at the beginning.\\
  326. \begin{table}[t]
  327. \begin{center}
  328. \caption{This table displays all average values of the error $e_{\text{I}}$
  329. for all German states and Germany. The average is formed across all
  330. 10 iteration.}
  331. \label{table:state_error}
  332. \begin{tabular}{lccccccc}
  333. \toprule
  334. & \multicolumn{2}{c}{$e_I$} & \phantom{0} & \multicolumn{2}{c}{days with $\Rt>1$} & \multicolumn{2}{c}{peak $\Rt$} \\
  335. \cmidrule{2-3}\cmidrule{5-6}\cmidrule{7-8}
  336. state name & $\alpha=\frac{1}{14}$ & $\alpha=\frac{1}{5}$ & \phantom{0} & $\alpha=\frac{1}{14}$ & $\alpha=\frac{1}{5}$ & $\alpha=\frac{1}{14}$ & $\alpha=\frac{1}{5}$ \\
  337. \midrule
  338. Schleswig Holstein & 0.228 & 0.258 & \phantom{0} & 467.5 & 458.5 & 1.475 & 1.166 \\
  339. Hamburg & 0.265 & 0.330 & \phantom{0} & 424.3 & 409.8 & 1.500 & 1.297 \\
  340. Lower Saxony & 0.224 & 0.340 & \phantom{0} & 413.1 & 430.3 & 1.662 & 1.223 \\
  341. Bremen & 0.246 & 0.380 & \phantom{0} & 468.6 & 539.1 & 1.582 & 1.179 \\
  342. NRW & 0.185 & 0.252 & \phantom{0} & 486.3 & 602.0 & 1.573 & 1.205 \\
  343. Hesse & 0.302 & 0.346 & \phantom{0} & 553.0 & 511.2 & 1.409 & 1.157 \\
  344. Rhineland-Palatinate & 0.256 & 0.277 & \phantom{0} & 484.7 & 404.7 & 1.534 & 1.175 \\
  345. Baden-Württemberg & 0.198 & 0.284 & \phantom{0} & 469.2 & 590.0 & 1.457 & 1.180 \\
  346. Bavaria & 0.225 & 0.318 & \phantom{0} & 490.5 & 486.1 & 1.428 & 1.199 \\
  347. Saarland & 0.284 & 0.408 & \phantom{0} & 500.2 & 564.7 & 1.515 & 1.180 \\
  348. Berlin & 0.201 & 0.240 & \phantom{0} & 591.9 & 514.4 & 1.721 & 1.262 \\
  349. Brandenburg & 0.237 & 0.242 & \phantom{0} & 555.9 & 596.3 & 1.447 & 1.159 \\
  350. MV & 0.170 & 0.257 & \phantom{0} & 537.5 & 544.3 & 1.563 & 1.135 \\
  351. Saxony & 0.292 & 0.256 & \phantom{0} & 722.3 & 695.4 & 1.790 & 1.407 \\
  352. Saxony-Anhalt & 0.213 & 0.268 & \phantom{0} & 572.0 & 631.9 & 1.387 & 1.165 \\
  353. Thuringia & 0.180 & 0.222 & \phantom{0} & 732.1 & 730.6 & 1.586 & 1.249 \\
  354. \midrule
  355. Germany & 0.284 & 0.239 & \phantom{0} & 587.7 & 430.7 & 1.561 & 1.219 \\
  356. \bottomrule
  357. \end{tabular}
  358. \end{center}
  359. \end{table}
  360. As illustrated in~\Cref{fig:state_results}, the training data is overlaid with the
  361. corresponding prediction of the model. We can observe that the prediction, though
  362. an exact reconstruction, accurately captures the general trajectory of the
  363. pandemic. The model's prediction demonstrates an ability to capture larger
  364. peaks, exhibiting a tendency to ignore smaller changes. This suggests that the
  365. prediction of the model is capable show the rough outline of the progression of COVID-19. In the
  366. beginning, the majority of predictions below $\Rt=1$, indicating an outbreak.
  367. As we observed in the synthetic data, the model exhibits a higher standard
  368. deviation at the boundaries. In the graphs, we mark the peaks of the most severe
  369. COVID-19 variants in Germany~\cite{COVIDChronik}. While the peaks of the
  370. Alpha and Delta variants are clearly visible in the data, the model does not
  371. learn these, and thus they are not reflected in the results. The peak of the
  372. Omicron variant represents the culmination of the COVID-19 pandemic in Germany
  373. and can be identified as the most prominent peak in the dataset. Immediately preceding this peak, we observe the highest
  374. value of the reproduction number across all states. This phenomenon can be explained, by
  375. number of individuals infected by one infectious person reaching its peak. In
  376. some states the peaks of other Omicron variants after the maximum peak are visible (see Thuringia).\\
  377. The experiments demonstrate, that our model encounters difficulties in learning the data for the
  378. states and Germany and consequently in predicting the reproduction values for each dataset.
  379. Nonetheless, the predictions illustrate the general trends of the most impactful
  380. events of the COVID-19 pandemic.\\
  381. % -------------------------------------------------------------------