chap04.tex 29 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414415416417418419420421422423424425426427428429430431432433434435436437438439440441442443444445446447448449450451452453454455456457458459
  1. % %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  2. % Author: Phillip Rothenbeck
  3. % Title: Investigating the Evolution of the COVID-19 Pandemic in Germany Using Physics-Informed Neural Networks
  4. % File: chap04/chap04.tex
  5. % Part: Experiments
  6. % Description:
  7. % summary of the content in this chapter
  8. % Version: 01.01.2012
  9. % %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  10. \chapter{Experiments}
  11. \label{chap:evaluation}
  12. In ~\Cref{chap:methods}, we explain the methods based the theoretical
  13. background, that we established in~\Cref{chap:background}. In this chapter, we
  14. present the setups and results from the experiments and simulations. First, we
  15. discuss the experiments dedicated to identify the epidemiological transition
  16. rates of $\beta$ and $\alpha$ in synthetic and real-world data. Second, we
  17. examine the reproduction number in synthetic and real-world data of Germany.
  18. % -------------------------------------------------------------------
  19. \section{Identifying the Transition Rates}
  20. \label{sec:sir}
  21. In this section, we aim to identify the transmission rate $\beta$ and the
  22. recovery rate $\alpha$ from either synthetic or preprocessed real-world data.
  23. The methodology that we employ to identify the transition rates is described
  24. in~\Cref{sec:pinn:sir}. Meanwhile, the methods we utilize to preprocess the
  25. real-world data are detailed in~\Cref{sec:preprocessing:rq}. In the first part
  26. we present the setup of our experiments, then we provide the results including a
  27. discussion.\\
  28. % -------------------------------------------------------------------
  29. \subsection{Setup}
  30. \label{sec:sir:setup}
  31. \paragraph{Synthetic Data:}In order to validate our method, we first generate a dataset of synthetic data.
  32. We achieve this by solving~\Cref{eq:modSIR} for a given set of parameters.
  33. The parameters are set to $\alpha = \nicefrac{1}{3}$ and $\beta = \nicefrac{1}{2}$.
  34. The size of the population is $N = \expnumber{7.6}{6}$ and the initial amount of
  35. infectious individuals is $I_0 = 10$. We conduct the simulation over 150
  36. days, resulting in a dataset of the form of~\Cref{fig:synthetic_SIR}.\\
  37. \paragraph{Real-World Data:}In order to process the real-world RKI data, it is
  38. necessary to preprocess the raw data for each state and Germany separately.
  39. This is achieved by utilizing a recovery queue with a recovery period of 14
  40. days. With regard to population size of each state, we set it to the respective
  41. value counted at the end of
  42. 2019\footnote{{\tiny \url{https://de.statista.com/statistik/kategorien/kategorie/8/themen/63/branche/demographie/\#overview}}}.
  43. The initial number of infectious individuals is set to the number of infected
  44. people on March 09. 2020 from the dataset. The data we extract spans from
  45. March 09. 2020 to June 22. 2023, encompassing a period of 1200 days and
  46. representing the time span during which the COVID-19 disease was the most
  47. active and severe.
  48. \begin{figure}[h]
  49. %\centering
  50. \setlength{\unitlength}{1cm} % Set the unit length for coordinates
  51. \begin{picture}(12, 9.5) % Specify the size of the picture environment (width, height)
  52. \put(1.5, 4.5){
  53. \begin{subfigure}{0.3\textwidth}
  54. \centering
  55. \includegraphics[width=\textwidth]{SIR_synth.pdf}
  56. \label{fig:synthetic_SIR}
  57. \end{subfigure}
  58. }
  59. \put(8, 4.5){
  60. \begin{subfigure}{0.3\textwidth}
  61. \centering
  62. \includegraphics[width=\textwidth]{datasets_states/Germany_SIR_14.pdf}
  63. \label{fig:germany_sir}
  64. \end{subfigure}
  65. }
  66. \put(0, 0){
  67. \begin{subfigure}{0.3\textwidth}
  68. \centering
  69. \includegraphics[width=\textwidth]{datasets_states/Schleswig_Holstein_SIR_14.pdf}
  70. \label{fig:schleswig_holstein_sir}
  71. \end{subfigure}
  72. }
  73. \put(4.75, 0){
  74. \begin{subfigure}{0.3\textwidth}
  75. \centering
  76. \includegraphics[width=\textwidth]{datasets_states/Berlin_SIR_14.pdf}
  77. \label{fig:berlin_sir}
  78. \end{subfigure}
  79. }
  80. \put(9.5, 0){
  81. \begin{subfigure}{0.3\textwidth}
  82. \centering
  83. \includegraphics[width=\textwidth]{datasets_states/Thueringen_SIR_14.pdf}
  84. \label{fig:thüringen_sir}
  85. \end{subfigure}
  86. }
  87. \end{picture}
  88. \caption{Synthetic and real-world training data. The synthetic data is
  89. generated with $\alpha=\nicefrac{1}{3}$ and $\beta=\nicefrac{1}{2}$
  90. and~\Cref{eq:modSIR}. The Germany data is taken from the death case
  91. data set. Exemplatory we show illustrations of the datasets of Schleswig
  92. Holstein, Berlin, and Thuringia. For the other states see~\Cref{chap:appendix} }
  93. \label{fig:datasets_sir}
  94. \end{figure}
  95. \paragraph{Training Parameters:}The PINN that we utilize comprises of seven
  96. hidden layers with twenty neurons each, and an activation function of ReLU. We
  97. follow the hyperparameter setting in~\cite{Shaier2021} but change the base
  98. learning rate to $\expnumber{1}{-3}$. And employ a polynomial scheduler
  99. implementation from the PyTorch library~\cite{Paszke2019} instead. We train the
  100. model for 10000 epochs to extract the parameters. For each set of parameters, we
  101. conduct five iterations to demonstrate stability of the values. For measuring the
  102. accuracy, we calculate the error $e$, using the 2-Norm. Let $G$ be the set of
  103. compartment training data the SIR model with $\boldsymbol{g}\in G$ and $\hat{\boldsymbol{g}}$ be the
  104. corresponding model prediction, then,
  105. \begin{equation}
  106. e_{G} = \frac{1}{|G|}\sum_{g\in G}^{}\frac{\Big\|\hat{\boldsymbol{g}} - \boldsymbol{g}\Big\|_2}{\Big\|\boldsymbol{g}\Big\|_2},
  107. \end{equation}
  108. is the average error across all three groups.
  109. % -------------------------------------------------------------------
  110. \subsection{Results}
  111. \label{sec:sir:results}
  112. In this section, we start by examining the results for the synthetic dataset,
  113. focusing the accuracy and reproducibility. We then proceed to present and
  114. discuss the results for the German states and Germany.\\
  115. The results of the experiment regarding the synthetic data can be seen
  116. in~\Cref{table:alpha_beta_synth}. The error and the standard variation for both
  117. parameters are negligible small. Taking the mean of the parameters across the
  118. five iterations yields more accurate results.\\
  119. \begin{table}[h]
  120. \begin{center}
  121. \caption{Simulation results for the synthetic data. The true values and
  122. the respective mean parameter is given.}
  123. \label{table:alpha_beta_synth}
  124. \begin{tabular}{ccccccccc}
  125. \toprule
  126. \multicolumn{2}{c}{$\alpha$} & \phantom{0} & \multicolumn{2}{c}{$\beta$} \\
  127. \cmidrule{1-2}\cmidrule{4-5}
  128. true & $\mu$ & \phantom{0} & true & $\mu$ & \phantom{0} & $e_{SIR}$ \\
  129. \midrule
  130. 0.333 & 0.333{\tiny$\pm 0.001$} & \phantom{0} & 0.500 & 0.500{\tiny$\pm 0.002$} & \phantom{0} & 0.004 \\
  131. \bottomrule
  132. \end{tabular}
  133. \end{center}
  134. \end{table}
  135. The results demonstrate that the model is capable of approximating the correct
  136. parameters for the small, synthetic dataset in each of the five iterations.
  137. The mean of the predicted values results in values with a sufficiently small
  138. error. Thus, we argue that our selected method is well suited to analyze real
  139. world pandemic data collected in Germany.\\
  140. In~\Cref{table:state_mean_std} we present the results of the training for the
  141. real-world data. The results are presented from top to bottom, in the order of
  142. the community identification number, with the last entry being Germany. Both
  143. the mean $\mu$ and the standard deviation $\sigma$ are calculated across all
  144. five iterations of our experiment. We can observe that the error $e_{SIR}$ is
  145. the highest for \emph{Saxony} and the lowest for \emph{Lower Saxony}.
  146. Furthermore, we include the distance $\Delta\beta_{\text{Germany}} = \beta_{\text{state}} - \beta_{\text{Germany}}$
  147. and the percentage of people that have a basic immunity through vaccination
  148. $\nu$ for each state provided by the Robert Koch Institute\footnote{{\tiny\url{https://impfdashboard.de/}}}.\\
  149. \begin{table}[h]
  150. \begin{center}
  151. \caption{Mean and standard deviation, error $e_{SIR}$ and the distance
  152. $\Delta\beta_{\text{Germany}} = \beta_{\text{state}} - \beta_{\text{Germany}}$
  153. across the 5 iterations, that we conducted for each German state and Germany
  154. as the whole country. Furthermore we include the vaccination percentage
  155. $\nu$ provided from the RKI.}
  156. \label{table:state_mean_std}
  157. \begin{tabular}{lccccc}
  158. \toprule
  159. state name & $\alpha$ & $\beta$ & $e_{SIR}$ & $\Delta\beta_{\text{Germany}}$ & $\nu$ [\%] \\
  160. \midrule
  161. Schleswig Holstein & 0.076{\tiny$\pm0.001$} & 0.095{\tiny$\pm 0.001$} & 0.085 & -0.013 & 79.5 \\
  162. Hamburg & 0.082{\tiny$\pm0.001$} & 0.104{\tiny$\pm 0.001$} & 0.095 & -0.004 & 84.5 \\
  163. Lower Saxony & 0.075{\tiny$\pm0.002$} & 0.097{\tiny$\pm 0.002$} & 0.077 & -0.011 & 77.6 \\
  164. Bremen & 0.058{\tiny$\pm0.002$} & 0.078{\tiny$\pm 0.002$} & 0.093 & -0.030 & 88.3 \\
  165. NRW & 0.079{\tiny$\pm0.001$} & 0.101{\tiny$\pm 0.001$} & 0.078 & -0.007 & 79.5 \\
  166. Hesse & 0.065{\tiny$\pm0.001$} & 0.085{\tiny$\pm 0.001$} & 0.102 & -0.023 & 75.8 \\
  167. Rhineland-Palatinate & 0.085{\tiny$\pm0.004$} & 0.108{\tiny$\pm 0.004$} & 0.090 & 0.001 & 75.6 \\
  168. Baden-Württemberg & 0.091{\tiny$\pm0.002$} & 0.118{\tiny$\pm 0.003$} & 0.080 & 0.010 & 74.5 \\
  169. Bavaria & 0.085{\tiny$\pm0.004$} & 0.116{\tiny$\pm 0.005$} & 0.095 & 0.008 & 75.1 \\
  170. Saarland & 0.075{\tiny$\pm0.002$} & 0.099{\tiny$\pm 0.003$} & 0.108 & -0.009 & 82.4 \\
  171. Berlin & 0.087{\tiny$\pm0.001$} & 0.109{\tiny$\pm 0.001$} & 0.067 & 0.001 & 78.1 \\
  172. Brandenburg & 0.087{\tiny$\pm0.003$} & 0.110{\tiny$\pm 0.003$} & 0.072 & 0.002 & 68.1 \\
  173. MV & 0.089{\tiny$\pm0.002$} & 0.114{\tiny$\pm 0.002$} & 0.054 & 0.006 & 74.7 \\
  174. Saxony & 0.075{\tiny$\pm0.002$} & 0.099{\tiny$\pm 0.002$} & 0.111 & -0.009 & 65.1 \\
  175. Saxony-Anhalt & 0.092{\tiny$\pm0.003$} & 0.119{\tiny$\pm 0.005$} & 0.079 & 0.011 & 74.1 \\
  176. Thuringia & 0.091{\tiny$\pm0.002$} & 0.119{\tiny$\pm 0.003$} & 0.084 & 0.011 & 70.3 \\
  177. \midrule
  178. Germany & 0.083{\tiny$\pm0.001$} & 0.108{\tiny$\pm 0.002$} & 0.080 & 0.000 & 76.4 \\
  179. \bottomrule
  180. \end{tabular}
  181. \end{center}
  182. \end{table}
  183. \begin{figure}[t]
  184. \centering
  185. \includegraphics[width=\textwidth]{mean_std_alpha_beta_res.pdf}
  186. \caption{Visualization of the mean and standard deviation of the transition
  187. rates $\alpha$ and $\beta$ for each state compared to the mean values of
  188. $\alpha$ and $\beta$ for Germany.}
  189. \label{fig:alpha_beta_mean_std}
  190. \end{figure}
  191. In~\Cref{fig:alpha_beta_mean_std}, we present a visual representation of the
  192. means and standard deviations in comparison to the national values. It is
  193. noteworthy that the states of Saxony-Anhalt and Thuringia have the highest
  194. transmission rates of all states, while Bremen and Hesse have the lowest
  195. values for $\beta$. The transmission rates of Hamburg, Baden Württemberg,
  196. Bavaria, and all eastern states lay above the national rate of transmission.
  197. Similarly, the recovery rate yields comparable outcomes. For the recovery rate,
  198. the same states that exhibit a transmission rate exceeding the national value,
  199. have a higher recovery rate than the national standard, with the exception of
  200. Saxony. It is noteworthy that the recovery rates of all states exhibit a
  201. tendency to align with the recovery rate of $\alpha=\nicefrac{1}{14}$, which is
  202. equivalent to a recovery period of $D=\nicefrac{1}{\alpha}=14$ days. When
  203. calculating the correlation coefficient between the predicted transmission rate
  204. and the vaccination ratio, we get a value of $-0.5134$. The strong negative
  205. correlation indicates that the transmission rate is high when the vaccination
  206. ratio is low, and vice versa. This shows that the impact of the vaccines can be
  207. witnessed in our results. \\
  208. It is evident that there is a correlation between the values of $\alpha$ and
  209. $\beta$ for each state. States with a high transmission rate tend to have a
  210. high recovery rate, and vice versa. The correlation between $\alpha$ and
  211. $\beta$ can be explained by the implicate definition of $\alpha$ using a
  212. recovery queue with a constant recovery period of 14 days. This might result to
  213. the PINN not learning $\alpha$ as a standalone parameter but rather as a
  214. function of the transmission rate $\beta$. This phenomenon occurs because the
  215. transmission rate determines the number of individuals that get infected per
  216. day, and the recovery queue moves a proportional number of people to the
  217. removed compartment. Consequently, a number of people defined by $\beta$ move
  218. to the $R$ compartment 14 days after they were infected.\\
  219. This issue can be addressed by reducing the SIR model, thereby eliminating the
  220. significance of the $R$ compartment size. In the following section, we present
  221. our experiments for the reduced SIR model with time-dependent parameters.
  222. % -------------------------------------------------------------------
  223. \section{Identifying the Reproduction Number}
  224. \label{sec:rsir}
  225. In this section we describe the experiments we conduct to identify the
  226. time-dependent reproduction number for both synthetic and real-world data.
  227. Similar to the previous section, we first describe the setup of our experiments
  228. and afterwards present the results and a discussion. The methods we employ for
  229. the preprocessing are described in~\Cref{sec:preprocessing:rq} and for the PINN,
  230. that we use, are described in~\Cref{sec:pinn:rsir}.
  231. % -------------------------------------------------------------------
  232. \subsection{Setup}
  233. \label{sec:rsir:setup}
  234. \paragraph{Synthetic Data:}For the purposes of validation, we create a synthetic dataset, by setting the parameter
  235. of $\alpha$ and the reproduction value each to a specific values, and solving~\Cref{eq:reduced_sir_ODE}
  236. for a given time interval. As in the synthetic data for the aforementioned
  237. experiments, we set $\alpha=\nicefrac{1}{3}$ and $\Rt$ to the values as can be
  238. seen in~\Cref{fig:Rt_dataset} as well as the population size
  239. $N=\expnumber{7.6}{6}$ and the initial amount of infected people to $I_0=10$.
  240. Furthermore, we set our simulated time span to 150 days. We use this dataset to
  241. demonstrate, that our method is working on a simple and minimal dataset.\\
  242. \paragraph{Real-World Data:}To obtain a dataset of the infectious group, consisting of the
  243. real-world data, we processed the data of the dataset
  244. \emph{COVID-19-Todesfälle in Deutschland}~\cite{GHDead} to extract the number of infections
  245. in Germany as a whole. For the German states, we use the data of \emph{SARS-CoV-2
  246. Infektionen in Deutschland}~\cite{GHInf}. In the preprocessing stage, we employ a constant
  247. rate for $\alpha$ to move individuals into the removed compartment. For each
  248. state we generate two datasets with a different recovery rate. First, we choose
  249. $\alpha = \nicefrac{1}{14}$, which aligns with the time of recovery~\cite{GHInf}.
  250. Second, we use $\alpha=\nicefrac{1}{5}$, as 5 days into the infection is the
  251. point at which the infectiousness is at its peak~\cite{COVInfo}.
  252. As in~\Cref{sec:sir}, we set the population size $N$ of each state and Germany
  253. to the corresponding size at the end of 2019. Furthermore, for the same reason
  254. we restrict the data points to an interval of 1200 days, beginning on March 09.
  255. 2020.\\
  256. \begin{figure}[t]
  257. \centering
  258. \begin{subfigure}{0.3\textwidth}
  259. \centering
  260. \includegraphics[width=\textwidth]{I_synth.pdf}
  261. \end{subfigure}
  262. \quad
  263. \begin{subfigure}{0.3\textwidth}
  264. \centering
  265. \includegraphics[width=\textwidth]{I_synth_r_t.pdf}
  266. \end{subfigure}
  267. \vskip\baselineskip
  268. \begin{subfigure}{0.67\textwidth}
  269. \centering
  270. \includegraphics[width=\textwidth]{datasets_states/Germany_datasets.pdf}
  271. \end{subfigure}
  272. \caption{The upper two graphics show the curve of the size of the
  273. infectious group (left) and the corresponding true reproduction value
  274. $\Rt$ (right) for the synthetic data. The lower graphic exemplary
  275. illustrates the different curves for Germany.}
  276. \label{fig:Rt_dataset}
  277. \end{figure}
  278. \paragraph{Training Parameters:}In order to achieve the desired output, the
  279. selected neural network architecture comprises of four hidden layers, each
  280. containing 100 neurons. The activation function is the tangens hyperbolicus
  281. function. For both the federal state and Germany, the physics loss is weighted
  282. by a factor of $\expnumber{1}{-6}$, whereas the data loss belonging to Germany
  283. is also weighted with a high factor of $\expnumber{1}{4}$, relative to the total
  284. loss. We found this approach to yield the best results. The model is trained
  285. using a base learning rate of $\expnumber{1}{-3}$, with the same scheduler and
  286. optimizer as we describe in~\Cref{sec:sir:setup}. We train the model for the
  287. states 20000 epochs and start the physics training after 10000 epochs, while we
  288. train for Germany for 25000 and start the physics training after 15000 epochs.
  289. To reduce the standard deviation, each experiment is conducted 15 times. For
  290. evaluation, we use the error $e_G$ as we do in the subsequent section.\\
  291. % -------------------------------------------------------------------
  292. \subsection{Results}
  293. \label{sec:rsir:results}
  294. \Cref{fig:synth_results} illustrates the results of our experiments conducted on
  295. the synthetic dataset, which can be seen in~\Cref{fig:Rt_dataset}. It is evident
  296. that the model is capable of learning the infection data across all data points.
  297. The error for this is, $e_I = 0.0016$, which is of a negligible
  298. magnitude.\\
  299. \begin{figure}[h]
  300. \centering
  301. \begin{subfigure}{0.45\textwidth}
  302. \includegraphics[width=\textwidth]{synthetic_I_prediction.pdf}
  303. \end{subfigure}
  304. \quad
  305. \begin{subfigure}{0.45\textwidth}
  306. \includegraphics[width=\textwidth]{synthetic_R_t_statistics.pdf}
  307. \end{subfigure}
  308. \label{fig:synth_results}
  309. \caption{Results for the reproduction rate $\Rt$ on synthetic data. The
  310. left graphic show the prediction of the model regarding the $I$ group. The
  311. right graphic presents the predicted $\Rt$ against the true value, with the
  312. standard deviation.}
  313. \end{figure}
  314. An examination of the predictions for the representation value $\Rt$ reveals
  315. that here as well, the model is capable of accurately delineating the value at
  316. each time point. However, during the first 30 days, the standard deviation is
  317. exhibits an upward trend, while during the final 120 days, the predictions
  318. demonstrate remarkable precision.\\
  319. In~\Cref{fig:state_results}, we present the graphs of $\Rt$ for the state with
  320. the highest value of $\beta$, namely Thuringia, and for the state with the lowest
  321. transmission rate $\beta$, namely Bremen. Further visualizations of the results
  322. can be found in~\Cref{chap:appendix}. In all datasets, the graphs with $\alpha =
  323. \nicefrac{1}{5}$ are of a smaller size than those with
  324. $\alpha = \nicefrac{1}{14}$. This is due to the fact that the individuals are
  325. being moved to the removed compartment at a faster rate. Resulting, it can be
  326. observed that the value of $\Rt$ is constantly remaining closer to the threshold
  327. of $\Rt=1$, while the reproduction number for datasets with $\alpha = \nicefrac{1}{14}$
  328. reaches values of up to 1.6. In states with higher values of $\beta$, the period
  329. during which the value of $\Rt$ is above the threshold of one 1 is longer, but
  330. the peak is lower. In states with a lower transmission rate, the period above 1
  331. is shorter, but the peak value is higher.\\
  332. \begin{figure}[t]
  333. \centering
  334. \begin{subfigure}{0.45\textwidth}
  335. \includegraphics[width=\textwidth]{I_prediction/Thueringen_I_prediction.pdf}
  336. \end{subfigure}
  337. \quad
  338. \begin{subfigure}{0.45\textwidth}
  339. \includegraphics[width=\textwidth]{I_prediction/Bremen_I_prediction.pdf}
  340. \end{subfigure}
  341. \begin{subfigure}{0.45\textwidth}
  342. \includegraphics[width=\textwidth]{R_t/Thueringen_R_t_statistics.pdf}
  343. \end{subfigure}
  344. \quad
  345. \begin{subfigure}{0.45\textwidth}
  346. \includegraphics[width=\textwidth]{R_t/Bremen_R_t_statistics.pdf}
  347. \end{subfigure}
  348. \label{fig:state_results}
  349. \caption{Visualization of the prediction of the training and the graphs of
  350. $\Rt$ for Thuringia (left) and Bremen (right) with both
  351. $\alpha = \nicefrac{1}{14}$ and $\alpha = \nicefrac{1}{5}$. Events~\cite{COVIDChronik} like
  352. the peak of an influential variant or the start of the vaccination of the public are marked horizontally. Further
  353. visualizations can be found in~\Cref{chap:appendix}.}
  354. \end{figure}
  355. \Cref{table:state_error} presents data regarding the discrepancy between the
  356. predicted and actual values from the dataset for compartment $I$. It is evident,
  357. that the error for all experiments falls within a range of values that is not
  358. negligible and will have an influence on the resulting reproduction values that
  359. are learned while fitting the data. A comparison of the results for the various
  360. values of $\alpha$ reveals that the errors associated with $\alpha = \nicefrac{1}{14}$
  361. are consistently smaller, with the exception of Saxony and Germany. This can be
  362. attributed to the differing sizes of infection counts, particularly in relation
  363. to the normalization factor $C$. The model is unable to learn effectively if the
  364. values of the data loss $\mathcal{L}_{\text{data}}$ are too large or too small
  365. at the beginning.\\
  366. \begin{table}[t]
  367. \begin{center}
  368. \caption{This table displays all average values of the error $e_{\text{I}}$
  369. for all German states and Germany. The average is formed across all
  370. 10 iteration.}
  371. \label{table:state_error}
  372. \begin{tabular}{lccccccc}
  373. \toprule
  374. & \multicolumn{2}{c}{$e_I$} & \phantom{0} & \multicolumn{2}{c}{days with $\Rt>1$} & \multicolumn{2}{c}{peak $\Rt$} \\
  375. \cmidrule{2-3}\cmidrule{5-6}\cmidrule{7-8}
  376. state name & $\alpha=\frac{1}{14}$ & $\alpha=\frac{1}{5}$ & \phantom{0} & $\alpha=\frac{1}{14}$ & $\alpha=\frac{1}{5}$ & $\alpha=\frac{1}{14}$ & $\alpha=\frac{1}{5}$ \\
  377. \midrule
  378. Schleswig Holstein & 0.228 & 0.258 & \phantom{0} & 467.5 & 458.5 & 1.475 & 1.166 \\
  379. Hamburg & 0.265 & 0.330 & \phantom{0} & 424.3 & 409.8 & 1.500 & 1.297 \\
  380. Lower Saxony & 0.224 & 0.340 & \phantom{0} & 413.1 & 430.3 & 1.662 & 1.223 \\
  381. Bremen & 0.246 & 0.380 & \phantom{0} & 468.6 & 539.1 & 1.582 & 1.179 \\
  382. NRW & 0.185 & 0.252 & \phantom{0} & 486.3 & 602.0 & 1.573 & 1.205 \\
  383. Hesse & 0.302 & 0.346 & \phantom{0} & 553.0 & 511.2 & 1.409 & 1.157 \\
  384. Rhineland-Palatinate & 0.256 & 0.277 & \phantom{0} & 484.7 & 404.7 & 1.534 & 1.175 \\
  385. Baden-Württemberg & 0.198 & 0.284 & \phantom{0} & 469.2 & 590.0 & 1.457 & 1.180 \\
  386. Bavaria & 0.225 & 0.318 & \phantom{0} & 490.5 & 486.1 & 1.428 & 1.199 \\
  387. Saarland & 0.284 & 0.408 & \phantom{0} & 500.2 & 564.7 & 1.515 & 1.180 \\
  388. Berlin & 0.201 & 0.240 & \phantom{0} & 591.9 & 514.4 & 1.721 & 1.262 \\
  389. Brandenburg & 0.237 & 0.242 & \phantom{0} & 555.9 & 596.3 & 1.447 & 1.159 \\
  390. MV & 0.170 & 0.257 & \phantom{0} & 537.5 & 544.3 & 1.563 & 1.135 \\
  391. Saxony & 0.292 & 0.256 & \phantom{0} & 722.3 & 695.4 & 1.790 & 1.407 \\
  392. Saxony-Anhalt & 0.213 & 0.268 & \phantom{0} & 572.0 & 631.9 & 1.387 & 1.165 \\
  393. Thuringia & 0.180 & 0.222 & \phantom{0} & 732.1 & 730.6 & 1.586 & 1.249 \\
  394. \midrule
  395. Germany & 0.284 & 0.239 & \phantom{0} & 587.7 & 430.7 & 1.561 & 1.219 \\
  396. \bottomrule
  397. \end{tabular}
  398. \end{center}
  399. \end{table}
  400. As illustrated in~\Cref{fig:state_results}, the training data is overlaid with the
  401. corresponding prediction of the model. We can observe that the prediction, though
  402. an exact reconstruction, accurately captures the general trajectory of the
  403. pandemic. The model's prediction demonstrates an ability to capture larger
  404. peaks, exhibiting a tendency to ignore smaller changes. This suggests that the
  405. prediction of the model is capable show the rough outline of the progression of COVID-19. In the
  406. beginning, the majority of predictions below $\Rt=1$, indicating an outbreak.
  407. As we observed in the synthetic data, the model exhibits a higher standard
  408. deviation at the boundaries. In the graphs, we mark the peaks of the most severe
  409. COVID-19 variants in Germany~\cite{COVIDChronik}. While the peaks of the
  410. Alpha and Delta variants are clearly visible in the data, the model does not
  411. learn these, and thus they are not reflected in the results. The peak of the
  412. Omicron variant represents the culmination of the COVID-19 pandemic in Germany
  413. and can be identified as the most prominent peak in the dataset. Immediately preceding this peak, we observe the highest
  414. value of the reproduction number across all states. This phenomenon can be explained, by
  415. number of individuals infected by one infectious person reaching its peak. In
  416. some states the peaks of other Omicron variants after the maximum peak are visible (see Thuringia).\\
  417. The experiments demonstrate, that our model encounters difficulties in learning the data for the
  418. states and Germany and consequently in predicting the reproduction values for each dataset.
  419. Nonetheless, the predictions illustrate the general trends of the most impactful
  420. events of the COVID-19 pandemic.\\
  421. % -------------------------------------------------------------------