chap01-introduction.tex 11 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166
  1. % %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  2. % Author: Phillip Rothenbeck
  3. % Title: Investigating the Evolution of the COVID-19 Pandemic in Germany Using Physics-Informed Neural Networks
  4. % File: chap01-introduction/chap01-introduction.tex
  5. % Part: introduction
  6. % Description:
  7. % summary of the content in this chapter
  8. % Version: 01.01.2012
  9. % %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  10. \chapter{Introduction}
  11. \label{chap:introduction}
  12. In the early months of 2020, Germany, like many other countries, was struck by
  13. the novel \emph{Coronavirus Disease} (COVID-19)~\cite{WHO}. The pandemic, which
  14. originates in Wuhan, China, had a profound impact on the global community,
  15. paralyzing it for over two years. In response to the pandemic, the German
  16. government employed a multifaceted approach~\cite{RKI}, encompassing the
  17. introduction of vaccines and non-pharmaceutical mitigation policies such as
  18. lockdowns. Between mitigation policies and varying strains of COVID-19, which
  19. have exhibited varying degrees of infectiousness and lethality~\cite{RKIa},
  20. Germany had recorded over 38,400,000 infection cases and 174,000 deaths, by
  21. the end of June in 2023~\cite{SRD}. In light of these figures the need for an
  22. analysis arises.\\
  23. The dynamics of disease transmission in the real-world are complex. A multitude
  24. of factors influence the course of a disease, and it is
  25. challenging to gain a comprehensive understanding of these factors and develop
  26. tools that allow for the comparison of disease courses across different
  27. diseases and time points. The common approach in epidemiology to address this is
  28. the utilization of epidemiological models that approximate the dynamics by
  29. focusing on specific factors and modeling these using mathematical tools. These
  30. models provide epidemiological parameters that determine the behavior of a
  31. disease within the boundaries of the model. A seminal epidemiological model is
  32. the \emph{SIR model}, which was first proposed by Kermack and McKendrick~\cite{1927}
  33. in 1927. The SIR model is a compartmental model that divides the entire
  34. population into three distinct groups: the \emph{susceptible} compartment, $S$;
  35. the \emph{infectious} compartment, $I$; and the \emph{removed} compartment, $R$.
  36. In the context of the SIR model, the constant parameters of the transmission
  37. rate $\beta$ and the recovery rate $\alpha$ serve to quantify and determine the
  38. course of a pandemic. However, a pandemic is not a static entity, therefore Liu
  39. and Stechlinski~\cite{Liu2012}, and Setianto and Hidayat~\cite{Setianto2023}
  40. propose an SIR model with time-dependent epidemiological parameters and
  41. reproduction numbers $\Rt$. The SIR model is defined by a system of differential
  42. equations, that incorporate the parameters $\alpha$ and $\beta$, thereby
  43. depicting the fluctuation between the three compartments. For a given set of
  44. data, the epidemiological parameters can be identified by solving the set of
  45. differential systems. Recently, the data-driven approach of \emph{Physics-Informed Neural Networks}
  46. (PINN) has gained attention due to its capability of finding solutions to
  47. differential equations by fitting its predictions to both given data and the
  48. governing system of differential equations. By employing this methodology,
  49. Shaier \etal~\cite{Shaier2021} were able to find the epidemiological parameters
  50. on data for different diseases. Additionally, Millevoi \etal~\cite{Millevoi2023}
  51. were able to identify the reproduction number $\Rt$ for both synthetic and
  52. Italian COVID-19 data using an approach based on a reduced version of the SIR
  53. model.\\
  54. The objective of this thesis is to identify the epidemiological parameters
  55. $\alpha$ and $\beta$, as well as the reproduction number $\Rt$ of COVID-19 over
  56. the first 1200 days of recorded data in Germany and its federal states. The
  57. Robert Koch Institute (RKI)\footnote{\url{https://www.rki.de/EN/Home/homepage_node.html}} has compiled data on both reported cases and
  58. associated moralities from the beginning of the outbreak in Germany to the
  59. present. We utilize and preprocess this data according to the required format of
  60. our approaches. As the raw data lacks information on recovery incidence, we
  61. introduce the recovery queue that simulates a recovery period. To estimate the
  62. epidemiological parameters we adopt the approach of Shaier
  63. \etal~\cite{Shaier2021}, which utilizes a PINN learning the data, that consists
  64. of time points with their respective sizes of the $S, I$ and $R$ compartments,
  65. to predict the epidemiological parameters based on the data and the governing
  66. system of differential equations. Additionally, we apply the methodology by
  67. Millevoi \etal~\cite{Millevoi2023} to estimate the time-dependent reproduction
  68. number, $\Rt$, over a 1200-day period for each German federal state and Germany
  69. as a whole in the reduced SIR model. Thus needing only the size of the $I$
  70. group for each time step. To validate the effectiveness of these methods, we
  71. first conduct experiments on a small synthetic dataset before applying the
  72. techniques to real-world data. We then analyze the plausibility of our results
  73. by comparing them to real-world events and data such as vaccination ratios of
  74. each region or the peaks of impactful variants. This analysis demonstrates the
  75. relevance of our findings and reveals a correlation between our results and
  76. real-world developments, thus supporting the effectiveness of our approach.\\
  77. % -------------------------------------------------------------------
  78. \section{Related work}
  79. \label{sec:relatedWork}
  80. In this section, we categorize our work into the context of existing literature
  81. on the topic of solving the epidemiological models for real-world data. The
  82. first work, by Smirnova \etal~\cite{Smirnova2017}, endeavors to identify a
  83. stochastic methodology for estimating the time-dependent transmission rate
  84. $\beta(t)$. They achieve this by projecting the time-dependent transmission
  85. rate onto a finite subspace, that is defined by Legendre polynomials.
  86. Subsequently, they compare the three regularization techniques of variational
  87. (Tikhonov's) regularization, truncated singular value decomposition (TSVD), and
  88. modified TSVD to ascertain the most reliable method for forecasting with
  89. limited data. Their findings indicate that modified TSVD provides the most
  90. stable forecasts on, as demonstrated on both simulated data and real-world data
  91. from the 1918 influenza pandemic and the Ebola epidemic. In contrast, we
  92. utilize PINNs to find the constant epidemiological parameters and the
  93. reproduction number for Germany and its states.\\
  94. Some related works similar to our method apply PINN approaches to COVID-19 and
  95. other real-world disease examples~\cite{Shaier2021,Millevoi2023,Berkhahn2022,Olumoyin2021}.
  96. Specifically Shaier \etal~\cite{Shaier2021} put forth a data-driven method
  97. which they refer to as \emph{Disease-Informed Neural Networks} (DINN). In their
  98. work, they demonstrate the capacity of PINNs to forecast the trajectory of
  99. epidemics and pandemics. They underpin the efficacy of their approach by
  100. applying it to 11 diseases, that have previously been modeled. In their
  101. experiments they employ the SIDR (susceptible, infectious, dead, recovered)
  102. model. Finally, they present that this method is a robust and effective means
  103. of identifying the parameters of a SIR model.\\
  104. Similarly Berkhahn and Ehrhard~\cite{Berkhahn2022}, employ the susceptible,
  105. vaccinated, infectious, hospitalized and removed (SVIHR) model. The proposed
  106. PINN methodology initially estimates the SVIHR model parameters for German
  107. COVID-19 data, covering the time span from the inceptions of the outbreak to
  108. the end of 2021. For comparative purposes, Berkhahn and Ehrhard employ the
  109. method of non-standard finite differences (NSFD) as well. The authors utilize
  110. both forecasting methods to project the trajectory of COVID-19 from mid-April
  111. 2023 onwards. Berkhahn and Ehrhard find that PINNs are able to adapt to varying
  112. vaccination rates and emerging variants.\\
  113. Furthermore, Olumoyin \etal~\cite{Olumoyin2021} employ an alternative
  114. methodology for identifying the time-dependent transmission rate of an
  115. asymptomatic-SIR model accounting for unreported infectious cases. The PINN
  116. approach they introduce, utilizes the cumulative and daily reported infection
  117. cases and symptomatic recovered cases, to demonstrate the effect of different
  118. mitigation measures and to ascertain the proportion of non-symptomatic
  119. individuals and asymptomatic recovered individuals. With this they can
  120. illustrate the influence of vaccinations and a set non-pharmaceutical
  121. mitigation methods on the transmission of COVID-19 on data from Italy, South
  122. Korea, the United Kingdom, and the United States.\\
  123. Finally, Millevoi \etal~\cite{Millevoi2023} address the issue of the changes in
  124. the transmission rate due to the dynamics of a pandemic. The authors employ the
  125. reproduction number $\Rt$ to reduce the system of differential equations to a
  126. single equation and introduce a reduced-split version of the PINN, which
  127. initially trains on the data and then trains to minimize the residual of the
  128. ordinary differential equation. They test their approach on five synthetic and
  129. two real-world scenarios from the early stages of the COVID-19 pandemic in
  130. Italy. This method yields an increase in both accuracy and training speed. In
  131. contrast, to these works, we estimate the epidemiological of $\alpha$ and
  132. $\beta$ and the reproduction number $\Rt$ for Germany for the entirety of the
  133. span from early March in 2020 to late June in 2023.
  134. % -------------------------------------------------------------------
  135. \section{Overview}
  136. This thesis is comprised of four chapters. \Cref{chap:background}
  137. starts with the theoretical overview of mathematical modeling in epidemiology,
  138. with a particular focus on the SIR model. Subsequently, it shifts its focus to
  139. neural networks, specifically on the background of PINNs and their use in
  140. solving ordinary differential equations.~\Cref{chap:methods} outlines the
  141. methodology employed in this thesis. First, we present the data, that was
  142. collected by the RKI and our preprocessing. Then, we present the PINN
  143. approaches, which are inspired by the work of Shaier \etal~\cite{Shaier2021}
  144. and Millevoi \etal~\cite{Millevoi2023}.~\Cref{chap:evaluation} provides the
  145. setups and results of the experiments that we conduct. This chapter is divided
  146. into two sections. The first section shows and discusses the results concerning
  147. the epidemiological parameters of $\alpha$ and $\beta$. The subsequent section
  148. presents the results concerning the reproduction value $\Rt$. Finally, in
  149. \Cref{chap:conclusions}, give a conclusion of our work and provide an overview
  150. of potential further work.
  151. % -------------------------------------------------------------------