chap01-introduction.tex 11 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166
  1. % %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  2. % Author: Phillip Rothenbeck
  3. % Title: Investigating the Evolution of the COVID-19 Pandemic in Germany Using Physics-Informed Neural Networks
  4. % File: chap01-introduction/chap01-introduction.tex
  5. % Part: introduction
  6. % Description:
  7. % summary of the content in this chapter
  8. % Version: 01.01.2012
  9. % %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  10. \chapter{Introduction}
  11. \label{chap:introduction}
  12. In the early months of 2020, Germany, like many other countries, was struck by
  13. the novel \emph{Coronavirus Disease} (COVID-19)~\cite{WHO}. The pandemic, which
  14. originates in Wuhan, China, had a profound impact on the global community,
  15. paralyzing it for over two years. In response to the pandemic, the German
  16. government employed a multifaceted approach~\cite{RKI}, encompassing the
  17. introduction of vaccines and non-pharmaceutical mitigation policies such as
  18. lockdowns. Between mitigation policies and varying strains of COVID-19, which
  19. have exhibited varying degrees of infectiousness and lethality~\cite{RKIa},
  20. Germany had recorded over 38,400,000 infection cases and 174,000 deaths, as of
  21. the end of June in 2023~\cite{SRD}. In light of these figures the need for an
  22. analysis arises.\\
  23. The dynamics of the spread of disease transmission in the real-world are
  24. complex. A multitude of factors influence the course of a disease, and it is
  25. challenging to gain a comprehensive understanding of these factors and develop
  26. tools that allows for the comparison of disease courses across different
  27. diseases and time points. The common approach in epidemiology to address this is
  28. the utilization of epidemiological models that approximate the dynamics by
  29. focusing on specific factors and modeling these using mathematical tools. These
  30. models provide epidemiological parameters that determine the behavior of a
  31. disease within the boundaries of the model. A seminal epidemiological model is
  32. the \emph{SIR model}, which was first proposed by Kermack and McKendrick~\cite{1927}
  33. in 1927. The SIR model is a compartmentalized model that divides the entire
  34. population into three distinct groups: the \emph{susceptible} compartment, $S$;
  35. the \emph{infectious} compartment, $I$; and the \emph{removed} compartment, $R$.
  36. In the context of the SIR model, the constant parameters of the transmission
  37. rate $\beta$ and the recovery rate $\alpha$ serve to quantify and determine the
  38. course of a pandemic. However, a pandemic is not a static entity, therefore Liu
  39. and Stechlinski~\cite{Liu2012}, and Setianto and Hidayat~\cite{Setianto2023}
  40. propose an SIR model with time-dependent epidemiological parameters and
  41. reproduction number $\Rt$. The SIR model is defined by a system of differential
  42. equations, that incorporate the parameters $\alpha$ and $\beta$, thereby
  43. depicting the fluctuation between the three compartments. For a given set of
  44. data, the epidemiological parameters can be identified by solving the set of
  45. differential systems. Recently, the data-driven approach of \emph{Physics-Informed Neural Networks}
  46. (PINN) has gained attention due to its capability of finding solutions to
  47. differential equations by fitting its predictions to both given data and the
  48. governing system of differential equations. By employing this methodology,
  49. Shaier \etal~\cite{Shaier2021} were able to find the epidemiological parameters
  50. on data for different diseases. Additionally, Millevoi \etal~\cite{Millevoi2023}
  51. were able to identify the reproduction number $\Rt$ for both synthetic and
  52. Italian COVID-19 data using an approach based on a reduced version of the SIR
  53. model.\\
  54. The objective of this thesis is to identify the epidemiological parameters
  55. $\beta$ and $\alpha$, as well as the reproduction number $\Rt$ of COVID-19 over
  56. the first 1200 days of recorded data in Germany and its federal states. The
  57. Robert Koch Institute (RKI) has compiled data on both reported cases and
  58. associated moralities from the beginning of the outbreak in Germany to the
  59. present. We utilize and preprocess this data according to the required format of
  60. our approaches. As the raw data lacks information on recovery incidence, we
  61. introduce the recovery queue that simulates a recovery period. To estimate the
  62. epidemiological parameters we adopt the approach of Shaier
  63. \etal~\cite{Shaier2021}, which utilizes a PINN learning the data, which consists
  64. of time points with their respective sizes of the $S, I$ and $R$ compartments,
  65. to predict the epidemiological parameters based on the data and the governing
  66. system of differential equations. Moreover, we utilize the methodology proposed
  67. by Millevoi \etal~\cite{Millevoi2023} that estimates the reproduction number for
  68. each day across the 1200-day span for each German state and Germany as a whole,
  69. in the reduced SIR model. Thus needing only the size of the $I$ group for each
  70. time step. To validate the effectiveness of these methods, we first conduct
  71. experiments on a small synthetic dataset before applying the techniques to
  72. real-world data. We then analyze the plausibility of our results by comparing
  73. them to real-world events and data such as vaccination ratios of each region or
  74. the peaks of impactful variants to demonstrate the relevance of these numbers.
  75. This analysis demonstrates the relevance of our findings and reveals a
  76. correlation between our results and real-world developments, thus supporting the
  77. effectiveness of our approach.\\
  78. % -------------------------------------------------------------------
  79. \section{Related work}
  80. \label{sec:relatedWork}
  81. In this section, we categorize our work into the context of existing literature
  82. on the topic of solving the epidemiological models for real-world data. The
  83. first work, by Smirnova \etal~\cite{Smirnova2017}, endeavors to identify a
  84. stochastic methodology for estimating the time-dependent transmission rate
  85. $\beta(t)$. They achieve this by projecting the time-dependent transmission rate
  86. onto a finite subspace, that is defined by Legendre polynomials. Subsequently,
  87. they compare the three regularization techniques of variational (Tikhonov's)
  88. regularization, truncated singular value decomposition (TSVD), and modified TSVD
  89. to ascertain the most reliable method for forecasting with limited data. Their
  90. findings indicate that modified TSVD provides the most stable forecasts on
  91. limited data, as demonstrated on both simulated data and real-world data from
  92. the 1918 influenza pandemic and the Ebola epidemic. In contrast, we
  93. utilize PINNs to find the constant epidemiological parameters
  94. and the reproduction number for Germany and its states.\\
  95. Some related works similar to our approach apply PINN approaches to COVID-19 and
  96. other real-world disease examples~\cite{Shaier2021,Millevoi2023,Berkhahn2022,Olumoyin2021}.
  97. Specifically Shaier \etal~\cite{Shaier2021} put forth a data-driven approach
  98. which they refer to as \emph{Disease-Informed Neural Networks} (DINN). In their
  99. work, they demonstrate the capacity of DINNs to forecast the trajectory of
  100. epidemics and pandemics. They underpin the efficacy of their approach by
  101. applying it to 11 diseases, that have previously been modeled. In their
  102. experiments they employ the SIDR (susceptible, infectious, dead, recovered)
  103. model. Finally, they present that this method is a robust and effective means of
  104. identifying the parameters of a SIR model.\\
  105. Similarly Berkhahn and Ehrhard~\cite{Berkhahn2022}, employ the susceptible,
  106. vaccinated, infectious, hospitalized and removed (SVIHR) model. The proposed
  107. PINN methodology initially estimates the SVIHR model parameters for German
  108. COVID-19 data, covering the time span from the inceptions of the outbreak to the
  109. end of 2021. For comparative purposes, Berkhahn and Ehrhard employ the method of
  110. non-standard finite differences (NSFD) as well. The authors employ both
  111. forecasting methods project the trajectory of COVID-19 from mid-April 2023
  112. onwards. Berkhahn and Ehrhard find that the PINN is able to adapt to varying
  113. vaccination rates and emerging variants.\\
  114. Furthermore, Olumoyin \etal~\cite{Olumoyin2021} employ an alternative
  115. methodology for identifying the time-dependent transmission rate of an
  116. asymptomatic-SIR model accounting for unreported infectious cases. The PINN
  117. approach they introduce, utilizes the cumulative and daily reported infection
  118. cases and symptomatic recovered cases, to demonstrate the effect of different
  119. mitigation measures and to ascertain the proportion of non-symptomatic
  120. individuals and asymptomatic recovered individuals. With this they can
  121. illustrate the influence of vaccination and a set non-pharmaceutical mitigation
  122. methods on the transmission of COVID-19 on data from Italy, South Korea, the
  123. United Kingdom, and the United States.\\
  124. Finally, Millevoi \etal~\cite{Millevoi2023} address the issue of the changes in
  125. the transmission rate due to the dynamics of a pandemic. The authors employ the
  126. reproduction number to reduce the system of differential equations to a single
  127. equation and introduce a reduced-split version of the PINN, which initially
  128. trains on the data and then trains to minimize the residual of the ordinary
  129. differential equation. They test their approach on five synthetic and two
  130. real-world scenarios from the early stages of the COVID-19 pandemic in Italy.
  131. This method yields an increase in both accuracy and training speed. In contrast,
  132. to these works, we estimate the rates and the reproduction number for Germany
  133. for the entirety of the span from early March in 2020 to late June in 2023.
  134. % -------------------------------------------------------------------
  135. \section{Overview}
  136. This thesis is comprised of four chapters. \Cref{chap:background}
  137. presents with the theoretical overview of mathematical modeling in epidemiology,
  138. with a particular focus on the SIR model. Subsequently, it shifts its focus to
  139. neural networks, specifically on the background of PINNs and their use in
  140. solving ordinary differential equations.~\Cref{chap:methods} outlines the
  141. methodology employed in this thesis. First we present the data, that was
  142. collected by the RKI. Then we present the PINN approaches, which are inspired by
  143. the work of Shaier \etal~\cite{Shaier2021} and Millevoi
  144. \etal~\cite{Millevoi2023}.~\Cref{chap:evaluation} presents the setups and
  145. results of the experiments that we conduct. This chapter is divided into two
  146. sections. The first section presents and discusses the results concerning the
  147. epidemiological parameters of $\beta$ and $\alpha$. The subsequent section
  148. presents the results concerning the reproduction value $\Rt$. Finally, in
  149. \Cref{chap:conclusions}, we connect our results with the events of the
  150. real-world and give an overview of potential further work.
  151. % -------------------------------------------------------------------