chap01-introduction.tex 11 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164
  1. % %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  2. % Author: Phillip Rothenbeck
  3. % Title: Investigating the Evolution of the COVID-19 Pandemic in Germany Using Physics-Informed Neural Networks
  4. % File: chap01-introduction/chap01-introduction.tex
  5. % Part: introduction
  6. % Description:
  7. % summary of the content in this chapter
  8. % Version: 01.01.2012
  9. % %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  10. \chapter{Introduction}
  11. \label{chap:introduction}
  12. In the early months of 2020, Germany, like many other countries, was struck by the novel
  13. \emph{Coronavirus Disease} (COVID-19)~\cite{WHO}. The pandemic, which originates in
  14. Wuhan, China, had a profound impact on the global community, paralyzing it for
  15. over two years. In response to the pandemic, the German government employed a
  16. multifaceted approach~\cite{RKI}, encompassing the introduction of vaccines and
  17. non-pharmaceutical mitigation policies such as lockdowns. Between mitigation
  18. policies and varying strains of COVID-19, which have exhibited varying degrees
  19. of infectiousness and lethality~\cite{RKIa}, Germany had recorded over 38,400,000 infection
  20. cases and 174,000 deaths, as of the end of June in 2023~\cite{SRD}. In light of these
  21. figures the need for an analysis arises.\\
  22. The dynamics of the spread of disease transmission in the real-world are
  23. complex. A multitude of factors influence the course of a disease, and it is
  24. challenging to gain a comprehensive understanding of these factors and develop
  25. tools that allows for the comparison of disease courses across different diseases
  26. and time points. The common approach in epidemiology to address this is the
  27. utilization of epidemiological models that approximate the dynamics by focusing
  28. on specific factors and modeling these using mathematical tools. These models
  29. provide transition rates and parameters that determine the behavior of a disease
  30. within the boundaries of the model. A fundamental epidemiological model, is the
  31. \emph{SIR model}, which was first proposed by Kermack and McKendrick~\cite{1927}
  32. in 1927. The SIR model is a compartmentalized model that divides the entire
  33. population into three distinct groups: the \emph{susceptible} compartment, $S$; the
  34. \emph{infectious} compartment, $I$; and the \emph{removed} compartment, $R$.
  35. In the context of the SIR model, the constant parameters of the transmission
  36. rate $\beta$ and the recovery rate $\alpha$ serve to quantify and determine the
  37. course of a pandemic. However, pandemic is not a static entity, therefor, Liu
  38. and Stechlinski~\cite{Liu2012}, and Setianto and Hidayat~\cite{Setianto2023},
  39. propose an SIR model with time-dependent transition rates and reproduction number $\Rt$. The SIR model
  40. is defined by a system of differential equations, that incorporate
  41. the transition rates, thereby depicting the fluctuation between the three
  42. compartments. For a given set of data, the transition rate can be identified by
  43. solving the set of differential systems. Recently, the data-driven approach of
  44. \emph{physics-informed neural networks} (PINN) has gained attention due to its
  45. capability of finding solutions to differential equations by fitting its
  46. predictions to both given data and the governing system of differential
  47. equations. By employing this methodology, Shaier \etal~\cite{Shaier2021} were
  48. able to find the transition rate on data for different diseases. Additionally,
  49. Millevoi \etal~\cite{Millevoi2023} were able to identify the reproduction number
  50. $\Rt$ for both synthetic and Italian COVID-19 data using an approach based on a
  51. reduced version of the SIR model.\\
  52. The objective of this thesis is to identify the transition rates $\beta$ and
  53. $alpha$, as well as the reproduction number $\Rt$ of COVID-19 over the first
  54. 1200 days of recorded data in Germany and its federal states. The Robert Koch
  55. Institute (RKI) has compiled data on both reported cases and associated
  56. moralities from the beginning of the outbreak in Germany to the present. We
  57. utilize and preprocess this data according to the required format of our
  58. approaches. As the raw data lacks information on recovery incidence, we
  59. introduce the recovery queue that simulates a recovery period. To estimate the
  60. transition rates we adopt the approach of Shaier \etal~\cite{Shaier2021}, which
  61. utilizes a physics-informed neural network learning the data, which consists of
  62. time point with their respective sizes of the $S, I$ and $R$ compartments, to
  63. predict the transition rates based on the data and the governing system of
  64. differential equations. Moreover, we utilize the methodology proposed by
  65. Millevoi \etal~\cite{Millevoi2023} that estimates the reproduction number for
  66. each day across the 1200-day span for each German state and Germany as a whole,
  67. in reduced SIR model. Thus needing only the size of the $I$ group for each time
  68. step. To validate the effectiveness of these methods, we first conduct
  69. experiments on a small synthetic dataset before applying the techniques to
  70. real-world data. We then analyze the plausibility of our results by comparing
  71. them to real-world events and data such as vaccination ratios of each region or
  72. the peaks of impactful variants to demonstrate the relevance of these numbers.
  73. This analysis demonstrates the relevance of our findings and reveals a
  74. correlation between our results and real-world developments, thus supporting the
  75. effectiveness of our approach.\\
  76. % -------------------------------------------------------------------
  77. \section{Related work}
  78. \label{sec:relatedWork}
  79. In this section, we categorize our work into the context of existing literature
  80. on the topic of solving the epidemiological models for real-world data. The
  81. first work, by Smirnova \etal~\cite{Smirnova2017}, endeavors to identify a
  82. stochastic methodology for estimating the time-dependent transmission rate
  83. $\beta(t)$. They achieve this by projecting the time-dependent transmission rate
  84. onto a finite subspace, that is defined by Legendre polynomials. Subsequently,
  85. they compare the three regularization techniques of variational (Tikhonov’s)
  86. regularization, truncated singular value decomposition (TSVD), and modified TSVD
  87. to ascertain the most reliable method for forecasting with limited data. Their
  88. findings indicate that modified TSVD provides the most stable forecasts on
  89. limited data, as demonstrated on both simulated data and real-world data from
  90. the 1918 influenza pandemic and the Ebola epidemic. In contrast, we
  91. utilize physics-informed neural networks (PINN) to find the constant transition rates
  92. and the reproduction number for Germany and its states\\
  93. Some related works similarly to us apply PINN approaches to COVID-19 and other
  94. real-world disease data such as~\cite{Shaier2021,Berkhahn2022,Olumoyin2021,Millevoi2023}.
  95. Specifically in~\cite{Shaier2021}, Shaier \etal put forth a data-driven
  96. approach which they refer to as disease informed neural networks (DINN). In their work,
  97. they demonstrate the capacity of DINNs to forecast the trajectory of epidemics
  98. and pandemics. They underpin the efficacy of their approach by applying it to 11
  99. diseases, that have previously been modeled. In their experiments they employ
  100. the SIDR (susceptible, infectious, dead, recovered) model. Finally, they present
  101. that this method is a robust and effective means of identifying the parameters
  102. of a SIR model.\\
  103. Similarly in~\cite{Berkhahn2022}, Berkhahn and Ehrhard employ the susceptible,
  104. vaccinated, infectious, hospitalized and removed (SVIHR) model. The proposed
  105. PINN methodology initially estimates the SVIHR model parameters for German
  106. COVID-19 data, covering the time span from the inceptions of the outbreak to the
  107. end of 2021. For comparative purposes, Berkhahn and Ehrhard employ the method of
  108. non-standard finite differences (NSFD) as well. The authors employ both methods
  109. the two forecasting methods project the trajectory of COVID-19 from mid-April
  110. 2023 onwards. Berkhahn and Ehrhard find that the PINN is able to adapt to
  111. varying vaccination rates and emerging variants.\\
  112. Furthermore, Olumoyin \etal~\cite{Olumoyin2021} employ an alternative
  113. methodology for identifying the time-dependent transmission rate of an
  114. asymptomatic-SIR model accounting for unreported infectious cases. The PINN
  115. approach they introduce, utilizes the cumulative and daily reported infection
  116. cases and symptomatic recovered cases, to demonstrate the effect of different
  117. mitigation measures and to ascertain the proportion of non-symptomatic
  118. individuals and asymptomatic recovered individuals. With this they can
  119. illustrate the influence of vaccination and a set non-pharmaceutical mitigation
  120. methods on the transmission of COVID-19 on data from Italy, South Korea, the
  121. United Kingdom, and the United States.\\
  122. Finally, Millevoi \etal~\cite{Millevoi2023} address the issue of the changes in
  123. the transmission rate due to the dynamics of a pandemic. The authors employ the
  124. reproduction number to reduce the system of differential equations to a single
  125. equation and introduce a reduced-split version of the PINN, which initially
  126. trains on the data and then trains to minimize the residual of the ODE. They
  127. test their approach on five synthetic and two real-world scenarios from the
  128. early stages of the COVID-19 pandemic in Italy. This method yields an increase
  129. in both accuracy and training speed. In contrast, to these works, we estimate
  130. the rates and the reproduction number for Germany for the entirety of the span
  131. from early March in 2020 to late June in 2023.
  132. % -------------------------------------------------------------------
  133. \section{Overview}
  134. This thesis is comprised of four chapters. \Cref{chap:background}
  135. presents with the theoretical overview of mathematical modeling in epidemiology,
  136. with a particular focus on the SIR model. Subsequently, it shifts its focus to
  137. neural networks, specifically on the background of physics-informed neural
  138. networks (PINN) and their use in solving ordinary differential equations.~\Cref{chap:methods}
  139. outlines the methodology employed in this thesis. First
  140. we present the data, that was collected by the Robert Koch Institute (RKI). Then
  141. we present the PINN approaches, which are inspired by the work of Shaier \etal~\cite{Shaier2021}
  142. and Millevoi \etal~\cite{Millevoi2023}.~\Cref{chap:evaluation}
  143. presents the setups and results of the experiments that we conduct. This chapter
  144. is divided into two sections. The first section presents and discusses the
  145. results concerning the transition rates of $\beta$ and $\alpha$. The subsequent
  146. section presents the results concerning the reproduction value $\Rt$. Finally,
  147. in \Cref{chap:conclusions}, we connect our results with the events of the
  148. real-world and give an overview of potential further work.
  149. % -------------------------------------------------------------------