chap01-introduction.tex 11 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179
  1. % %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  2. % Author: Phillip Rothenbeck
  3. % Title: Investigating the Evolution of the COVID-19 Pandemic in Germany Using Physics-Informed Neural Networks
  4. % File: chap01-introduction/chap01-introduction.tex
  5. % Part: introduction
  6. % Description:
  7. % summary of the content in this chapter
  8. % Version: 01.01.2012
  9. % %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  10. \chapter{Introduction 5}
  11. \label{chap:introduction}
  12. In the early months of 2020, Germany, like many other countries, was struck by the novel
  13. \emph{Coronavirus Disease} (COVID-19). The pandemic, which originates in
  14. Wuhan, China, had a profound impact on the global community, paralyzing it for
  15. over two years. In response to the pandemic, the German government employed a
  16. multifaceted approach, encompassing the introduction of vaccines and
  17. non-pharmaceutical mitigation policies such as lockdowns. Between mitigation
  18. policies and varying strains of COVID-19, which have exhibited varying degrees
  19. of infectiousness and lethality, Germany had recorded over 38,400,000 infection
  20. cases and 174,000 deaths, as of the end of June in 2023. In light of these
  21. figures the need for an analysis arises.\\
  22. The dynamics of the spread of disease transmission in the real-world are
  23. complex. A multitude of factors influence the course of a disease, and it is
  24. challenging to gain a comprehensive understanding of these factors and develop a
  25. tool that allows for the comparison of disease courses across different diseases
  26. and time points. The common approach in epidemiology to address this is the
  27. utilization of epidemiological models that approximate the dynamics by focusing
  28. on specific factors and modeling these using differential equations and other
  29. mathematical tools for modeling. These models provide transition rates and
  30. parameters that determine the behavior of a disease within the boundaries of the
  31. model. A fundamental epidemiological model, is the \emph{SIR model}, which was
  32. first proposed by Kermack and McKendrick~\cite{1927} in 1927. The SIR model is a
  33. compartmentalized model that divides the entire population into three distinct
  34. compartments. The first compartment is the \emph{susceptible} compartment, $S$,
  35. which contains all individuals of the population who are susceptible to
  36. infection. The second group, is the \emph{infectious} compartment, $I$, which
  37. comprises all individuals currently infected and capable of infecting
  38. susceptible individuals. Lastly, the \emph{removed} compartment, $R$, contains
  39. all individuals, who have succumbed to the disease or recovered from it and are
  40. therefore no longer susceptible to infection. The model is characterized by two
  41. transition rates: the transmission rate $\beta$, which controls the rate of
  42. individuals becoming infected and consequently transitioning from $S$ to $I$;
  43. and the recovery rate $\alpha$, which determines the rate at which individuals
  44. either recover or succumb to the disease, thereby transitioning from $I$ to $R$.
  45. In the context of the SIR model, the values of $\beta$ and $\alpha$ serve to
  46. quantify and determine the course of a pandemic.\\
  47. The transition rates of $\beta$ and $\alpha$ serve to quantify a pandemic across
  48. its entire duration. However, it is important to recognize that a pandemic is
  49. not a static entity; rather, it evolves, and the infectiousness, deadliness and
  50. time to recovery associated with it change with each of its numerous variants.
  51. To address this issue, Liu and Stechlinski, and Setianto and Hidayat~\cite{Liu2012, Setianto2023},
  52. propose an SIR model with time-dependent transition rates $\beta(t)$ and
  53. $\alpha(t)$. From these rates, they derive the time-dependent reproductive
  54. number $\Rt$, which represents the average number of individuals, that are
  55. infected by one infectious person. A high value for $\Rt$ indicates a rapid
  56. spread of the disease, while a low value either suggests either an outbreak or
  57. the disease is declining. This qualifies the time-dependent reproduction number
  58. $\Rt$ as an indicator of the pandemic's progression.\\
  59. The SIR model is defined by a system of differential equations, that incorporate
  60. the transition rates, thereby depicting the fluctuation between the three
  61. compartments. For a given set of data, the transition rate can be identified by
  62. solving the set of differential systems. Recently, the data-driven approach of
  63. \emph{physics-informed neural networks} (PINN) has gained attention due to its
  64. capability of finding solutions to differential equations by fitting its
  65. predictions to both given data and the governing system of differential
  66. equations. By employing this methodology, Shaier \etal~\cite{Shaier2021} were
  67. able to find the transition rate on synthetic data. Additionally, Millevoi
  68. \etal~\cite{Millevoi2023} were able to identify the reproduction number $\Rt$
  69. for both synthetic and Italian COVID-19 data using an approach based on a
  70. reduced version of the SIR model.\\
  71. The Robert Koch Institute has collected incident and death case data from the
  72. beginning of the outbreak in Germany to the present. This data will be utilitzed
  73. in this bachelor thesis to investigate the transition rates and reproduction
  74. number for each German state and the country as a whole, employing the
  75. methodologies proposed by Shaier \etal and Millevoi \etal. Additionally, the
  76. findings will be contextualized and correlated with the events of the real
  77. world.\\
  78. % -------------------------------------------------------------------
  79. \section{Related work 2}
  80. \label{sec:relatedWork}
  81. In \emph{Forecasting Epidemics Through Nonparametric Estimation of
  82. Time-Dependent Transmission Rates Using the SEIR Model}~\cite{Smirnova2017},
  83. Smirnova \etal endeavor to identify a stochastic methodology for estimating the
  84. time-dependent transmission rate $\beta(t)$. This is in response to the
  85. limitations of earlier parametric estimation methods, which are prone
  86. instability due to the difficulty in identifying parameter finding and a low
  87. amount of available data. They achieve this by projecting the time-dependent
  88. transmission rate onto a finite subspace, that is defined by Legendre
  89. polynomials. Subsequently, they compare the three regularization techniques of
  90. variational (Tikhonov’s) regularization, truncated singular value decomposition
  91. (TSVD), and modified TSVD to ascertain the most reliable method for forecasting
  92. with limited data. Their findings indicate that modified TSVD provides the most
  93. stable forecasts on limited data, as demonstrated on both simulated data and
  94. real-world data from the 1918 influenza pandemic and the 2014-2015 Ebola
  95. epidemic.\\
  96. In their publication, entitled \emph{Data-driven approaches for predicting
  97. spread of infectious diseases through DINNs: Disease Informed Neural Networks},
  98. Shaier \etal~\cite{Shaier2021} put forth a data-driven approach for identifying
  99. the parameters of epidemiological models. The authors apply physics-informed
  100. neural networks to the compartmental SIR models, and refer to their method as
  101. disease informed neural networks (DINN). In their work, they demonstrate the
  102. capacity of DINNs to forecast the trajectory of epidemics and pandemics. They
  103. underpin the efficacy of their approach by applying it to 11 diseases, that have
  104. previously been modeled, including examples such as COVID, HIV, Tuberculosis and
  105. Ebola. In their experiments they employ the SIDR (susceptible, infectious, dead,
  106. recovered) model. Finally, they present that this method is a robust and
  107. effective means of identifying the parameters of a SIR model.\\
  108. In their article \emph{A physics-informed neural network to model COVID-19
  109. infection and hospitalization scenarios}, Berkhahn and Ehrhard~\cite{Berkhahn2022}
  110. employ the susceptible, vaccinated, infectious, hospitalized and removed (SVIHR)
  111. model. They solve the system of differential equations inherent to the SVIHR
  112. model by the means of PINNs. The authors utilize a dataset of German COVID-19
  113. data, covering the time span from the inceptions of the outbreak to the end of
  114. 2021. The proposed PINN methodology initially estimates the SVIHR model
  115. parameters and subsequently forecasts the data. For comparative purposes,
  116. Berkhahn and Ehrhard employ the method of non-standard finite differences (NSFD)
  117. as well. In the validation process, the two forecasting methods project the
  118. trajectory of COVID-19 from mid-April onwards. Berkhahn and Ehrhard find that
  119. the PINN is able to adapt to varying vaccination rates and emerging variants.\\
  120. In their work, \emph{Data-Driven Deep-Learning Algorithm for Asymptomatic
  121. COVID-19 Model with Varying Mitigation Measures and Transmission Rate},
  122. Olumoyin \etal~\cite{Olumoyin2021} employ an alternative methodology for
  123. identifying the time-dependent transmission rate of an asymptomatic-SIR model.
  124. On the premise that not all the infectious individuals are reported and included
  125. in the data available. The algorithm they introduce, utilizes the cumulative and
  126. daily reported infection cases and symptomatic recovered cases, to demonstrate
  127. the effect of different mitigation measures and to ascertain the size of the
  128. part of non-symptomatic individuals in the total number of infective individuals
  129. and the proportion of asymptomatic recovered individuals. With this they can
  130. illustrate the influence of vaccination and a set non-pharmaceutical mitigation
  131. methods on the transmission of COVID-19 on data from Italy, South Korea, the
  132. United Kingdom, and the United States.\\
  133. In \emph{A Physics-Informed Neural Network approach for compartmental
  134. epidemiological models} Millevoi \etal~\cite{Millevoi2023} address the issue
  135. of describing the dynamically changing transmission rate, which is influenced by
  136. the emergence of new variants or the implementation of non-pharmaceutical
  137. measures. They employ a PINN to maintain an account of the changes of the
  138. transmission rate included in the reproduction number and to approximate the
  139. model state variables. To this end, Millevoi \etal employ the reproduction
  140. number to reduce the system of differential equations to a single equation and
  141. introduce a reduced-split version of the PINN, which initially trains on the
  142. data and then trains to minimize the residual of the ODE. They test their
  143. approach on five synthetic and two real-world scenarios from the early stages of
  144. the COVID-19 pandemic in Italy. This method yields an increase in both accuracy
  145. and training speed.
  146. % -------------------------------------------------------------------
  147. \section{Overview}
  148. This thesis is comprised of four chapters. \Cref{chap:background}
  149. presents with the theoretical overview of mathematical modeling in epidemiology,
  150. with a particular focus on the SIR model. Subsequently, it shifts its focus to
  151. neural networks, specifically on the background of physics-informed neural
  152. networks (PINN) and their use in solving ordinary differential equations.
  153. In~\Cref{chap:methods} outlines the methodology employed in this thesis. First
  154. we present the data, that was collected by the Robert Koch Institute (RKI). Then
  155. we present the PINN approaches, which are inspired by the work of Shaier \etal
  156. and Millevoi \etal~\cite{Shaier2021,Millevoi2023}.~\Cref{chap:evaluation}
  157. presents the setups and results of the experiments that we conduct. This chapter
  158. is divided into two sections. The first section presents and discusses the
  159. results concerning the transition rates of $\beta$ and $\alpha$. The subsequent
  160. section presents the results concerning the reproduction value $\Rt$. Finally,
  161. in \Cref{chap:conclusions}, we connect our results with the events of the
  162. real-world and give an overview of potential further work.
  163. % -------------------------------------------------------------------