пре 1 година · 772c7704eb
--- a/chapters/chap02/chap02.tex
+++ b/chapters/chap02/chap02.tex
@@ -535,7 +535,7 @@ In contrast to standard MLP's, the loss term of a PINN comprises two
 
				 components. The first term incorporates the aforementioned prior knowledge to pertinent the problem. As Raissi
			
 
				 \etal~\cite{Raissi2017} propose, the residual of each differential equation in
			
 
				 the system must be minimized in order for the model to optimize its output in accordance with the theory.
			
 
				-We obtain the residual $R_i$, with $i\in\{1, ...,N_d\}$, by rearranging the differential equation and
			
 
				+We obtain the residual $r_i$, with $i\in\{1, ...,N_d\}$, by rearranging the differential equation and
			
 
				 calculating the difference between the left-hand side and the right-hand side
			
 
				 of the equation. $N_d$ is the number of differential equations in a system. As
			
 
				 Raissi \etal~\cite{Raissi2017} propose the \emph{physics
			
@@ -616,10 +616,10 @@ Tenenbaum and Morris provide, there are three potential solutions to this
 
				 issue. However only the \emph{underdamped case} results in an oscillating
			
 
				 movement of the body, as illustrated in~\Cref{fig:spring}. In order to apply a
			
 
				 PINN to this problem, we require a set of training data $x$. This consists of
			
 
				-pairs of timepoints and corresponding displacement measurements
			
 
				+pairs of time points and corresponding displacement measurements
			
 
				 $(t^{(i)}, u^{(i)})$, where $i\in\{1, ..., N_t\}$. In this hypothetical case,
			
 
				 we know the mass $m=1kg$, and the spring constant $k=200\frac{N}{m}$ and the
			
 
				-initial displacement $u^{(1)} = 1$ and $\frac{du(0)}{dt} = 0$, However, we do
			
 
				+initial displacement $u^{(1)} = 1$ and $\frac{du(0)}{dt} = 0$. However, we do
			
 
				 not know the value of the friction $\mu$. In this case the loss function,
			
 
				 \begin{equation}
			
 
				   \mathcal{L}_{osc}(\boldsymbol{x}, \boldsymbol{u}, \hat{\boldsymbol{u}}) = (u^{(1)}-1)+\frac{du(0)}{dt}+||m\frac{d^2u}{dx^2}+\mu\frac{du}{dx}+ku||^2 + \frac{1}{N_t}\sum_{i=1}^{N_t} ||\hat{\boldsymbol{u}}^{(i)}-\boldsymbol{u}^{(i)}||^2,
			
@@ -631,5 +631,45 @@ parameter and the observation loss.
 
				 
			
 
				 \subsection{Disease Informed Neural Networks   2}
			
 
				 \label{sec:pinn:dinn}
			
 
				-
			
 
				+In this section, we describe the capability of MLP's to solve systems of
			
 
				+differential equations. In~\Cref{sec:pandemicModel:sir}, we describe the SIR
			
 
				+model, which models the relations of susceptible, infectious and removed
			
 
				+individuals and simulates the progress of a disease in a population with a
			
 
				+constant size. A system of differential equations models these relations. Shaier
			
 
				+\etal~\cite{Shaier2021} propose a method to solve the equations of the SIR model
			
 
				+using a PINN, which they call a \emph{disease-informed neural network} (DINN).\\
			
 
				+
			
 
				+To solve~\Cref{eq:sir} we need to find the transmission rate $\beta$ and the
			
 
				+recovery rate $\alpha$. As Shaier \etal~\cite{Shaier2021} point out, there are
			
 
				+different approaches to solve this set of equations. For instance, building on
			
 
				+the assumption, that at the beginning one infected individual infects $-n$ other
			
 
				+people, concluding in $\frac{dS(0)}{dt} = -n$. Then,
			
 
				+\begin{equation}
			
 
				+  \beta=-\frac{\frac{dS}{dt}}{S_0I_0}
			
 
				+\end{equation}
			
 
				+would calculate the initial transmission rate using the initial size of the
			
 
				+susceptible group $S_0$ and the infectious group $I_0$. The recovery rate, then
			
 
				+could be defined using the amount of days a person between the point of
			
 
				+infection and the start of isolation $d$, $\alpha = \frac{1}{d}$. The analytical
			
 
				+solutions to the SIR models often use heuristic methods and require knowledge
			
 
				+like the sizes $S_0$ and $I_0$. A data-driven approach such as the one that
			
 
				+Shaier \etal~\cite{Shaier2021} propose does not have these problems. Since the
			
 
				+model learns the parameters $\beta$ and $\alpha$ while learning the training
			
 
				+data consisting of the time points $\boldsymbol{t}$,  and the corresponding
			
 
				+measured sizes of the groups $\boldsymbol{S}, \boldsymbol{I}, \boldsymbol{R}$.
			
 
				+Let $\hat{\boldsymbol{S}}, \hat{\boldsymbol{I}}, \hat{\boldsymbol{R}}$ be the
			
 
				+model predictions of the groups and
			
 
				+$r_S=\frac{d\hat{\boldsymbol{S}}}{dt}+\beta \hat{\boldsymbol{S}}\hat{\boldsymbol{I}},
			
 
				+  r_I=\frac{d\hat{\boldsymbol{I}}}{dt}-\beta \hat{\boldsymbol{S}}\hat{\boldsymbol{I}}+\alpha \hat{\boldsymbol{I}}$
			
 
				+and $r_R=\frac{d \hat{\boldsymbol{R}}}{dt} - \alpha \hat{\boldsymbol{I}}$ the
			
 
				+residuals of each differential equation using the model predictions. Then,
			
 
				+\begin{equation}
			
 
				+  \begin{split}
			
 
				+    \mathcal{L}_{SIR}() = ||r_S||^2 + ||r_I||^2 + ||r_R||^2 + \frac{1}{N_t}\sum_{i=1}^{N_t} ||\hat{\boldsymbol{S}}^{(i)}-\boldsymbol{S}^{(i)}||^2 &+\\
			
 
				+    ||\hat{\boldsymbol{I}}^{(i)}-\boldsymbol{I}^{(i)}||^2 &+\\
			
 
				+    ||\hat{\boldsymbol{R}}^{(i)}-\boldsymbol{R}^{(i)}||^2 &,
			
 
				+  \end{split}
			
 
				+\end{equation}
			
 
				+is the loss function of a DINN, with $\alpha$ and $beta$ being learnable
			
 
				+parameters.
			
 
				 % -------------------------------------------------------------------
			
--- a/thesis.bbl
+++ b/thesis.bbl
@@ -88,6 +88,12 @@
 
				 \newblock \emph{Analysis}.
			
 
				 \newblock Oldenbourg Wissenschaftsverlag GmbH, 2007
			
 
				 
			
 
				+\bibitem[SRS21]{Shaier2021}
			
 
				+\textsc{Shaier}, Sagi ; \textsc{Raissi}, Maziar  ; \textsc{Seshaiyer},
			
 
				+  Padmanabhan:
			
 
				+\newblock \emph{Data-driven approaches for predicting spread of infectious
			
 
				+  diseases through DINNs: Disease Informed Neural Networks}
			
 
				+
			
 
				 \bibitem[TP85]{Tenenbaum1985}
			
 
				 \textsc{Tenenbaum}, Morris ; \textsc{Pollard}, Harry:
			
 
				 \newblock \emph{Ordinary Differential Equations}.
			
--- a/thesis.pdf
+++ b/thesis.pdf