
Detailed explanation of the Extended Physics-Informed Neural Networks (XPINN) paper

2022-07-26 02:44:00 Pinn Shanliwa

Author

  • Ameya D. Jagtap and George Em Karniadakis

Journal

  • Communications in Computational Physics

Date

  • 2020

Code

1 Abstract

The paper proposes XPINN, a domain-decomposition method whose decomposition is more flexible than cPINN's and which is applicable to all types of equations.

2 Background

cPINN uses domain decomposition so that each region is trained with its own small network and different regions can be computed in parallel. XPINN, proposed in this paper, retains cPINN's domain-decomposition advantages and adds the following:

  • Generalized space-time domain decomposition: the XPINN formulation supports highly irregular, convex or non-convex space-time domain decompositions.
  • Simple interface conditions: in XPINN the interface conditions are simple for interfaces of any shape and require no normal directions, so the method extends easily to arbitrarily complex, even higher-dimensional, geometries.

Accurately solving complex equations, in particular high-dimensional ones, has become one of the biggest challenges in scientific computing. These advantages make XPINN a candidate for such high-dimensional complex simulations, which usually incur a large training cost.

3 XPINN Method

Terminology:

  • Subdomains: the subdomains $\Omega_q,\ q=1,2,\cdots,N_{sd}$ are non-overlapping subdomains of the whole computational domain $\Omega$, satisfying $\Omega=\bigcup_{q=1}^{N_{sd}}\Omega_q$ and $\Omega_i\cap\Omega_j=\partial\Omega_{ij},\ i\neq j$; $N_{sd}$ is the number of subdomains, and subdomains intersect only along the interface $\partial\Omega_{ij}$.
  • Interface: the common boundary of two or more subdomains, through which the corresponding sub-networks (sub-Nets) communicate with each other.
  • sub-Net: the individual PINN used in each subdomain, with its own set of optimized hyperparameters.
  • Interface conditions: conditions used to stitch the decomposed subdomains together, so that the solution of the governing PDE over the full domain is obtained. Depending on the nature of the governing equation, one or more interface conditions (e.g., solution continuity or flux continuity) can be imposed on a common interface.

![image.png](https://img-blog.csdnimg.cn/img_convert/4fdc725110b157da81451d08473b009f.png)
In the figure above, X is the solution domain; solid black lines mark subdomain boundaries and dashed black lines mark the interfaces. XPINN's basic interface conditions are a strong-form continuity condition and enforcing, on each common interface, the average of the solutions given by the different sub-nets. The cPINN paper noted that the averaged-solution condition is not needed for stability, but experiments show that it accelerates convergence. XPINN inherits all the advantages of cPINN, such as parallelization, large representation capacity, and efficient per-subdomain selection of hyperparameters (optimization method, activation function, network depth or width). Unlike cPINN, XPINN can be used to solve any type of PDE, not only conservation laws. In the XPINN setting there is no need to compute normal directions to impose normal-flux continuity, which greatly reduces algorithmic complexity, especially for large-scale problems with complex domains and for moving-interface problems.

The neural-network output on the $q^{th}$ subdomain is defined as

$$u_{\tilde{\Theta}_q}(\mathbf{z})=\mathcal{N}^{L}\left(\mathbf{z};\tilde{\Theta}_q\right)\in\Omega_q,\quad q=1,2,\cdots,N_{sd}$$
The final solution is defined as

$$u_{\tilde{\Theta}}(\mathbf{z})=\sum_{q=1}^{N_{sd}} u_{\tilde{\Theta}_q}(\mathbf{z})\cdot\mathbb{1}_{\Omega_q}(\mathbf{z})$$

where the indicator function is

$$\mathbb{1}_{\Omega_q}(\mathbf{z}):=\begin{cases}
0 & \text{if } \mathbf{z}\notin\Omega_q\\
1 & \text{if } \mathbf{z}\in\Omega_q\setminus\text{common interfaces of the } q^{th} \text{ subdomain}\\
\frac{1}{S} & \text{if } \mathbf{z}\in\text{common interfaces of the } q^{th} \text{ subdomain}
\end{cases}$$

and $S$ denotes the number of subdomains that meet along the common interface.
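To make the stitching concrete, here is a minimal PyTorch sketch assuming a 1D domain $[0,1]$ split into two subdomains at $x=0.5$; the class name `SubNet`, the network sizes, and the split location are illustrative assumptions, not taken from the paper's code.

```python
import torch
import torch.nn as nn

# Minimal sketch: two sub-nets on [0, 1] split at x = 0.5 (illustrative).
class SubNet(nn.Module):
    def __init__(self, width: int = 20):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(1, width), nn.Tanh(),
            nn.Linear(width, width), nn.Tanh(),
            nn.Linear(width, 1),
        )

    def forward(self, z):
        return self.net(z)

subnets = [SubNet(), SubNet()]          # one PINN per subdomain

def in_subdomain(z, q):
    """Indicator of subdomain q: Omega_0 = [0, 0.5], Omega_1 = [0.5, 1]."""
    return (z <= 0.5) if q == 0 else (z >= 0.5)

def xpinn_solution(z):
    """u(z) = sum_q u_q(z) * 1_{Omega_q}(z); points on the common
    interface (z == 0.5) lie in both subdomains and are averaged."""
    u = torch.zeros_like(z)
    count = torch.zeros_like(z)
    for q, net in enumerate(subnets):
        mask = in_subdomain(z, q).float()
        u = u + net(z) * mask
        count = count + mask
    return u / count                    # interface values divided by S
```

Dividing by `count` reproduces the $1/S$ weighting of the indicator function on shared interfaces ($S=2$ here).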

3.1 Subdomain loss functions for the forward and inverse problems

(1) Forward problem
In the $q^{th}$ subdomain, $\left\{\mathbf{x}_{u_q}^{(i)}\right\}_{i=1}^{N_{u_q}}$, $\left\{\mathbf{x}_{F_q}^{(i)}\right\}_{i=1}^{N_{F_q}}$ and $\left\{\mathbf{x}_{I_q}^{(i)}\right\}_{i=1}^{N_{I_q}}$ denote the training (data/boundary) points, the residual points, and the common-interface points, respectively; $N_{u_q}$, $N_{F_q}$ and $N_{I_q}$ are the corresponding numbers of points. Each subdomain uses one PINN, with $u_q=u_{\tilde{\Theta}_q}$. The loss function of the $q^{th}$ subdomain is defined as
$$\begin{aligned}
\mathcal{J}\left(\tilde{\Theta}_q\right)=\;& W_{u_q}\,\mathrm{MSE}_{u_q}\left(\tilde{\Theta}_q;\left\{\mathbf{x}_{u_q}^{(i)}\right\}_{i=1}^{N_{u_q}}\right)+W_{\mathcal{F}_q}\,\mathrm{MSE}_{\mathcal{F}_q}\left(\tilde{\Theta}_q;\left\{\mathbf{x}_{F_q}^{(i)}\right\}_{i=1}^{N_{F_q}}\right)\\
&+W_{I_q}\underbrace{\mathrm{MSE}_{u_{avg}}\left(\tilde{\Theta}_q;\left\{\mathbf{x}_{I_q}^{(i)}\right\}_{i=1}^{N_{I_q}}\right)}_{\text{interface condition}}+W_{I_{\mathcal{F}_q}}\underbrace{\mathrm{MSE}_{\mathcal{R}}\left(\tilde{\Theta}_q;\left\{\mathbf{x}_{I_q}^{(i)}\right\}_{i=1}^{N_{I_q}}\right)}_{\text{interface condition}}\\
&+\underbrace{\text{additional interface conditions}}_{\text{optional}}
\end{aligned}$$
where $W_{u_q}$, $W_{\mathcal{F}_q}$, $W_{I_{\mathcal{F}_q}}$ and $W_{I_q}$ are the weights of the corresponding loss terms, and
$$\begin{aligned}
\mathrm{MSE}_{u_q}\left(\tilde{\Theta}_q;\left\{\mathbf{x}_{u_q}^{(i)}\right\}_{i=1}^{N_{u_q}}\right)&=\frac{1}{N_{u_q}}\sum_{i=1}^{N_{u_q}}\left|u^{(i)}-u_{\tilde{\Theta}_q}\left(\mathbf{x}_{u_q}^{(i)}\right)\right|^2\\
\mathrm{MSE}_{\mathcal{F}_q}\left(\tilde{\Theta}_q;\left\{\mathbf{x}_{F_q}^{(i)}\right\}_{i=1}^{N_{F_q}}\right)&=\frac{1}{N_{F_q}}\sum_{i=1}^{N_{F_q}}\left|\mathcal{F}_{\tilde{\Theta}_q}\left(\mathbf{x}_{F_q}^{(i)}\right)\right|^2
\end{aligned}$$
$$\begin{aligned}
\mathrm{MSE}_{u_{avg}}\left(\tilde{\Theta}_q;\left\{\mathbf{x}_{I_q}^{(i)}\right\}_{i=1}^{N_{I_q}}\right)&=\sum_{\forall q^{+}}\left(\frac{1}{N_{I_q}}\sum_{i=1}^{N_{I_q}}\left|u_{\tilde{\Theta}_q}\left(\mathbf{x}_{I_q}^{(i)}\right)-\left\{\left\{u_{\tilde{\Theta}_q}\left(\mathbf{x}_{I_q}^{(i)}\right)\right\}\right\}\right|^2\right)\\
\mathrm{MSE}_{\mathcal{R}}\left(\tilde{\Theta}_q;\left\{\mathbf{x}_{I_q}^{(i)}\right\}_{i=1}^{N_{I_q}}\right)&=\sum_{\forall q^{+}}\left(\frac{1}{N_{I_q}}\sum_{i=1}^{N_{I_q}}\left|\mathcal{F}_{\tilde{\Theta}_q}\left(\mathbf{x}_{I_q}^{(i)}\right)-\mathcal{F}_{\tilde{\Theta}_{q^{+}}}\left(\mathbf{x}_{I_q}^{(i)}\right)\right|^2\right)
\end{aligned}$$
The last two terms are the interface-condition losses. The fourth term enforces residual continuity between the two networks on subdomains $q$ and $q^{+}$, where $q^{+}$ denotes a neighbor of subdomain $q$; $\mathrm{MSE}_{\mathcal{R}}$ and $\mathrm{MSE}_{u_{avg}}$ are defined over all neighboring subdomains. In the formulas above, $\left\{\left\{u_{\tilde{\Theta}_q}\right\}\right\}=u_{avg}:=\frac{u_{\tilde{\Theta}_q}+u_{\tilde{\Theta}_{q^{+}}}}{2}$ (assuming only two subdomains meet on the common interface). Additional interface conditions, e.g. flux continuity or higher-order $C^k$ continuity, can also be added to the loss depending on the type of PDE and interface.
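Continuing the two-subdomain sketch above (it reuses `subnets`), the $q^{th}$ subdomain loss might be assembled as follows; the weight values and the toy residual $u'-u$ are placeholders chosen so the snippet runs, not values from the paper.

```python
import torch

# Sketch of the q-th subdomain loss (forward problem). The residual below
# is a toy ODE u' - u = 0 so the snippet runs as written; swap in the
# real operator F for an actual PDE.
def pde_residual(net, x):
    x = x.clone().requires_grad_(True)
    u = net(x)
    u_x = torch.autograd.grad(u, x, torch.ones_like(u), create_graph=True)[0]
    return u_x - u

def subdomain_loss(q, q_plus, x_u, u_data, x_f, x_i,
                   W_u=20.0, W_f=1.0, W_i=20.0, W_if=1.0):
    net, net_p = subnets[q], subnets[q_plus]
    mse_u = ((net(x_u) - u_data) ** 2).mean()            # data/boundary term
    mse_f = (pde_residual(net, x_f) ** 2).mean()         # PDE residual term
    u_avg = 0.5 * (net(x_i) + net_p(x_i))                # {{u}} on the interface
    mse_avg = ((net(x_i) - u_avg) ** 2).mean()           # average-solution continuity
    mse_r = ((pde_residual(net, x_i)
              - pde_residual(net_p, x_i)) ** 2).mean()   # residual continuity
    return W_u * mse_u + W_f * mse_f + W_i * mse_avg + W_if * mse_r
```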
Remarks:

  • The type of interface conditions determines the regularity of the solution across the interface and thus affects the convergence speed; the solution on the interface must be sufficiently continuous to satisfy the governing PDE there.
  • Enough interface points are needed to tie the subdomains together; this is very important for the convergence of the algorithm, especially for interior interfaces (a sampling sketch follows this list).
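As a hypothetical illustration of interface sampling, the helper below draws $N_I$ points on a straight interface $x=0.5$ of a 2D space-time domain $(x,t)\in[0,1]^2$; for an irregular interface one would instead sample along the parametrized interface curve.

```python
import torch

# Hypothetical helper: sample n_i interface points on the line x = 0.5
# of a (x, t) space-time domain, returned as a tensor of shape (n_i, 2).
def sample_interface_points(n_i: int) -> torch.Tensor:
    t = torch.rand(n_i, 1)                 # random times in [0, 1]
    x = torch.full((n_i, 1), 0.5)          # fixed spatial coordinate
    return torch.cat([x, t], dim=1)

x_i = sample_interface_points(256)
```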

(2) Inverse problem
For the inverse problem with unknown PDE parameter $\lambda$, the subdomain loss is
$$\begin{aligned}
\mathcal{J}\left(\tilde{\Theta}_q,\lambda\right)=\;& W_{u_q}\,\mathrm{MSE}_{u_q}\left(\tilde{\Theta}_q,\lambda;\left\{\mathbf{x}_{u_q}^{(i)}\right\}_{i=1}^{N_{u_q}}\right)+W_{\mathcal{F}_q}\,\mathrm{MSE}_{\mathcal{F}_q}\left(\tilde{\Theta}_q,\lambda;\left\{\mathbf{x}_{u_q}^{(i)}\right\}_{i=1}^{N_{u_q}}\right)\\
&+W_{I_q}\underbrace{\left\{\mathrm{MSE}_{u_{avg}}\left(\tilde{\Theta}_q,\lambda;\left\{\mathbf{x}_{I_q}^{(i)}\right\}_{i=1}^{N_{I_q}}\right)+\mathrm{MSE}_{\lambda}\left(\tilde{\Theta}_q,\lambda;\left\{\mathbf{x}_{I_q}^{(i)}\right\}_{i=1}^{N_{I_q}}\right)\right\}}_{\text{interface conditions}}\\
&+W_{I_{\mathcal{F}_q}}\underbrace{\mathrm{MSE}_{\mathcal{R}}\left(\tilde{\Theta}_q,\lambda;\left\{\mathbf{x}_{I_q}^{(i)}\right\}_{i=1}^{N_{I_q}}\right)}_{\text{interface condition}}+\underbrace{\text{additional interface conditions}}_{\text{optional}}
\end{aligned}$$
where
$$\begin{aligned}
\mathrm{MSE}_{\mathcal{F}_q}\left(\tilde{\Theta}_q,\lambda;\left\{\mathbf{x}_{u_q}^{(i)}\right\}_{i=1}^{N_{u_q}}\right)&=\frac{1}{N_{u_q}}\sum_{i=1}^{N_{u_q}}\left|\mathcal{F}_{\tilde{\Theta}_q}\left(\mathbf{x}_{u_q}^{(i)}\right)\right|^2\\
\mathrm{MSE}_{\lambda}\left(\tilde{\Theta}_q,\lambda;\left\{\mathbf{x}_{I_q}^{(i)}\right\}_{i=1}^{N_{I_q}}\right)&=\sum_{\forall q^{+}}\left(\frac{1}{N_{I_q}}\sum_{i=1}^{N_{I_q}}\left|\lambda_q\left(\mathbf{x}_{I_q}^{(i)}\right)-\lambda_{q^{+}}\left(\mathbf{x}_{I_q}^{(i)}\right)\right|^2\right)
\end{aligned}$$
The remaining loss terms are the same as in the forward problem; a sketch of the $\lambda$-continuity term is given below.
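In the inverse problem each sub-net carries its own estimate of the unknown parameter, and $\mathrm{MSE}_\lambda$ penalizes disagreement between neighboring estimates on the shared interface. A minimal sketch for a constant scalar parameter follows; the list `lambdas` (one trainable estimate per subdomain) is an illustrative assumption.

```python
import torch

# Illustrative: one trainable scalar lambda per subdomain. For a constant
# parameter, the mean over the N_Iq interface points collapses to a single
# squared difference between neighboring estimates.
lambdas = [torch.nn.Parameter(torch.tensor(1.0)) for _ in range(2)]

def mse_lambda(q: int, q_plus: int) -> torch.Tensor:
    return (lambdas[q] - lambdas[q_plus]) ** 2

# During training, each subdomain's optimizer also updates lambdas[q];
# this term drives the per-subdomain estimates toward a common value.
```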
**Remark:** Note that because the XPINN loss function is highly non-convex, locating its global minimum is very difficult. However, several local minima yield similar loss values, and the corresponding predicted solutions have similar accuracy.

3.2 Optimization method

All derivatives appearing in the PDE residuals and interface terms are obtained by automatic differentiation, exactly as in standard PINNs; each sub-net is then trained with a gradient-based optimizer. A minimal sketch follows.
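As a sketch of how such residuals are formed with automatic differentiation, the snippet below evaluates a viscous Burgers residual $u_t+u\,u_x-\nu\,u_{xx}$ with `torch.autograd.grad`; the network, the input ordering $(x,t)$, and the value of $\nu$ are illustrative assumptions, not the paper's setup.

```python
import math
import torch
import torch.nn as nn

# Toy network taking (x, t) and returning u(x, t); sizes are illustrative.
net = nn.Sequential(nn.Linear(2, 20), nn.Tanh(), nn.Linear(20, 1))
nu = 0.01 / math.pi

def burgers_residual(xt: torch.Tensor) -> torch.Tensor:
    """Residual u_t + u * u_x - nu * u_xx at the points xt
    (column 0 = x, column 1 = t), all derivatives via autograd."""
    xt = xt.clone().requires_grad_(True)
    u = net(xt)
    du = torch.autograd.grad(u, xt, torch.ones_like(u), create_graph=True)[0]
    u_x, u_t = du[:, :1], du[:, 1:]
    u_xx = torch.autograd.grad(u_x, xt, torch.ones_like(u_x),
                               create_graph=True)[0][:, :1]
    return u_t + u * u_x - nu * u_xx

r = burgers_residual(torch.rand(64, 2))   # residuals at 64 random points
```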

3.3 Error analysis

$$\begin{aligned}
\mathcal{E}_{app,q}&=\left\|u_{a_q}-u_q^{ex}\right\|\\
\mathcal{E}_{gen,q}&=\left\|u_{g_q}-u_{a_q}\right\|\\
\mathcal{E}_{opt,q}&=\left\|u_{\tau_q}-u_{g_q}\right\|
\end{aligned}$$
denote the approximation error, the generalization error, and the optimization error, respectively, where

  • $u_{a_q}=\arg\min_{f\in F_q}\left\|f-u_q^{ex}\right\|$ is the best approximation in the network class $F_q$ to the true solution $u_q^{ex}$
  • $u_{g_q}=\arg\min_{\tilde{\Theta}_q}\mathcal{J}\left(\tilde{\Theta}_q\right)$ is the global minimizer of the loss
  • $u_{\tau_q}$ is the solution actually obtained after training the sub-network, i.e., the (generally local) minimizer returned by the optimizer

Finally, the XPINN error can be bounded as
$$\mathcal{E}_{XPINN}:=\left\|u_\tau-u^{ex}\right\|\le\left\|u_\tau-u_g\right\|+\left\|u_g-u_a\right\|+\left\|u_a-u^{ex}\right\|$$
where $\left(u^{ex},u_\tau,u_g,u_a\right)(\mathbf{z})=\sum_{q=1}^{N_{sd}}\left(u_q^{ex},u_{\tau_q},u_{g_q},u_{a_q}\right)(\mathbf{z})\cdot\mathbb{1}_{\Omega_q}(\mathbf{z})$.


Remarks:

  • When the estimation error decreases (i.e., the data are fitted better), the generalization error increases; this is a bias-variance trade-off. The two main factors affecting the generalization error are the number and the distribution of residual points.
  • The optimization error is governed by the complexity of the loss function, and the network architecture strongly affects it.

3.4 Comparison of XPINN, cPINN, and PINN


Compared with the PINN and cPINN frameworks, XPINN has many advantages, but it also shares their limitations: the absolute error of the predicted PDE solution does not fall below a certain level. This stems from the inaccuracies involved in solving a high-dimensional non-convex optimization problem, which may terminate in a poor local minimum.
