
Physics Informed Neural Networks

1 Introduction

1.0.1 Finding the inverse function of a parabola

Given a function \(\mathcal{P}: y \mapsto y^2\), where \(y ∈ [0, 1]\), find an unknown function \(f\) that satisfies \(\mathcal{P}(f(x)) = x,\ ∀ x ∈ [0, 1]\).
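
For concreteness, the data set \(Ω\) can be sampled directly from the parabola; a minimal sketch in JAX (array sizes and names are illustrative, not from the original):

import jax.numpy as jnp

# Points (x, y) on the parabola x = y², y ∈ [0, 1], so the unknown
# inverse satisfies P(f(x)) = f(x)² = x.
y = jnp.linspace(0.0, 1.0, 100)
x = y ** 2
omega = (x, y)  # the data set Ω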

1.0.1.1 Figure

./p1.png

1.0.2 Classical

1.0.2.1 MLP

A classical approach is to use a neural network to approximate the data points \(Ω\):

\[f_θ(x) ≈ y,\ ∀ (x, y) ∈ Ω\]

where \(θ\) are the parameters of the neural network.

However, the fitted network ends up nowhere near the true inverse.
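
A minimal sketch of this supervised objective, assuming a `net.apply(theta, x)` function in the style of the JAX examples later in the document (the name is illustrative):

import jax
import jax.numpy as jnp

def data_loss(theta, x, y):
    # Mean squared error: push f_θ(x) towards y for all (x, y) ∈ Ω.
    return jnp.mean((net.apply(theta, x) - y) ** 2)

grad_fn = jax.grad(data_loss)  # gradient w.r.t. θ for training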

1.0.2.2 Results

./p2.png

1.0.3 Physics approach

1.0.3.1 PINN

\[\mathcal{P}(f_θ(x)) ≈ x,\ ∀ (x, y) ∈ Ω\]

Now, minimize the error between \(f_θ(x)^2\) and \(x\).
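
A minimal sketch of this physics-informed objective, under the same illustrative `net.apply` assumption as above:

import jax.numpy as jnp

def physics_loss(theta, x, y):
    # Push P(f_θ(x)) = f_θ(x)² towards x instead of fitting y directly.
    f = net.apply(theta, x)
    return jnp.mean((f ** 2 - x) ** 2)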

1.0.3.2 Results

./p3.png

1.0.4 Physics Informed Neural Networks (PINNs) Definition

Neural networks that are trained to solve supervised learning tasks while respecting any given law of physics described by general nonlinear partial differential equations (PDEs).

./p4.png

2 Partial differential equations (PDE)

2.1 Introduction

2.1.1 What is a PDE?

  • An equation containing unknown functions and their partial derivatives.
  • It describes the relationship between independent variables, unknown functions, and their partial derivatives.

2.1.1.1 Example

  • \(f(x, y) = ax + by + c\), where \(a, b, c\) are unknown parameters.
  • \(u(x, y) = α u(x, y) + β f(x, y)\), where \(u\) is the unknown function.
  • \(u_x(x, y) = α u_y(x, y) + β f_{xy}(x, y)\), where \(u_x\) is the partial derivative of \(u\) with respect to \(x\), \(u_y\) is the partial derivative of \(u\) with respect to \(y\), and \(f_{xy}\) is the partial derivative of \(f\) with respect to \(x\) and \(y\).

2.1.2 Notations

  • \(\dot{u} = \frac{∂ u}{∂ t}\)
  • \(u_{xy} = \frac{∂^2 u}{∂ y\,∂ x}\)
  • \(∇ u(x, y, z) = (u_x, u_y, u_z)\)
  • \(∇ ⋅ ∇ u(x, y, z) = Δ u(x, y, z) = u_{xx} + u_{yy} + u_{zz}\)
  • \(∇\): nabla, or del.
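
These operators map directly onto JAX transformations; for example, a minimal sketch of the Laplacian as the trace of the Hessian (the function `u` here is a made-up test case):

import jax
import jax.numpy as jnp

def laplacian(u):
    # Δu = u_xx + u_yy + u_zz: the trace of the Hessian of u.
    hess = jax.hessian(u)
    return lambda point: jnp.trace(hess(point))

u = lambda p: jnp.sum(p ** 2)     # u(x, y, z) = x² + y² + z²
print(laplacian(u)(jnp.ones(3)))  # 2 + 2 + 2 = 6.0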

2.2 PDE in the real world

2.2.1 Laplace’s equation

\[Δ \varphi = 0\]

or

\[∇ ⋅ ∇ \varphi = 0\]

or, in a 3D space:

\[\frac{∂^2 \varphi}{∂ x^2}+\frac{∂^2 \varphi}{∂ y^2}+\frac{∂^2 \varphi}{∂ z^2} = 0\]

2.2.2 Poisson’s equation

\[Δ \varphi = f\]

./p5.png

2.2.3 Heat equation

\[\dot{u} = \frac {∂ u}{∂ t} = α Δ u\]

where \(α\) is the thermal diffusivity.

2.2.4 Wave equation

\[\ddot{u} = c^2 ∇^2 u\]

where \(c\) is the wave speed.

2.2.5 Burgers’ equation

\[u_t + u u_x = ν u_{xx}\]

  • \(t\): temporal coordinate
  • \(x\): spatial coordinate
  • \(u(x, t)\): speed of the fluid at the indicated spatial and temporal coordinates
  • \(ν\): viscosity of the fluid

2.3 Boundary conditions

2.3.1 Boundary conditions

For an equation \(∇^2 y + y = 0\) in a domain \(Ω\):

  • Dirichlet boundary condition: \(y(x) = f(x)\quad ∀ x ∈ ∂Ω\)
  • Neumann boundary condition: \(\frac{∂ y}{∂ \mathbf{n}}(\mathbf{x}) = f(\mathbf{x})\quad ∀ \mathbf{x} ∈ ∂Ω\)
    • Here \(f\) is a known scalar function defined on the boundary \(∂Ω\), and \(\mathbf{n}\) denotes the (typically exterior) normal to the boundary.
    • The normal derivative on the left side is defined as \(\frac{∂ y}{∂ \mathbf{n}}(\mathbf{x}) = ∇ y(\mathbf{x}) ⋅ \mathbf{\hat{n}}(\mathbf{x})\), where \(\mathbf{\hat{n}}\) is the unit normal.
  • Robin boundary condition
    • A linear combination of the Dirichlet and Neumann conditions (see the form below).
  • Periodic boundary condition
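
For reference, the Robin condition combines the two with known coefficient functions \(a\) and \(b\) on the boundary:

\[a(\mathbf{x})\, y(\mathbf{x}) + b(\mathbf{x})\, \frac{∂ y}{∂ \mathbf{n}}(\mathbf{x}) = f(\mathbf{x}) \quad ∀\, \mathbf{x} ∈ ∂Ω\]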

3 PINNs

3.0.1 Paper

  • Physics Informed Deep Learning (Part I): Data-driven Solutions of Nonlinear Partial Differential Equations (Raissi, Perdikaris & Karniadakis, 2017)
  • Physics Informed Deep Learning (Part II): Data-driven Discovery of Nonlinear Partial Differential Equations (Raissi, Perdikaris & Karniadakis, 2017)

3.0.2 Problem

  • Data-driven solution and data-driven discovery
  • Continuous time and discrete time models

3.1 Data-driven solution with continuous time

3.1.1 Data-driven solution with continuous time

General PDE Form:

\[u_t + \mathcal{N}[u] = 0,\ x ∈ Ω, \ t∈[0,T]\]

where:

  • \(\mathcal{N}[u]\): nonlinear differential operator
  • \(u(t, x)\): unknown function (solution)
  • \(Ω\): spatial domain
  • \(t\): time

3.1.2 Physics informed neural network

  • A neural network \(u_θ ≈ u\), where \(θ\) are the parameters of the neural network.
  • A physics informed neural network \(f_θ := (u_θ)_t + \mathcal{N}[u_θ]\).
  • Target: \(f_θ ≈ u_t + \mathcal{N}[u] = 0\) and \(u_θ ≈ u\).
    • \(\mathcal{L} = \mathcal{L}_f + \mathcal{L}_u\)

./p6.png

3.1.3 Example (Burgers’ Equation)

The equation:

\[u_t + u u_x = ν u_{xx}\]

Here, we already know \(ν = 0.01/π\), \(x ∈ [-1, 1]\), \(t ∈ [0, 1]\).

Thus,

\[u_t + u u_x - (0.01/π)\, u_{xx} = 0\]

And the initial condition along with the Dirichlet boundary conditions can be written as:

  • \(u(0, x) = -\sin(π x)\)
  • \(u(t, -1) = u(t, 1) = 0\)

3.1.4 Target

  • Data:
    • Boundary-only data, from the initial and boundary conditions.
  • Input: \(\{t, x\}\)
  • Output: \(u(t, x)\)
  • Target: \(f_θ ≈ u_t + \mathcal{N}[u] = 0\) and \(u_θ ≈ u\).
    • \(\mathcal{L} = \mathcal{L}_f + \mathcal{L}_u\)

3.1.5 Example (Burgers’ Equation) with codes

def u_theta(theta, t, x):
    # u_theta(theta, t, x) approximates u(t, x);
    # net is a transformed Haiku network (see the JAX section below).
    return net.apply(theta, t, x)

def f_theta(theta, t, x):
    # Derivatives by autodiff; see the autodiff cookbook:
    # https://jax.readthedocs.io/en/latest/notebooks/autodiff_cookbook.html
    u = u_theta(theta, t, x)
    u_t = jax.jacrev(u_theta, argnums=1)(theta, t, x)
    u_x = jax.jacrev(u_theta, argnums=2)(theta, t, x)
    u_xx = jax.hessian(u_theta, argnums=2)(theta, t, x)
    # or: jax.jacfwd(jax.jacrev(u_theta, argnums=2), argnums=2)
    f = u_t + u * u_x - (0.01 / jnp.pi) * u_xx  # ν = 0.01/π
    return f
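
The two pieces can then be combined into \(\mathcal{L} = \mathcal{L}_u + \mathcal{L}_f\); a minimal sketch, assuming boundary/initial data (t_u, x_u, u_target) and collocation points (t_f, x_f) have been sampled beforehand (these names are illustrative):

def loss(theta, batch):
    t_u, x_u, u_target, t_f, x_f = batch
    # L_u: data misfit on the initial/boundary points.
    u_pred = jax.vmap(u_theta, in_axes=(None, 0, 0))(theta, t_u, x_u)
    mse_u = jnp.mean((u_pred - u_target) ** 2)
    # L_f: PDE residual at the collocation points, pushed towards zero.
    f_pred = jax.vmap(f_theta, in_axes=(None, 0, 0))(theta, t_f, x_f)
    mse_f = jnp.mean(f_pred ** 2)
    return mse_u + mse_f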

3.1.6 Train and Results

  • Train MLPs with the L-BFGS solver (a quasi-Newton method).
  • Use tanh rather than ReLU activations: the second-order derivative of ReLU is zero almost everywhere, so the \(u_{xx}\) term of the residual vanishes, as the sketch below shows.
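
A quick check of the ReLU problem using standard JAX APIs:

import jax
import jax.numpy as jnp

d2_relu = jax.grad(jax.grad(jax.nn.relu))
d2_tanh = jax.grad(jax.grad(jnp.tanh))

print(d2_relu(2.0))  # 0.0 — no second-order signal for the residual
print(d2_tanh(0.5))  # non-zero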

./p7.png

3.2 Data-driven discovery with continuous time

3.2.1 Data-driven discovery with continuous time

General PDE Form:

\[u_t + \mathcal{N}[u;λ] = 0,\ x ∈ Ω, \ t∈[0,T]\]

where:

  • \(\mathcal{N}[u; λ]\): nonlinear differential operator with parameters \(λ\)
  • \(u(t, x)\): unknown function (solution)
  • \(Ω\): spatial domain
  • \(t\): time

3.2.2 Example (Incompressible Navier-Stokes Equations, a convection–diffusion system)

The equations:

\[u_t + λ_1 (u u_x + v u_y) = -p_x + λ_2 (u_{xx} + u_{yy}),\]
\[v_t + λ_1 (u v_x + v v_y) = -p_y + λ_2 (v_{xx} + v_{yy}),\]

where:

  • \(u(t, x, y)\): \(x\)-component of the velocity field
  • \(v(t, x, y)\): \(y\)-component of the velocity field
  • \(p(t, x, y)\): pressure
  • \(λ_1, λ_2\): the unknown parameters

Additional physical constraints:

  • Solutions to the Navier-Stokes equations are searched for in the set of divergence-free functions, i.e.:
    • \(u_x + v_y = 0\),
    • which describes the conservation of mass of the fluid.
  • \(u\) and \(v\) can be written in terms of a latent function \(ψ(t, x, y)\) with an assumption that makes this constraint automatic (shown below):
    • \(u = ψ_y,\ v = -ψ_x\)
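
The ansatz satisfies the divergence-free constraint by construction, since mixed partial derivatives commute:

\[u_x + v_y = ψ_{yx} - ψ_{xy} = 0\]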

3.2.3 NS Equation figure

./p8.png

3.2.4 Example (Navier-Stokes Equation) – Target

  • The neural network equations:
    • \(f := u_t + λ_1 (u u_x + v u_y) + p_x - λ_2 (u_{xx} + u_{yy}),\)
    • \(g := v_t + λ_1 (u v_x + v v_y) + p_y - λ_2 (v_{xx} + v_{yy})\)
  • Input: noisy measurements \(\{t, x, y, u, v\}\).
  • Output: \((ψ(t, x, y), p(t, x, y))\).
  • Target (see the sketch below):
    • \(f_θ ≈ f\)
    • \(g_θ ≈ g\)
    • \(u_θ ≈ u\)
    • \(v_θ ≈ v\)
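
Mirroring the Burgers code above, \(u\) and \(v\) can be recovered from the latent output by autodiff; a rough sketch, assuming a network psi_p(theta, t, x, y) that returns the pair \((ψ, p)\) (an illustrative name, not from the paper):

def uvp(theta, t, x, y):
    # u = ψ_y and v = -ψ_x give a divergence-free velocity field.
    psi = lambda t, x, y: psi_p(theta, t, x, y)[0]
    p = psi_p(theta, t, x, y)[1]
    u = jax.grad(psi, argnums=2)(t, x, y)
    v = -jax.grad(psi, argnums=1)(t, x, y)
    return u, v, p

The residuals \(f_θ\) and \(g_θ\) then follow from further jax.grad / jax.hessian calls, exactly as in the Burgers example.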

3.2.5 Results

./p9.png

4 JAX

4.0.1 Introduction

JAX is Autograd and XLA, brought together for high-performance numerical computing and machine learning research. It provides composable transformations of Python+NumPy programs: differentiate, vectorize, parallelize, Just-In-Time compile to GPU/TPU, and more.
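
For example, the transformations compose freely (standard JAX APIs):

import jax
import jax.numpy as jnp

f = lambda x: jnp.sin(x) ** 2
df = jax.jit(jax.vmap(jax.grad(f)))   # differentiate, vectorize, compile
print(df(jnp.linspace(0.0, 1.0, 4)))  # equals 2·sin(x)·cos(x)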

4.0.2 Pure functional

  • A pure function always returns the same output for the same input: \(f(x) = y\), always.
  • Examples of non-pure behavior:
    • IO operations, e.g. print
    • random functions without an explicit seed (JAX's key-based alternative is sketched below)
    • reading the current time
    • runtime errors
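
JAX therefore makes randomness explicit: every random call takes a PRNG key, so the function stays pure:

import jax

key = jax.random.PRNGKey(0)          # explicit seed, no global state
key, subkey = jax.random.split(key)  # derive fresh keys deterministically
x = jax.random.normal(subkey, (3,))  # same key ⇒ same sample, always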

4.0.3 Ecosystem

  • JAX (jax, jaxlib)
    • jax
    • jax.numpy
  • Haiku (dm-haiku) from DeepMind
    • Modules
  • Optax (optax) from DeepMind
    • Lightweight
    • First-order (gradient-based) optimizers
  • JAXopt (jaxopt)
    • Other optimizers (e.g. the quasi-Newton L-BFGS)
  • Jraph (jraph)
    • Standardized data structures for graphs.
  • JAX, M.D. (jax-md)
    • JAX and Molecular Dynamics
  • RLax (rlax), and Coax (coax)
    • Reinforcement Learning

4.0.4 Example (def)

import jax
import jax.numpy as jnp
import haiku as hk

def _u(t, x):
    # MLP with hidden sizes [10, 10] and a scalar output, applied to (t, x).
    return hk.nets.MLP([10, 10, 1])(jnp.concatenate([t, x], axis=-1))

u = hk.transform_with_state(_u)

4.0.5 Example (init)

fake_t = jnp.ones([batch, size])
fake_x = jnp.ones([batch, size])

rng = jax.random.PRNGKey(42)

# params: the network parameters θ
# state:  mutable network state
# rng:    random number generator key
params, state = u.init(rng, fake_t, fake_x)

# Print a summary table of the modules
print(hk.experimental.tabulate(u)(fake_t, fake_x))

4.0.6 Example (loss)

def loss_fn(config, ...):

    def _loss(params, state, t, x):
        # transform_with_state: apply takes (params, state, rng, inputs)
        u_hat, state = u.apply(params, state, None, t, x)
        ...
        loss = _f  # e.g. data misfit plus PDE residual
        return loss

    return _loss

loss = loss_fn(config, ...)

4.0.7 Example (optim)

import optax

lr = optax.linear_schedule(
    init_value=0.001,
    end_value=0.001 / 10,
    transition_steps=1,    # take one step to reach the final value
    transition_begin=150,  # start the decay after 150 steps
)

opt = optax.adam(learning_rate=lr)
# or
opt = optax.adamax(learning_rate=lr)

4.0.8 Example (solver)

import jaxopt

# First-order solver wrapping the Optax optimizer
solver = jaxopt.OptaxSolver(
    loss,
    opt,
    maxiter=epochs,
    ...
)

# Quasi-Newton solver (L-BFGS)
solver = jaxopt.LBFGS(
    loss,
    maxiter=epochs,
    ...
)

opt_state = solver.init_state(params, state)
update = solver.update

4.0.9 Example (train)

# Defined above: params, state, opt_state, update

for batch in data:
    params, opt_state = update(params, opt_state, batch)

4.0.10 Example (parallel)

# Use pjit
import numpy as np

from jax.experimental.maps import Mesh, ResourceEnv, thread_resources
from jax.experimental.pjit import PartitionSpec, pjit

mesh = Mesh(np.asarray(jax.devices(), dtype=object), ["data", ...])
thread_resources.env = ResourceEnv(physical_mesh=mesh, loops=())

update = pjit(
    solver.update,
    in_axis_resources=[
        None,  # params
        None,  # state
        PartitionSpec("data"),  # batch
    ],
    out_axis_resources=None,
)

5 Conclusion

5.0.1 Conclusion

  • Find an inverse function of a parabola
    • Classical
    • Physics informed
  • PDE
    • PDE example
    • PDE boundary
  • PINNs
    • Data-driven solution with continuous time
      • Burgers’ equation
    • Data-driven discovery with continuous time
      • Navier-Stokes equation

6 Refs

6.0.1 Refs

  • Raissi, M., Perdikaris, P., & Karniadakis, G. E. (2017). Physics Informed Deep Learning (Part I): Data-driven Solutions of Nonlinear Partial Differential Equations. arXiv:1711.10561.
  • Raissi, M., Perdikaris, P., & Karniadakis, G. E. (2017). Physics Informed Deep Learning (Part II): Data-driven Discovery of Nonlinear Partial Differential Equations. arXiv:1711.10566.