ML_ODE
provide the code to build a Machine Learning model for solving a Ordinary differential equation (ODE).
An ODE problem consist in finding a function of time t
f(t;x_0, Omega)
, dependent parametrically on x_0
and Omega
, such that:
f'(t) = F(t,x,Omega)
f(t=0) = x_0
in this context x_0
is the value of the function at some initial time and Omega
is a set of additional parameters useful to specify the time derivative of the problem.
As it is well known, a neural network can approximate (at least in principle) any real function, so: why not approximating the ODE solution with a Neural Network? In doing so we follow the seminal work Artificial Neural Networks for Solving Ordinary and Partial Differential Equations. The idea is further developed in a more modern paper and this is where we take inspiration from. Basically, we use the following ansatz for the solution:
f(t;x_0, Omega) = x_0 + (1-exp(-t))*NN(t, x_0, Omega)
This ansatz is pretty smart: it satisfies identically the boundary conditions and you can easily check how far f
is from the actual solution by comparing its time derivative with the function F(t,Omega)
of the problem. This idea is implemented in the loss function, which takes the following form:
L(t,x_0,Omega) = | f'(t;x_0, Omega) - F(t, f(t,_x0, Omega), Omega) |**2 * exp(-lambda*t)
Where lambda
is a regularization constant to ensure that the training is more gradual.
Once you have a loss function, it's all done: Tensorflow takes care of minimizing your loss function with respect to the weights of the NN.
You may wonder why we need a NN to solve an ODE, when we already have plenty of finite difference methods that works pretty well. The answer is simple: a NN is fast! This of course saves you a lot of computational resourches and at the same time you get the same degree of accuracy. Not convinced yet? A NN also provides a closed form expression for the solution, and once you have such expression you can differentiate it as many times as you wish: nice, isn't?
All the interesting code is wrapped in a single class, which provides you a (almost) ready to use model to fit. It is built to resemble a Keras model. You will find method fit()
to train the model and method call()
to evaluate the NN. You can use methods get_solution()
or ODE_solution()
to compute the solution that you are looking for.
You can inherit the provided baseclass and customize your own model easily. You only need to specify the following:
- The network architecture (by writing function
set_model_properties()
), given as a list of dense keras layers (list should beself._l_list
) - Some hyperparameters of the networks: the dimensionality of the problem (
n_vars
), the number of variables Omega (n_params
), the range in which every variable lives (in listconstraints
), as well as the range of time the model should be trained at. You might also set the value of a regularizer for the loss function, you can safely use a small value, such as 0.1~1. (See lambda parameter in eq.3 here for more information) - The function
F(t,x,Omega)
. It should be a tensorflow function, which outputs a batch with shape(None, n_vars)
That's all. Not simple enough? Below you will find a simple example:
import ML_ODE
class my_ML_ODE(ML_ODE.ML_ODE_Basemodel):
"Class for solving a ODE problem. Inherit from ML_ODE.ML_ODE_Basemodel and implements some methods."
def set_model_properties(self):
"This class should initialise some important paramters for the network"
self.n_vars = 2 #number of variables in the problem
self.n_params = 2 #number of parameters in the problem
self.constraints = [(0.,1.),(-1., 1.), (-1., 1.), (-1., 1.),(-1., 1.)] #(t_range, x0_range, x1_range, *params_range)
self.regularizer = 1. #regularizer for the loss function (not compulsory, default 0)
#The regularizer act on the loss function L(t,x,Omega) s.t. L' = L * exp(regularizer*t): this is to ensure that earlier times are fitted with more accuracy
def build_NN(self):
"This class builds the fully connected NN architechture. Layer sequence should be provided by a list of layers (in self._l_list)"
self._l_list.append(tf.keras.layers.Dense(128*2, activation=tf.nn.sigmoid) )
self._l_list.append(tf.keras.layers.Dense(128, activation=tf.nn.sigmoid) )
self._l_list.append(tf.keras.layers.Dense(self.n_vars, activation=tf.keras.activations.linear))
def ODE_derivative(self, t, X, Omega): #actually z, D_L, Omegas (N,2)
"""Here we write down the derivative of x: x' = F(t,x,Omega).
Function should be written in pure tensorflow (input and outputs are tf tensors).
Inputs:
t (None,) times
X (None, n_vars) values of the variables
Omega (None, n_params) parameters
Output:
x_prime (None,n_vars)/(None,) derivative of the function
"""
return tf.math.multiply(X,Omega) #simple example where F_i(t,x,Omega) = x_i*Omega_i
You can find working examples in examples/example_model.py
and examples/cosmological_model.py
: the trained models are in folder examples/example_model
and examples/cosmological_model
.
It's super simple: you only need the file ML_ODE.py, which keeps all the code you need. Get it with:
wget https://raw.githubusercontent.com/stefanoschmidt1995/ML_ODE/main/ML_ODE.py
and you're done! You can import ML_ODE
from every python script you like: that's all.
If you need also the examples, you might want to clone the whole repository
git clone https://github.com/stefanoschmidt1995/ML_ODE.git
Every function is documented (I did my best to make it clear).
Once you have instantiated a model, you can use the python help function for understanding the behaviour of the ML_ODE
class or of a specific method:
help(ML_ODE)
help(ML_ODE.method_you_want_to_know_more_about)
Furthermore, you can have a look at the examples I provided.
If still something is not clear, or you wish to have more information, you can contact me at [email protected]