.. Automatically generated Sphinx-extended reStructuredText file from DocOnce source
   (https://github.com/hplgit/doconce/)

.. Document title:

A Tutorial for the Odespy Interface to ODE Solvers
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

:Authors: Hans Petter Langtangen, Liwei Wang
:Date: May 1, 2015

The Odespy package makes it easy to specify an ODE problem in
Python and get it solved by a wide variety of different numerical
methods and software.

.. _ode:sec:motivation:

.. !split

Motivation
==========

The Odespy package grew out of the desire to have a unified interface
to lots of different methods and software for ODEs. Consider the
ODE problem

.. math::
         y'' = 3 (1 - y^2) y' - y,\quad y(0)=2,\ y'(0)=1, 

known as the van der Pool oscillator. The solution is
desired at 150 equally spaced time levels in
the interval ``[0, 30]``.

Traditional Approach
--------------------

We want to solve this problem
by three well-known routines:

 1. ``LSODE`` from ODEPACK (adaptive Adams and BDF methods)

 2. ``ode45`` from MATLAB (adaptive Runge-Kutta 4-5-th order)

 3. ``vode`` from Python (adaptive Adams and BDF methods)

All of these routines require the ODE problem to be on the form
:math:`u'=f(u,t)`, which means that
the second-order differential equation must be
recast as a system of two ODEs,

.. math::
         \frac{d}{dt}u^{(0)} = u^{(1)},\quad \frac{d}{dt}u^{(1)} =3(1-{(u^{(0)})}^2) u^{(1)} - u^{(0)},

and we have to identify the two components
of the :math:`f(u,t)` function:

.. math::
         f^{(0)}(u^{(0)}, u^{(1)}, t) = u^{(1)}, \quad
           f^{(1)}(u^{(0)}, u^{(1)}, t) =3(1-{(u^{(0)})}^2)u^{(1)} - u^{(0)}

The corresponding boundary conditions become

.. math::
         u^{(0)}(0)=2,\quad u^{(1)}(0)=1. 

The mentioned ODE software needs a specification of the :math:`f(u,t)` formulas
through some user-written function that takes :math:`u` and :math:`t` as input and
delivers the vector :math:`f` as output.

LSODE
~~~~~

Application of ``LSODE`` and other ODEPACK routines requires the
ODE problem to be specified in FORTRAN and the solver to be called
from FORTRAN:

.. code-block:: fortran

             PROGRAM MAIN
             EXTERNAL F
             INTEGER I, IOPT, IOUT, ISTATE, ITASK, ITOL, IWORK,
            1   LRW, LIW, MF, NEQ, NOUT
             DOUBLE PRECISION ATOL, T, TOUT, RTOL, RWORK, U, URR
             DIMENSION U(2), RWORK(52), IWORK(20), U1(5), U2(5)
             NEQ = 2
        C    SET ADAMS METHOD:
             MF = 10
        C    LET TOLERANCES BE SCALARS (NOT ARRAYS):
             ITOL = 1
        C    USE ONLY ABSOLUTE TOLERANCE:
             RTOL = 0.0D0
             ATOL = 1.0D-6
             LRW = 52
             LIW = 20
        C    NUMBER OF TIME STEPS:
             NOUT = 150
        C    FINAL TIME:
             TOUT = 30.0D0
        C    INITIAL CONDITIONS
             T = 0.0D0
             U(1)= 2.0D0
             U(2) = 0.0D0
             ITASK = 1
             ISTATE = 1
        C    CALL ADAPTIVE TIME STEPPING AT EACH OF THE TARGET TIME LEVELS
             DO 100 IOUT = 1, NOUT
               CALL DLSODE(F,NEQ,U,T,TOUT,ITOL,RTOL,ATOL,ITASK,
            1     ISTATE,IOPT,RWORK,LRW,IWORK,LIW,JAC,MF)
               U1(IOUT) = U(1)
               U2(IOUT) = U(2)
               TOUT = TOUT + 2.0D-1
         100 CONTINUE
             END
        
             SUBROUTINE F(NEQ, T, U, UDOT)
             INTEGER NEQ
             DOUBLE PRECISION T, U, UDOT
             DIMENSION U(2), UDOT(2)
             UDOT(1) = U(2)
             UDOT(2) = 3.0D0*(1.0D0 - U(1)*U(1))*U(2) - U(1)
             RETURN
             END

MATLAB
~~~~~~

The problem can be solved with very compact code in MATLAB. The definition
of the ODE system, the :math:`f(u,t)` function, is placed in
a function in a file, say ``myode.m``:

.. code-block:: matlab

        function F = myode(t, u);
        F(1,1) = u(2)
        F(2,1) = 3*(1 - u(1)*u(1))*u(2) - u(1)

In MATLAB we can then solve the problem by

.. code-block:: ipy

        >> options = odeset('RelTol',0.0,'AbsTol',1e-6);
        >> tspan = [0 30];
        >> u0 = [2; 0]
        >> [t, u] = ode45('myode', tspan, u0, options]);

Python
~~~~~~

Calling up the ``vode`` method from the ``scipy`` library in Python
also results in fairly compact code:

.. code-block:: python

        def f(t, u):
            return [u[1], 3.*(1. - u[0]*u[0])*u[1] - u[0]]
        
        from scipy.integrate import ode
        r = ode(f).set_integrator('vode', method='adams',
                                  order=10, rol=0, atol=1e-6,
                                  with_jacobian=False)
        u0 = [2.0, 0.0]
        r.set_initial_value(u0, 0)
        T = 30
        dt = T/150.
        u = [];  t = []
        while r.successful() and r.t <= T:
            r.integrate(r.t + dt)
            u.append(r.y);  t.append(r.t)

Suppose you want to compare these methods and their implementations.
This requires three different main programs, but even worse: three
different implementations of the definition of the mathematical
problem.  Some specifications of :math:`f` has the signature :math:`f(u,t)` while
others require :math:`f(t,u)`, and such differences between packages are
often a cause of programming errors.

Odespy's Unified Interface
--------------------------

The Odespy package provides a unified interface to all the three
mentioned types of methods, which makes it easy to run all of them
in a loop (program `motivation.py <https://github.com/hplgit/odespy/blob/master/doc/src/tutorial/src-odespy/motivation1.py>`__):

.. code-block:: python

        def f(u, t):
            return [u[1], 3.*(1. - u[0]*u[0])*u[1] - u[0]]
        
        u0 = [2.0, 0.0]
        import odespy, numpy
        
        for method in odespy.Lsode, odespy.DormandPrince, odespy.Vode:
        
            solver = method(f, rtol=0.0, atol=1e-6,
                            adams_or_bdf='adams', order=10)
            solver.set_initial_condition(u0)
            t_points = numpy.linspace(0, 30, 150)
            u, t = solver.solve(t_points)

Note in particular that the same ``f`` and the same call syntax can be
reused across methods and the underlying software.

Methods and Implementations Offered by Odespy
---------------------------------------------

Odespy features a unified interface to the following collection of
numerical methods and implementations:

  * Pure Python implementations of classical explicit schemes such as
    the Forward Euler method (also called Euler);
    Runge-Kutta methods of 2nd, 3rd, and 4th order; Heun's method;
    Adams-Bashforth methods of 2nd, 3rd, and 4th order;
    Adams-Bashforth-Moulton methods of 2nd and 3rd order.

  * Pure Python implementations of classical implicit schemes such as
    Backward Euler; 2-step backward scheme; the :math:`\theta` rule;
    the Midpoint (or Trapezoidal) method.

  * Pure Python implementations of adaptive explicit Runge-Kutta
    methods of type Runge-Kutta-Fehlberg of order (4,5), Dormand-Prince
    of order (4,5), Cash-Karp of order (4,5), Bogacki-Shampine of order (2,3).

  * Wrappers for all FORTRAN solvers in `ODEPACK <http://www.netlib.org/odepack/>`__.

  * Wrappers for the wrappers of FORTRAN solvers in `scipy <http://www.scipy.org>`__:
    ``vode`` and ``zvode`` (adaptive Adams or BDF from `vode.f <http://www.netlib.org/ode/vode.f>`__);
    ``dopri5`` (adaptive Dormand-Prince method of order (4,5));
    ``dop853`` (adaptive Dormand-Prince method of order 8(5,3));
    ``odeint`` (adaptive switching between Adams or BDF from the implementation ``LSODA`` in `ODEPACK <http://www.netlib.org/odepack/>`__).

  * Wrapper for the Runge-Kutta-Chebyshev formulas of order 2 as
    offered by the well-known FORTRAN code `rkc.f <http://www.netlib.org/ode/rkc.f>`__.

  * Wrapper for the Runge-Kutta-Fehlberg method of
    order (4,5) as provided by the well-known FORTRAN code `rkf45.f <http://www.netlib.org/ode/rkf45.f>`__.

  * Wrapper for the Radau5 method as provided by the well-known FORTRAN code
    `radau5.f <http://www.unige.ch/~hairer/prog/stiff/radau5.f>`__.

  * Wrapper for some solvers in the `odelab <https://github.com/olivierverdier/odelab>`__ package.

The ODE problem can always be specified in Python, but for wrappers of
FORTRAN codes one can also implement the problem in FORTRAN and avoid
callback to Python.

.. !split

Installation
============

The Odespy package is most easily installed using ``pip``:

.. code-block:: text

        sudo pip install -e \ 
           git+https://github.com/hplgit/odespy.git#egg=odespy

Checking out the source code is almost as easy:

.. code-block:: text

        git clone git@github.com:hplgit/odespy.git
        cd odespy
        sudo python setup.py install

You will at least also need Python v2.7 and the `numpy <http://numpy.scipy.org/>`__ package.  The FORTRAN codes ``rkc.f``,
``rkf45.f``, ``radau5.f``, and ODEPACK comes with Odespy and are compiled
and installed by ``setup.py``. If you lack a FORTRAN compiler, you
can drop the installation of the FORTRAN solvers by running

.. code-block:: python

        sudo python setup.py install --no-fortran

There have been various problems with compiling Odespy on Windows,
usually related to the Fortran compiler.
One recommended technique is to rely on Anaconda on Windows,
install the ``ming32`` compiler, and
then run

.. code-block:: text

        Terminal> python setup.py install build --compiler=ming32

This may give problems of the type

.. code-block:: text

        File "C:\Anaconda\lib\site-packages\numpy\distutils\fcompiler\gnu.py",
        line 333, in get_libraries
        raise NotImplementedError(...)
        NotImplementedError: Only MS compiler supported with gfortran on win64

A remedy is to edit the ``gnu.py`` file and comment out the
``NotImplementedError``:

.. code-block:: python

        else:
            #raise NotImplementedError("Only MS compiler ...")
            pass

The Odespy package depends on several additional packages:

 * `scipy <http://scipy.org/>`__ for running the ``Vode`` Adams/BDF solver, the
   Dormand-Prince adaptive methods ``Dop853``, and ``Dopri5``,
   and the ``scipy`` wrapper ``odeint`` of the FORTRAN code ``LSODA``
   (Odespy features an alternative wrapper of the latter, in class ``Lsoda``).

 * `sympy <http://sympy.org/en/index.html>`__ for running the
   extremely accurate ``odefun_sympy`` solver.

 * `odelab <https://github.com/olivierverdier/odelab>`__
   for accessing solvers in that package.

For plotting you will need `matplotlib <http://matplotlib.sourceforge.net/>`__ or `scitools <https://github.com/hplgit/scitools/>`__.

These packages are readily downloaded and installed by the
standard ``setup.py`` script, as shown above.
On Ubuntu and other Debian-based Linux systems the following
line installs all that Odespy may need:

.. code-block:: text

        sudo apt-get install python-scipy python-nose python-sympy \ 
             python-matplotlib python-scitools python-pip

The ``odelab`` package is installed by either

.. code-block:: text

        pip install -e git+https://github.com/olivierverdier/odelab#egg=odelab

or downloading the source and running ``setup.py``:

.. code-block:: text

        git clone git://github.com/olivierverdier/odelab.git
        cd odelab
        sudo python setup.py install


.. admonition:: Note

   Despite Odespy's many dependencies on other software, you can run the
   basic solvers implemented in pure Python without any additional software
   packages.


.. !split

Basic Usage
===========

This section explains how to use Odespy. The general principles and
program steps are first explained. Thereafter, we present a series of
examples with progressive complexity with respect to Python constructs
and numerical methods.

Overview
--------

A code using Odespy to solve ODEs consists of six steps. These are
outlined in generic form below.

Step 1
~~~~~~

*Write the ODE problem in generic form* :math:`u' = f(u, t)`,
where :math:`u(t)` is the unknown function to be solved for, or a vector
of unknown functions of time in case of a system of ODEs.

Step 2
~~~~~~

*Implement the right-hand side function* :math:`f(u, t)` as a Python function
``f(u, t)``.  The argument ``u`` is either a ``float`` object, in case of a
scalar ODE, or a ``numpy`` array object, in case of a system of ODEs.
Some solvers in this package also allow implementation of :math:`f` in
FORTRAN for increased efficiency.

Step 3
~~~~~~

*Create a solver object*

.. code-block:: python

        solver = classname(f)

where ``classname`` is the name of a class in this package implementing
the desired numerical method.

Many solver classes has a range of parameters that the user can set to
control various parts of the solution process. The parameters are
documented in the doc string of the class (``pydoc classname`` will list
the documentation in a terminal window). One can either specify parameters
at construction time, via extra keyword arguments to the constructor,

.. code-block:: python

        solver = classname(f, prm1=value1, prm2=value2, ...)

or at any time using the ``set`` method:

.. code-block:: python

        solver.set(prm1=value1, prm2=value2, prm3=value3)
        ...
        solver.set(prm4=value4)

Step 4
~~~~~~

*Set the initial condition* :math:`u(0)=U_0`,

.. code-block:: python

        solver.set_initial_condition(U0)

where ``U0`` is either a number, for a scalar ODE, or a sequence (list, tuple,
``numpy`` array), for a system of ODEs.

Step 5
~~~~~~

*Solve the ODE problem*, which means to compute :math:`u(t)` at
some discrete user-specified time points :math:`t_1, t_2, \ldots, t_N`.

.. code-block:: python

        T = ...  # end time
        time_points = numpy.linspace(0, T, N+1)
        u, t = solver.solve(time_points)

In case of a scalar ODE, the returned solution ``u`` is a one-dimensional
``numpy`` array where ``u[i]`` holds the solution at time point ``t[i]``.
For a system of ODEs, the returned ``u`` is a two-dimensional ``numpy``
array where ``u[i,j]`` holds the solution of the :math:`j`-th unknown
function at the :math:`i`-th time point ``t[i]`` (:math:`u_j(t_i)` in mathematics
notation).

By giving the parameter ``disk_storage=True`` to the solver's constructor,
the returned ``u`` array is memory mapped (i.e., of type ``numpy.memmap``)
such that all the data are stored on file, but parts of the array can
be efficiently accessed.

The ``time_points`` array specifies the time points where we want the
solution to be computed. The returned array ``t`` is the same as
``time_points``.  The simplest numerical methods in the Odespy
package apply the ``time_points`` array directly for the time stepping.
That is, the time steps used are given by

.. code-block:: python

        time_points[i] - time_points[i-1]  #  i=0,1,...,len(time_points)-1

The adaptive schemes typically compute between each time point in
the ``time_points`` array, making this array a specification where
values of the unknowns are desired.

The ``solve`` method in solver classes also allows a second argument,
``terminate``, which is a user-implemented Python function specifying
when the solution process is to be terminated. For example,
terminating when the solution reaches an asymptotic (known) value
``a`` can be done by

.. code-block:: python

        def terminate(u, t, step_no):
            # u and t are arrays. Most recent solution is u[step_no].
            tolerance = 1E-6
            return abs(u[step_no] - a) < tolerance
        
        u, t = solver.solve(time_points, terminate)

The arguments transferred to the ``terminate`` function are the
solution array ``u``, the corresponding time points ``t``, and
an integer ``step_no`` reflecting the most recently computed ``u``
value. That is, ``u[step_no]`` is most recently computed value of :math:`u`.
(The array data ``u[step_no+1:]`` will typically be zero as these
are uncomputed future values.)

Step 6
~~~~~~

*Extract solution components* for plotting and further analysis.
Since the ``u`` array returned from ``solver.solve`` stores all unknown
functions at all discrete time levels, one usually wants to extract
individual unknowns as one-dimensional arrays. Here is an example
where unknown number :math:`0` and :math:`k` are extracted in individual arrays
and plotted:

.. code-block:: python

        u_0 = u[:,0]
        u_k = u[:,k]
        
        from matplotlib.pyplot import plot, show
        plot(t, u_0, t, u_k)
        show()

.. _ode:sec:exgr:

First Example: Logistic Growth
------------------------------

Our first example concerns the simple scalar ODE problem

.. math::
        
        \frac{du}{dt}=au\left(1-\frac{u}{R}\right),\quad u(0)=A,
        

where :math:`A>0`, :math:`a>0`, and :math:`R>0` are known constants. This is
a common model for population dynamics in ecology where :math:`u` is the
number of individuals, :math:`a` the initial growth rate, :math:`R` is the
maximum number of individuals that the environment allows (the so-called
*carrying capacity* of the environment).

Using a standard
Runge-Kutta method of order four, the code for solving the problem in
the time interval :math:`[0,10]` with :math:`N=30` time steps, looks like this
(program `logistic1.py <https://github.com/hplgit/odespy/blob/master/doc/src/tutorial/src-odespy/logistic1.py>`__):

.. code-block:: python

        def f(u, t):
            return a*u*(1 - u/R)
        
        a = 2
        R = 1E+5
        A = 1
        
        import odespy
        solver = odespy.RK4(f)
        solver.set_initial_condition(A)
        
        from numpy import linspace, exp
        T = 10  # end of simulation
        N = 30  # no of time steps
        time_points = linspace(0, T, N+1)
        u, t = solver.solve(time_points)

With the ``RK4`` method and other non-adaptive methods
the time steps are dictated by the ``time_points`` array.
A constant time step of size is implied in the present example.
Running an alternative numerical method just means replacing ``RK4`` by, e.g.,
``RK2``, ``ForwardEuler``, ``BackwardEuler``, ``AdamsBashforth2``, etc.

We can easily plot the numerical solution and compare with the exact
solution (which is known for this equation):

.. code-block:: python

        def u_exact(t):
            return R*A*exp(a*t)/(R + A*(exp(a*t) - 1))
        
        from matplotlib.pyplot import *
        
        plot(t, u, 'r-',
             t, u_exact(t), 'bo')
        savefig('tmppng'); savefig('tmp.pdf')
        show()

.. figure:: logistic1.png
   :width: 400

   Solution of the logistic equation with the 4-th order Runge-Kutta method (solid line) and comparison with the exact solution (dots)

All the examples in this tutorial are found in the GitHub directory
`<https://github.com/hplgit/odespy/blob/master/doc/src/tutorial/src-odespy/>`_.
If you download the tarball or clone the GitHub repository, the examples
reside in the directory ``doc/src/odespy/src-odespy``.

.. _ode:sec:exgr:farg:

Parameters in the Right-Hand Side Function
------------------------------------------

The right-hand side function and all physical parameters are often
lumped together in a class, for instance,

.. code-block:: python

        class Logistic:
            def __init__(self, a, R, A):
                self.a = a
                self.R = R
                self.A = A
        
            def f(self, u, t):
                a, R = self.a, self.R  # short form
                return a*u*(1 - u/R)
        
            def u_exact(self, t):
                a, R, A = self.a, self.R, self.A  # short form
                return R*A*exp(a*t)/(R + A*(exp(a*t) - 1))

Note that introducing local variables like ``a`` and ``R``, instead of
using ``self.a`` and ``self.A``, makes the code closer to the mathematics.
This can be convenient when proof reading the implementation of
complicated ODEs.

The numerical solution is computed by

.. code-block:: python

        import odespy
        problem = Logistic(a=2, R=1E+5, A=1)
        solver = odespy.RK4(problem.f)
        solver.set_initial_condition(problem.A)
        
        T = 10  # end of simulation
        N = 30  # no of time steps
        time_points = linspace(0, T, N+1)
        u, t = solver.solve(time_points)

The complete program is available in the file program `logistic2.py <https://github.com/hplgit/odespy/blob/master/doc/src/odespy/src-odespy/logistic2.py>`__.

Instead of having the problem parameters ``a`` and ``R`` in the ODE as
global variables or in a class, we may include them as extra arguments
to ``f``, either as positional arguments or as keyword
arguments. Positional arguments can be sent to ``f`` via the constructor
argument ``f_args`` (a list/tuple of variables), while a dictionary
``f_kwargs`` is used to transfer keyword arguments to ``f`` via the
constructor. Here is an example on using keyword arguments:

.. code-block:: python

        def f(u, t, a=1, R=1):
            return a*u*(1 - u/R)
        
        A = 1
        
        import odespy
        solver = odespy.RK4(f, f_kwargs=dict(a=2, R=1E+5))

In general, a mix
of positional and keyword arguments can be used in ``f``:

.. code-block:: python

        def f(u, t, arg1, arg2, arg3, ..., kwarg1=val1, kwarg2=val2, ...):
            ...
        
        solver = odespy.classname(f,
                            f_args=[arg1, arg2, arg3, ...],
                            f_kwargs=dict(kwarg1=val1, kwarg2=val2, ...))
        
        # Alternative setting of f_args and f_kwargs
        solver.set(f_args=[arg1, arg2, arg3, ...],
                   f_kwargs=dict(kwarg1=val1, kwarg2=val2, ...))

Solvers will call ``f`` as ``f(u, t, *f_args, **f_kwargs)``.

Continuing a Previous Simulation
--------------------------------

It is easy to simulate for some time interval :math:`[0, T_1]`,
then continue with :math:`u(T_1)` as new initial condition and simulate for
:math:`t` in :math:`[T_1, T_2]` and so on. Let us divide the time
domain into subdomains and compute the solution for
each subdomain in sequence. The following program performs the steps
(`logistic4.py <https://github.com/hplgit/odespy/blob/master/doc/src/odespy/src-odespy/logistic4.py>`__).

.. code-block:: python

        def f(u, t, a=1, R=1):
            return a*u*(1 - u/R)
        
        A = 1
        
        import odespy, numpy
        from matplotlib.pyplot import plot, hold, show, axis
        
        solver = odespy.RK4(f, f_kwargs=dict(a=2, R=1E+5))
        
        # Split time domain into subdomains and
        # integrate the ODE in each subdomain
        T = [0, 1, 4, 8, 12]     # subdomain boundaries
        
        N_tot = 30               # total no of time steps
        dt = float(T[-1])/N_tot  # time step, kept fixed
        u = []; t = []           # collectors for u and t in each domain
        
        for i in range(len(T)-1):
            T_interval = T[i+1] - T[i]
            N = int(round(T_interval/dt))
            time_points = numpy.linspace(T[i], T[i+1], N+1)
        
            solver.set_initial_condition(A)  # at time_points[0]
            print 'Solving in [%s, %s] with %d intervals' % \ 
                  (T[i], T[i+1], N)
            ui, ti = solver.solve(time_points)
            A = ui[-1]  # newest ui value is next initial condition
        
            plot(ti, ui)
            hold('on')
        
            u.append(ui);  t.append(ti)
        
        axis([0, T[-1], -0.1E+5, 1.1E+5])
        # Can concatenate all the elements of u and t, if desired
        u = numpy.concatenate(u);  t = numpy.concatenate(t)
        savefig('tmppng'); savefig('tmp.pdf')

Termination Criterion for the Simulation
----------------------------------------

We know that the solution :math:`u` of the logistic equation approaches
:math:`R` as :math:`t\rightarrow\infty`. Instead of
using a trial and error process for determining an appropriate
time integral for integration, the ``solver.solve`` method accepts
a user-defined function ``terminate`` that can be used to implement
a criterion for terminating the solution process.
Mathematically, the relevant criterion is
:math:`||u-R|| < \hbox{tol}`, where tol is an acceptable
tolerance, say :math:`100` in the present case where :math:`R=10^5`.
The ``terminate`` function implements the criterion
and returns true if the criterion is met:

.. code-block:: python

        def terminate(u, t, step_no):
            """u[step_no] holds (the most recent) solution at t[step_no]."""
            return abs(u[step_no] - R) < tol

Note that the simulation is anyway stopped for :math:`t > T` so :math:`T`
must be large enough for the termination criterion to be reached (if not,
a warning will be issued).
With a ``terminate`` function it is also convenient to specify the
time step ``dt`` and not the total number of time steps.

A complete program can be as follows (`logistic5.py <https://github.com/hplgit/odespy/blob/master/doc/src/odespy/src-odespy/logistic5.py>`__):

.. code-block:: python

        def f(u, t):
            return a*u*(1 - u/R)
        
        a = 2
        R = 1E+5
        A = 1
        
        import odespy, numpy
        solver = odespy.RK4(f)
        solver.set_initial_condition(A)
        
        T = 20  # end of simulation
        dt = 0.25
        N = int(round(T/dt))
        time_points = numpy.linspace(0, T, N+1)
        
        tol = 100   # tolerance for termination criterion
        
        def terminate(u, t, step_no):
            """u[step_no] holds (the most recent) solution at t[step_no]."""
            return abs(u[step_no] - R) < tol
        
        u, t = solver.solve(time_points, terminate)
        print 'Final u(t=%g)=%g' % (t[-1], u[-1])
        
        from matplotlib.pyplot import *
        plot(t, u, 'r-')
        savefig('tmppng'); savefig('tmp.pdf')
        show()

A Class-Based Implementation
----------------------------

The previous code example can be recast into a more class-based
("object-oriented programming") example. We lump all data related
to the problem (the "physics") into a problem class ``Logistic``, while
all data related to the numerical solution and its quality are
taken care of by class ``Solver``. The code below illustrates
the ideas (`logistic6.py <https://github.com/hplgit/odespy/blob/master/doc/src/odespy/src-odespy/logistic26py>`__):

.. code-block:: python

        import numpy as np
        import matplotlib.pyplot as plt
        import odespy
        
        class Logistic:
            def __init__(self, a, R, A, T):
                """
                a` is (initial growth rate), `R` the carrying capacity,
                `A` the initial amount of u, and `T` is some (very) total
                simulation time when `u` is very close to the asymptotic
                value `R`.
                """
                self.a, self.R, self.A = a, R, A
                self.tol = 0.01*R # tolerance for termination criterion
        
            def f(self, u, t):
                """Right-hand side of the ODE."""
                a, R = self.a, self.R  # short form
                return a*u*(1 - u/R)
        
            def terminate(self, u, t, step_no):
                """u[step_no] holds solution at t[step_no]."""
                return abs(u[step_no] - self.R) < self.tol
        
            def u_exact(self, t):
                a, R, A = self.a, self.R, self.A  # short form
                return R*A*np.exp(a*t)/(R + A*(np.exp(a*t) - 1))
        
        
        class Solver:
            def __init__(self, problem, dt, method='RK4'):
                self.problem = problem
                self.dt = dt
                self.method_class = eval('odespy.' + method)
                self.N = int(round(T/dt))
        
            def solve(self):
                self.solver = self.method_class(self.problem.f)
                self.solver.set_initial_condition(self.problem.A)
                time_points = np.linspace(0, self.problem.T, self.N+1)
                self.u, self.t = self.solver.solve(
                    time_points, self.problem.terminate)
                print 'Final u(t=%g)=%g' % (t[-1], u[-1])
        
            def plot(self):
                plt.plot(self.t, self.u, 'r-',
                         self.t, self.u_exact(self.t), 'bo')
                plt.legend(['numerical', 'exact'])
                plt.savefig('tmp.png'); plt.savefig('tmp.pdf')
                plt.show()
        
        def main():
            problem = Logistic(a=2, R=1E+5, A=1, T=20)
            solver = Solver(problem, dt=0.25, method='RK4')
            solver.solve()
            solver.plot()
        
        if __name__ == '__main__':
            main()

Using Other Symbols
-------------------

The Odespy package applies ``u`` for the unknown function or vector of
unknown functions and ``t`` as the name of the independent
variable. Many problems involve other symbols for functions and
independent variables. These symbols should be reflected in the user's
code.  For example, here is a coding example involving the logistic
equation written as :math:`y'(x)=au(x)(1-u(x)/R(x))`, where now a variable
:math:`R=R(x)` is considered. Following the setup from the very first
program above solving the logistic ODE, we can easily introduce our
own nomenclature (`logistic7.py <https://github.com/hplgit/odespy/blob/master/doc/src/odespy/src-odespy/logistic7.py>`__):

.. code-block:: python

        def f(y, x):
            return a*y*(1 - y/R)
        
        a = 2;  R = 1E+5;  A = 1
        
        import odespy, numpy
        solver = odespy.RK4(f)
        solver.set_initial_condition(A)
        
        L = 10  # end of x domain
        N = 30  # no of time steps
        x_points = numpy.linspace(0, L, N+1)
        y, x = solver.solve(x_points)
        
        from matplotlib.pyplot import *
        plot(x, y, 'r-')
        xlabel('x'); ylabel('y')
        show()

As shown, we use ``y`` for ``u``, ``x`` for ``t``, and ``x_points`` instead
of ``time_points``.

.. _ode:sec:ex:osc:

Example: Solving an ODE System
------------------------------

We shall now explain how to solve a system of ODEs using a scalar
second-order ODE as starting point.
The angle :math:`\theta` of a pendulum with mass :math:`m` and length :math:`L`
is governed by the equation
(neglecting air resistance for simplicity)

.. math::
        
        mL\ddot\theta + mg\sin\theta = 0,\quad \theta (0)=\Theta,\ 
        \dot\theta (0)=0 .
        

A dot over :math:`\theta` implies differentiation with respect to time.
The ODE can be written as :math:`\ddot\theta + c\sin\theta=0` by
introducing :math:`c = g/L`.

This problem must be expressed as a first-order ODE system if it is
going to be solved by the tools in the Odespy package.
Introducing :math:`\omega = \dot\theta` (the angular velocity) as auxiliary
unknown, we get the system

.. math::
        
        \dot\theta &= \omega,\\ 
        \dot\omega &= -c\sin\theta,
        

with :math:`\theta(0)=\Theta` and :math:`\omega(0)=0`.

Now the ``f`` function must return a list or array with the two
right-hand side functions:

.. code-block:: python

        def f(u, t):
            theta, omega = u
            return [omega, -c*sin(theta)]

Note that when we have a system of ODEs with ``n`` components, the ``u``
object sent to the ``f`` function is an array of length ``n``,
representing the value of all components in the ODE system at time ``t``.
Here we extract the two components of ``u`` in separate local variables
with names equal to what is used in the mathematical description of
the current problem.

The initial conditions must be specified as a list:

.. code-block:: python

        solver = odespy.Heun(f)
        solver.set_initial_condition([Theta, 0])

To specify the time points we here first decide on a number of periods
(oscillations back and forth) to simulate and then on the time resolution
of each period. Note that when :math:`\Theta` is small we can replace
:math:`\sin\theta` by :math:`\theta` and find an analytical
solution
:math:`\theta (t)=\Theta\cos\left(\sqrt{c}t\right)`
whose period is :math:`2\pi/\sqrt{c}`. We use this expression
as an approximation for the period also when :math:`\Theta` is not
small.

.. code-block:: python

        freq = sqrt(c)      # frequency of oscillations when Theta is small
        period = 2*pi/freq  # the period of the oscillations
        T = 10*period       # final time
        N_per_period = 20   # resolution of one period
        N = N_per_period*period
        time_points = numpy.linspace(0, T, N+1)
        
        u, t = solver.solve(time_points)

The ``u`` returned from ``solver.solve`` is a two-dimensional array, where the
columns hold the various solution functions of the ODE system. That is,
the first column holds :math:`\theta` and the second column holds
:math:`\omega`. For convenience we extract the individual solution
components in individual arrays:

.. code-block:: python

        theta = u[:,0]
        omega = u[:,1]
        
        from matplotlib.pyplot import *
        plot(t, theta, 'r-')
        savefig('tmppng'); savefig('tmp.pdf')
        show()

The complete program is available in the file `osc1a.py <https://github.com/hplgit/odespy/blob/master/doc/src/odespy/src-odespy/osc1a.py>`__.

Looking at the plot reveals that the numerical solution has
an alarming feature: the amplitude grows (indicating increasing
energy in the system). Changing ``T`` to 28 periods instead of 10
makes the numerical solution explode.
The increasing amplitude is a numerical artifact that some of the simple
solution methods suffer from.

.. figure:: osc1a.png
   :width: 500

   Heun's method used to simulate oscillations of a pendulum

Using a more sophisticated method, say the 4-th order Runge-Kutta method,
is just a matter of substituting ``Heun`` by ``RK4``:

.. code-block:: python

        solver = odespy.RK4(f)
        solver.set_initial_condition([Theta, 0])
        freq = sqrt(c)      # frequency of oscillations when Theta is small
        period = 2*pi/freq  # the period of the oscillations
        T = 10*period       # final time
        N_per_period = 20   # resolution of one period
        N = N_per_period*period
        time_points = numpy.linspace(0, T, N+1)
        
        u, t = solver.solve(time_points)
        
        theta = u[:,0]
        omega = u[:,1]
        
        from matplotlib.pyplot import *
        plot(t, theta, 'r-')
        savefig('tmppng'); savefig('tmp.pdf')
        show()

The amplitude now becomes (almost) constant in time as expected.
Another very good and popular method for this problem is presented next.

The Euler-Cromer Method
-----------------------

Physicists will most likely
solve the model problem in the section :ref:`ode:sec:ex:osc` by
the Euler-Cromer method. For a single degree of freedom system,

.. math::
         \ddot x + f(x, \dot x, t) = 0,

typically modeling an oscillatory system, the Euler-Cromer method
writes the system as two ODEs,

.. math::
        
        \dot v &= -g(x, v, t),\\ 
        \dot x &= v\thinspace .
        

A Forward Euler scheme is used for the first equation, while a Backward
Euler scheme is used for the second:

.. math::
        
        v^{n+1} &= v^n - \Delta t\, g(x^n, v^n, n\Delta t),\\ 
        x^{n+1} &= x^n + \Delta t\,v^{n+1}\thinspace .
        

The two first-order methods used in this symmetric fashion results in
a second-order method that will preserve the amplitude of the oscillations.

For general multi degree of freedom systems, we have some vector ODE
arising from, typically, Newton's second law of motion,

.. math::
         \ddot{\boldsymbol{r}} + \boldsymbol{g}(\boldsymbol{r},\dot{\boldsymbol{r}},t) = 0\thinspace .

This is rewritten as

.. math::
        
        \dot{\boldsymbol{v}} &= -\boldsymbol{g}(\boldsymbol{r}, \boldsymbol{v}, t),\\ 
        \dot{\boldsymbol{r}} &= \boldsymbol{v},
        

and discretized as

.. math::
        
        \boldsymbol{v}^{n+1} &= \boldsymbol{v}^n -\Delta t\,\boldsymbol{g}(\boldsymbol{r}^n, \boldsymbol{v}^n, t),\\ 
        \boldsymbol{r}^{n+1} &= \boldsymbol{r}^n + \Delta t\,\boldsymbol{v}^{n+1}\thinspace .
        

The convention in Odespy is to group all the unknowns as velocity and position
for each degree of freedom. That is, if the component form of :math:`\boldsymbol{r}`
and :math:`\boldsymbol{v}` is written as

.. math::
         \boldsymbol{r} = (r^{(0)}, r^{(0)}, \ldots, r^{(N)}),
        \quad \boldsymbol{v} = (v^{(0)}, v^{(0)}, \ldots, v^{(N)}),
        

the :math:`u` vector of all unknowns in the Euler-Cromer method in Odespy
must be

.. math::
         u=(v^{(0)}, r^{(0)}, v^{(1)}, r^{(1)}, \ldots,v^{(N)}, r^{(N)})\thinspace .

The corresponding set of ODEs are

.. math::
        
        \dot v^{(0)} &= -g^{(0)}(\boldsymbol{r}, \boldsymbol{v}, t),\\ 
        \dot r^{(0)} &= v^{(0)},\\ 
        \dot v^{(1)} &= -g^{(1)}(\boldsymbol{r}, \boldsymbol{v}, t),\\ 
        \dot r^{(1)} &= v^{(1)},\\ 
        &\cdots\\ 
        \dot v^{(N)} &= -g^{(N)}(\boldsymbol{r}, \boldsymbol{v}, t),\\ 
        \dot r^{(N)} &= v^{(N)}\thinspace .
        

For the particular case of a pendulum we write our system as

.. math::
        
        \dot\omega &= -c\sin\theta,\\ 
        \dot\theta &= \omega,
        

and let :math:`u=(\omega, \theta)`. The relevant right-hand side function becomes

.. code-block:: python

        def f(u, t):
            omega, theta = u
            return [-c*sin(theta), omega]

With some imports,

.. code-block:: python

        import odespy
        from numpy import *
        from matplotlib.pyplot import *

we can write the rest of the program in a standard fashion:

.. code-block:: python

        c = 1
        Theta0_degrees = 30
        
        solver = odespy.EulerCromer(f)
        Theta0 = Theta0_degrees*pi/180
        solver.set_initial_condition([0, Theta0])
        # Solve for num_periods periods using formulas for small theta
        freq = sqrt(c)          # frequency of oscillations
        period = 2*pi/freq      # one period
        N = 40                  # intervals per period
        dt = period/N           # time step
        num_periods = 10
        T = num_periods*period  # total simulation time
        
        time_points = linspace(0, T, num_periods*N+1)
        u, t = solver.solve(time_points)
        
        # Extract components and plot theta
        theta = u[:,1]
        omega = u[:,0]
        theta_linear = lambda t: Theta0*cos(sqrt(c)*t)
        plot(t, theta, t, theta_linear(t))
        legend(['Euler-Cromer', 'Linearized problem'], loc='lower left')

With :math:`\Theta_0` as 30 degrees, the fully nonlinear solution is
slightly out of phase with the solution of the linearized
problem :math:`\ddot\theta + c\theta =0`, see Figure :ref:`ode:sec:EC:fig1`.
As :math:`\Theta_0\rightarrow 0`, the two curves approach each other.
The Euler-Cromer method is significantly better than Heun's method
used in the previous section and reproduces the exact amplitude.

.. _ode:sec:EC:fig1:

.. figure:: osc_EC_30.png
   :width: 500

   *Euler-Cromer method applied to the pendulum problem*

Testing Several Methods
-----------------------

[**hpl 1**: After Euler-Cromer, change the order of the ODEs and unknowns!]

We shall now make a more advanced solver by
extending the pendulum example. More specifically, we shall

  * represent the right-hand side function as class,

  * compare several different solvers,

  * compute error of numerical solutions.

Since we want to compare numerical errors in the various
solvers we need a test problem where the exact solution is known.
Approximating :math:`\sin(\theta)` by :math:`\theta`
(valid for small :math:`\theta`), gives the ODE system

.. math::
        
        \dot\theta &= \omega,\\ 
        \dot\omega &= -c\theta,
        

with :math:`\theta(0)=\Theta` and :math:`\omega(0)=0`.

Right-hand side functions with parameters can be handled by
including extra arguments via the ``f_args`` and ``f_kwargs`` functionality,
or by using a class where the parameters are attributes and
an ``f`` method defines :math:`f(u,t)`.
The section :ref:`ode:sec:exgr:farg` exemplifies the details.
A minimal class representation of the right-hand side
function in the present case looks like this:

.. code-block:: python

        class Problem:
            def __init__(self, c, Theta):
                self.c, self.Theta = float(c), float(Theta)
        
            def f(self, u, t):
                theta, omega = u;  c = self.c
                return [omega, -c*theta]
        
        problem = Problem(c=1, Theta=pi/4)

It would be convenient to add an attribute ``period`` which holds
an estimate of the period of oscillations as we need this for
deciding on the complete time interval for solving the differential
equations. An appropriate extension of class ``Problem`` is therefore

.. code-block:: python

        class Problem:
            def __init__(self, c, Theta):
                self.c, self.Theta = float(c), float(Theta)
        
                self.freq = sqrt(c)
                self.period = 2*pi/self.freq
        
            def f(self, u, t):
                theta, omega = u;  c = self.c
                return [omega, -c*theta]
        
        problem = Problem(c=1, Theta=pi/4)

The second extension is to loop over many solvers. All
solvers can be listed by

.. code-block:: ipy

        >>> import odespy
        >>> methods = list_all_solvers()
        >>> for method in methods:
        ...   print method
        ...
        AdamsBashMoulton2
        AdamsBashMoulton3
        AdamsBashforth2
        ...
        Vode
        lsoda_scipy
        odefun_sympy
        odelab

A similar function, ``list_available_solvers``, returns a list of the
names of the solvers that are available in the current installation
(e.g., the ``Vode`` solver is only available if the comprehensive
``scipy`` package is installed).
This is the list that is usually most relevant.

For now we explicitly choose a subset of the commonly available solvers:

.. code-block:: python

        import odespy
        solvers = [
            odespy.ThetaRule(problem.f, theta=0),   # Forward Euler
            odespy.ThetaRule(problem.f, theta=0.5), # Midpoint method
            odespy.ThetaRule(problem.f, theta=1),   # Backward Euler
            odespy.RK4(problem.f),
            odespy.MidpointIter(problem.f, max_iter=2, eps_iter=0.01),
            odespy.LeapfrogFiltered(problem.f),
            ]

To see what a method is and its arguments to the constructor, invoke
the doc string of the class, e.g., ``help(ThetaRule)`` inside a
Python shell like IPython, or run ``pydoc odespy.ThetaRule`` in a
terminal window, or invoke the `Odespy API documentation <http://hplgit.github.com/odespy/doc/api/>`__.

It will be evident that the ``ThetaRule`` solver with ``theta=0`` and
``theta=1`` (Forward and Backward Euler methods) gives growing and
decaying amplitudes, respectively, while the other solvers are
capable of reproducing the constant amplitude of the oscillations of
in the current mathematical model.

The loop over the chosen solvers may look like

.. code-block:: python

        N_per_period = 20
        T = 3*problem.period   # final time
        import numpy
        import matplotlib.pyplot as plt
        legends = []
        
        for solver in solvers:
            solver_name = str(solver)  # short description of solver
            print solver_name
        
            solver.set_initial_condition([problem.Theta, 0])
            N = N_per_period*problem.period
            time_points = numpy.linspace(0, T, N+1)
        
            u, t = solver.solve(time_points)
        
            theta = u[:,0]
            legends.append(solver_name)
            plt.plot(t, theta)
            plt.hold('on')
        plt.legend(legends)
        plotfile = __file__[:-3]
        plt.savefig(plotfile + '.png'); plt.savefig(plotfile + '.pdf')
        plt.show()

A complete program is available as `osc2.py <https://github.com/hplgit/odespy/blob/master/doc/src/odespy/src-odespy/osc2.py>`__.

.. figure:: osc2.png
   :width: 500

   Comparison of methods for solving the ODE system for a pendulum

We can extend this program to compute the error in each numerical
solution for different time step sizes.
Let ``results`` be a dictionary with the method name as
key, containing two sub dictionaries ``dt`` and ``error``, which hold
a sequence of time steps and a sequence of corresponding
errors, respectively. The errors are computed by subtracting
the numerical solution from the exact solution,

.. code-block:: python

        theta_exact = lambda t: problem.Theta*numpy.cos(sqrt(problem.c)*t)
        u, t = solver.solve(time_points)
        theta = u[:,0]
        error = numpy.abs(theta_exact(t) - theta)

The so-called L2 norm of the ``error`` array is a suitable
scalar error measure (square root of total error squared and integrated,
here numerically):

.. code-block:: python

        error_L2 = sqrt(numpy.sum(error**2)/dt)

where ``dt`` is the time step size.

Typical loops over solvers and resolutions look as follows (`osc3.py <https://github.com/hplgit/odespy/blob/master/doc/src/odespy/src-odespy/osc3.py>`__):

.. code-block:: python

        try:
            num_periods = int(sys.argv[1])
        except IndexError:
            num_periods = 8  # default
        
        T = num_periods*problem.period       # final time
        results = {}
        resolutions = [10, 20, 40, 80, 160]  # intervals per period
        import numpy
        
        for solver in solvers:
            solver_name = str(solver)
            results[solver_name] = {'dt': [], 'error': []}
        
            solver.set_initial_condition([problem.Theta, 0])
        
            for N_per_period in resolutions:
                N = N_per_period*num_periods
                time_points = numpy.linspace(0, T, N+1)
        
                u, t = solver.solve(time_points)
        
                theta = u[:,0]
                error = numpy.abs(theta_exact(t) - theta)
                error_L2 = sqrt(numpy.sum(error**2)/N)
                if not numpy.isnan(error_L2):  # drop nan (overflow)
                    results[solver_name]['dt'].append(t[1] - t[0])
                    results[solver_name]['error'].append(error_L2)

Assuming the error to be of the form :math:`C\Delta t^r`, we can estimate
:math:`C` and :math:`r` from two consecutive experiments to obtain a sequence
of :math:`r` values which (hopefully) convergences to a value that we can
view as the empirical convergence rate of a method.
Given the sequence of time steps and errors, we can compare two
experiments :math:`i` and :math:`i-1`, with errors :math:`E_{i}=C\Delta t_i^r`, and
estimate :math:`r=\ln(E_i/E_{i-1})/\ln(\Delta t_i/\Delta t_{i-1})`:

.. code-block:: python

        from math import log
        print '\n\nConvergence results for %d periods' % num_periods
        for solver_name in results:
            r_h = results[solver_name]['dt']
            r_E = results[solver_name]['error']
            rates = [log(r_E[i]/r_E[i-1])/log(r_h[i]/r_h[i-1]) for i
                     in range(1, len(r_h))]
            # Reformat rates with 1 decimal for rate
            rates = ', '.join(['%.1f' % rate for rate in rates])
            print '%-20s r: %s E_min=%.1E' % \ 
                  (solver_name, rates, min(results[solver_name]['error']))

With 4 periods we get

.. code-block:: text

        ThetaRule(theta=0)   r: 4.2,  2.4, 1.7, 1.3 E_min=1.9E-01
        LeapfrogFiltered     r: 10.3, 0.3, 0.5, 0.7 E_min=1.8E-01
        ThetaRule(theta=1)   r: 0.2,  0.4, 0.6, 0.8 E_min=1.3E-01
        ThetaRule            r: 2.3,  2.0, 2.0, 2.0 E_min=2.1E-03
        RK4                  r: 4.0,  4.0, 4.0, 4.0 E_min=1.6E-07
        Leapfrog             r: 2.2,  2.0, 2.0, 2.0 E_min=2.1E-03
        RK2                  r: 2.3,  2.0, 2.0, 2.0 E_min=2.1E-03
        MidpointIter         r: 2.0,  1.0, 2.0, 2.0 E_min=2.1E-03

The rates of the Forward and Backward Euler methods (1st and 3rd line) have
not yet converged to unity, as expected, while the 2nd-order
Runge-Kutta method, Leapfrog, and the :math:`\theta` rule with
:math:`\theta =0.5`
(``ThetaRule`` with default value of ``theta``) shows the expected
:math:`r=2` value. The 4th-order Runge-Kutta holds the promise of being of 4th
order, while the filtered Leapfrog method has slow convergence and
a fairly large error, which is also evident in the previous figure.

Extending the time domain to 20 periods makes many of the
simplest methods inaccurate and the rates computed on coarse
time meshes are irrelevant. Also in this case, three of the methods
are useless, while the others deliver their promised convergence
rates (Forward Euler, i.e., ``ThetaRule`` with ``theta=0`` is left
out because of ridiculous results):

.. code-block:: text

        LeapfrogFiltered     r: 63.7, 0.0, 0.1, 0.2 E_min=4.3E-01
        ThetaRule(theta=1)   r: 0.0,  0.1, 0.1, 0.3 E_min=3.8E-01
        ThetaRule            r: 3.7,  2.1, 2.0, 2.0 E_min=1.0E-02
        RK4                  r: 4.0,  4.0, 4.0, 4.0 E_min=8.0E-07
        Leapfrog             r: 0.5,  1.9, 2.0, 2.0 E_min=1.0E-02
        RK2                  r: 3.7,  2.1, 2.0, 2.0 E_min=1.0E-02
        MidpointIter         r: 1.8,  0.8, 1.9, 2.0 E_min=1.0E-02

.. !split

More Advanced Implementations
=============================

Make a Subclass of Class Problem
--------------------------------

Odespy features a module ``problems`` for defining ODE problems.
There is a superclass ``Problem`` in this module defining what we
expect of information about an ODE problem, as well as some
convenience functions that are inherited in subclasses.
A rough sketch of class ``Problem`` is listed here:

.. code-block:: python

        class Problem:
            stiff = False    # classification of the problem is stiff or not
            complex_ = False # True if f(u,t) is complex valued
            not_suitable_solvers = []  # list solvers that should be be used
            short_description = ''     # one-line problem description
        
            def __init__(self):
                pass
        
            def __contains__(self, attr):
                """Return True if attr is a method in instance self."""
        
            def terminate(self, u, t, step_number):
                """Default terminate function, always returning False."""
                return False
        
            def default_parameters(self):
                """
                Compute suitable time_points, atol/rtol, etc. for the
                particular problem. Useful for quick generation of test
                cases, demos, unit tests, etc.
                """
                return {}
        
            def u_exact(self, t):
                """Implementation of the exact solution."""
                return None

Subclasses of ``Problem`` typically implements the constructor, for
registering parameters in the ODE and the initial condition, and
a method ``f`` for defining the right-hand side. For implicit solution method
we may provide a method ``jac`` returning the Jacobian of :math:`f(u,t)` with
respect to :math:`u`. Some problems may
also register an analytical solution in ``u_exact``. Here is an
example of implementing the logistic ODE from the section :ref:`ode:sec:exgr`:

.. code-block:: python

        import odespy
        
        class Logistic(odespy.problems.Problem):
            short_description = "Logistic equation"
        
            def __init__(self, a, R, A):
                self.a = a
                self.R = R
                self.U0 = A
        
            def f(self, u, t):
                a, R = self.a, self.R  # short form
                return a*u*(1 - u/R)
        
            def jac(self, u, t):
                a, R = self.a, self.R  # short form
                return a*(1 - u/R) + a*u*(1 - 1./R)
        
            def u_exact(self, t):
                a, R, U0 = self.a, self.R, self.U0  # short form
                return R*U0*numpy.exp(a*t)/(R + U0*(numpy.exp(a*t) - 1))

The ``stiff``, ``complex_``, and ``not_suitable_solvers`` class variables
can just be inherited. Note that ``u_exact`` should work for a vector ``t``
so ``numpy`` versions of mathematical functions must be used.

The initial condition is by convention stored as the attribute ``U0``
in a subclass of ``Problem``, and specified as argument to the constructor.

Here are the typical steps when using such a problem class:

.. code-block:: python

        problem = Logistic(a=2, R=1E+5, A=1)
        solver = odespy.RK4(problem.f)
        solver.set_initial_condition(problem.U0)
        u, t = solver.solve(time_points)

The problem class may also feature additional methods:

.. code-block:: python

        class MyProblem(odespy.problems.Problem)
            ...
            def constraints(self, u, t):
                """Python function for additional constraints: g(u,t)=0."""
        
            def define_command_line_arguments(self, parser):
                """
                Initialize an argparse object for reading command-line
                option-value pairs. `parser` is an ``argparse`` object.
                """
        
            def verify(self, u, t, atol=None, rtol=None):
                """
                Return True if u at time points t coincides with an exact
                solution within the prescribed tolerances. If one of the
                tolerances is None, return max computed error (infinity
                norm). Return None if the solution cannot be verified.
                """

The module ``odespy.problems`` contains many predefined ODE problems.

Example: Solving a Complex ODE Problem
--------------------------------------

Many of the solvers offered by Odespy can deal with complex-valued
ODE problems. Consider

.. math::
         u' = iwu,\quad u(0)=1,

where :math:`i=\sqrt{-1}` is the imaginary unit.
The right-hand side is implemented as ``1j*w*u`` in Python since
Python applies ``j`` as the imaginary unit in complex numbers.

Quick Implementation
~~~~~~~~~~~~~~~~~~~~

For complex-valued ODEs, i.e., complex-valued right-hand side functions
or initial conditions, the argument ``complex_valued=True`` must be
supplied to the constructor. A complete program reads

.. code-block:: python

        def f(u, t):
            return 1j*w*u
        
        import odespy, numpy
        
        w = 2*numpy.pi
        solver = odespy.RK4(f, complex_valued=True)
        solver.set_initial_condition(1+0j)
        u, t = solver.solve(numpy.linspace(0, 6, 101))

The function ``odespy.list_not_suitable_complex_solvers()``
returns a list of all the classes in Odespy that are not suitable
for complex-valued ODE problems.

Comparison of Methods
~~~~~~~~~~~~~~~~~~~~~

We can try three classes that do work for complex-valued ODEs: ``Vode``,
``RK4``, and ``RKFehlberg``. Comparing these with respect to CPU time and
final error for a very long time integration of 600 periods is carried
out by the following program.

.. code-block:: python

        def f(u, t):
            return 1j*w*u
        
        import odespy, numpy, time
        
        w = 2*numpy.pi
        n = 600  # no of periods
        r = 40   # resolution of each period
        tp = numpy.linspace(0, n, n*r+1)
        
        solvers = [odespy.Vode(f, complex_valued=True,
                               atol=1E-7, rtol=1E-6,
                               adams_or_bdf='adams'),
                   odespy.RK4(f, complex_valued=True),
                   odespy.RKFehlberg(f, complex_valued=True,
                                     atol=1E-7, rtol=1E-6)]
        cpu = []
        for solver in solvers:
            solver.set_initial_condition(1+0j)
            t0 = time.clock()
            solver.solve(tp)
            t1 = time.clock()
            cpu.append(t1-t0)
        
        # Compare solutions at the end point:
        exact = numpy.exp(1j*w*tp).real[-1]
        min_cpu = min(cpu); cpu = [c/min_cpu for c in cpu]  # normalize
        print 'Exact: u(%g)=%g' % (tp[-1], exact)
        for solver, cpu_time in zip(solvers, cpu):
            print '%-15s u(%g)=%.6f (error: %10.2E, cpu: %.1f)' % \ 
                  (solver.__class__.__name__,
                   solver.t[-1], solver.u[-1].real,
                   exact - solver.u[-1].real, cpu_time)

We remark that the solution and the corresponding time values can always
be recovered as ``solver.u`` and ``solver.t``, respectively.

The output from the program may read

.. code-block:: text

        Exact: u(600)=1
        Vode            u(600)=1.001587 (error:  -1.59E-03, cpu: 1.0)
        RK4             u(600)=0.997328 (error:   2.67E-03, cpu: 1.3)
        RKFehlberg      u(600)=1.000953 (error:  -9.53E-04, cpu: 7.5)

The ``Vode`` solver is a wrapper of the FORTRAN code ``zvode.f`` in
``scipy.integrate.ode`` and is an adaptive Adams method (with default
settings, as used here),
``RK4`` is a compact and straightforward Runge-Kutta method of order 4 in
pure Python with constant step size, and ``RKFehlberg`` is a pure Python
implementation of the adaptive Runge-Kutta-Fehlberg method of order
(4,5).  These methods give approximately the same final error, but
with different CPU times. We observe that the very simple ``RK4`` solver
in pure Python compares favorably with the much more sophisticated
FORTRAN subroutine ``zvode``.

Avoiding Callbacks to Python
----------------------------

The ODE solvers that are implemented in FORTRAN calls, by default,
the user's Python implementation of :math:`f(u,t)`. Making many calls from
FORTRAN to Python may introduce significant overhead and slow down the
solution process. When the algorithm is implemented in FORTRAN we should
also implement the right-hand side in FORTRAN and call this right-hand
side subroutine directly. Odespy offers this possibility.

The idea is that the user writes a FORTRAN subroutine defining :math:`f(u,t)`.
Thereafter, ``f2py`` is used to make this subroutine callable from Python.
If we specify the Python interface to this subroutine as an ``f_f77``
argument to the solver's constructor, the Odespy class will make sure
that no callbacks to the :math:`f(u,t)` definition go via Python.

The Logistic ODE
~~~~~~~~~~~~~~~~

Here is a minimalistic example involving the logistic ODE from
the section :ref:`ode:sec:exgr`. The FORTRAN implementation of :math:`f(u,t)`
is more complicated than the Python counterpart. The subroutine
has the signature

.. code-block:: fortran

              subroutine f_f77(neq, t, u, udot)
        Cf2py intent(hide) neq
        Cf2py intent(out) udot
              integer neq
              double precision t, u, udot
              dimension u(neq), udot(neq)

This means that there are two additional arguments: ``neq`` for the number
of equations in the ODE system, and ``udot`` for the array of :math:`f(u,t)`
that is output from the subroutine.

We write the FORTRAN implementation of :math:`f(u,t)` in a string:

.. code-block:: python

        a = 2
        R = 1E+5
        
        f_f77_str = """
              subroutine f_f77(neq, t, u, udot)
        Cf2py intent(hide) neq
        Cf2py intent(out) udot
              integer neq
              double precision t, u, udot
              dimension u(neq), udot(neq)
              udot(1) = %.3f*u(1)*(1 - u(1)/%.1f)
              return
              end
        """ % (a, R)

Observe that we can transfer problem parameters to the FORTRAN subroutine
by writing their values directly into the FORTRAN source code.
The other alternative would be to transfer the parameters as global
(COMMON block) variables to the FORTRAN code, which is technically
much more complicated.
Also observe that we need to deal with ``udot`` and ``u`` as arrays even
for a scalar ODE.

Using ``f2py`` to compile the string into a Python module is automated
by the ``odespy.compile_f77`` function:

.. code-block:: python

        import odespy
        f_f77 = odespy.compile_f77(f_f77_str)

The returned object ``f_f77`` is a callable object that allows the
FORTRAN subroutine to be called as ``udot = f_f77(t, u)`` from Python.
(However, the Odespy solvers will not use ``f_f77`` directly, but rather
its function pointer to the FORTRAN subroutine, and transfer this pointer
to the FORTRAN solver. The switching between ``t, u`` and ``u, t`` arguments
is taken care of. All necessary steps are automatically done behind
the scene.)

The solver can be declared as

.. code-block:: python

        solver = odespy.Lsode(f=None, f_f77=f_f77)

Several solvers accept FORTRAN definitions of the right-hand side:
``Lsode``, ``Lsoda``, and the other ODEPACK solvers, ``RKC``, ``RKF45``,
``Radau5``. Look up the documentation of their ``f_f77`` parameter to
see exactly what arguments and conventions that the FORTRAN subroutine
demand.

The file `logistic10.py <https://github.com/hplgit/odespy/blob/master/doc/src/odespy/src-odespy/logistic10.py>`__ contains a complete program for solving the logistic ODE
with :math:`f(u,t)` implemented in Fortran.

Implementing the van der Pol Equation in FORTRAN
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

As a slightly more complicated example, also involving a subclass
of ``Problem`` and computation of the Jacobian and :math:`f(u,t)` in
FORTRAN, we consider the van der Pol equation,

.. math::
         y'' = \mu (1 - y^2) y' - y,\quad y(0)=2,\ y'(0)=1, 

written as a system as shown in the section :ref:`ode:sec:motivation`.
We start by implementing a problem class with Python code for
:math:`f(u,t)` and its Jacobian:

.. code-block:: python

        import odespy
        
        class VanDerPolOscillator(odespy.problems.Problem):
            short_description = "Van der Pol oscillator"
        
            def __init__(self, U0=[2, 1], mu=3.):
                self.U0 = U0
                self.mu = mu
        
            def f(self, u, t):
                u_0, u_1 = u
                mu = self.mu
                return [u_1, mu*(1 - u_0**2)*u_1 - u_0]
        
            def jac(self, u, t):
                u_0, u_1 = u
                mu = self.mu
                return [[0., 1.],
                        [-2*mu*u_0*u_1 - 1, mu*(1 - u_0**2)]]

Now, we want to provide ``f`` and ``jac`` in FORTRAN as well.
The FORTRAN code for :math:`f(u,t)` can be returned from a method:

.. code-block:: python

        class VanDerPolOscillator(odespy.problems.Problem):
            ...
            def str_f_f77(self):
                """Return f(u,t) as Fortran source code string."""
                return """
              subroutine f_f77(neq, t, u, udot)
        Cf2py intent(hide) neq
        Cf2py intent(out) udot
              integer neq
              double precision t, u, udot
              dimension u(neq), udot(neq)
              udot(1) = u(2)
              udot(2) = %g*(1 - u(1)**2)*u(2) - u(1)
              return
              end
        """ % self.mu

While all FORTRAN solvers supported by Odespy so far employ the same
signature for the :math:`f(u,t)` function, different solvers apply different
signatures for the Jacobian. Here are two versions for ODEPACK and
``Radau5``, respectively:

.. code-block:: python

        class VanDerPolOscillator(odespy.problems.Problem):
            ...
            def str_jac_f77_fadau5(self):
                return """
              subroutine jac_f77_radau5(neq,t,u,dfu,ldfu,rpar,ipar)
        Cf2py intent(hide) neq,rpar,ipar
        Cf2py intent(in)   t,u,ldfu
        Cf2py intent(out) dfu
              integer neq,ipar,ldfu
              double precision t,u,dfu,rpar
              dimension u(neq),dfu(ldfu,neq),rpar(*),ipar(*)
              dfu(1,1) = 0
              dfu(1,2) = 1
              dfu(2,1) = -2*%g*u(1)*u(2) - 1
              dfu(2,2) = %g*(1-u(1)**2)
              return
              end
        """ % (self.mu, self.mu)
        
            def str_jac_f77_lsode_dense(self):
                return """
              subroutine jac_f77(neq, t, u, ml, mu, pd, nrowpd)
        Cf2py intent(hide) neq, ml, mu, nrowpd
        Cf2py intent(out) pd
              integer neq, ml, mu, nrowpd
              double precision t, u, pd
              dimension u(neq), pd(nrowpd,neq)
              pd(1,1) = 0
              pd(1,2) = 1
              pd(2,1) = -2*%g*u(1)*u(2) - 1
              pd(2,2) = %g*(1 - u(1)**2)
              return
              end
        """ % (self.mu, self.mu)

For the ``Lsode`` solver we can also provide the Jacobian in banded matrix
format (this is not yet supported for ``Radau5``, but the underlying
FORTRAN code allows a banded Jacobian).

Having some methods returning FORTRAN code, we need to turn
the source code into Python modules. This is done by
``odespy.compile_f77`` in the constructor of class ``VanDerPolOscillator``:

.. code-block:: python

        class VanDerPolOscillator(odespy.problems.Problem):
        
            def __init__(self, U0=[2, 1], mu=3.):
                self.U0 = U0
                self.mu = mu
        
                # Compile F77 functions
                self.f_f77, self.jac_f77_radau5, self.jac_f77_lsode = \ 
                            compile_f77([self.str_f_f77(),
                                         self.str_jac_f77_radau5(),
                                         self.str_jac_f77_lsode()])

The application of this problem class goes as follows:

.. code-block:: python

        problem = VanDerPolOscillator()
        import odespy
        solver = odespy.Radau5(f=None, f=problem.f_f77,
                               jac=problem.jac_f77_radau5)
        solver.set_initial_condition(problem.U0)
        u, t = solver.solve(time_points)

Example: Solving a Stochastic Differential Equation
---------------------------------------------------

We consider an oscillator driven by stochastic white noise:

.. math::
         x''(t) + bx'(t) + cx(t) = N(t),\ x(0)=X,\ x'(0) =0,

where :math:`N(t)` is the white noise function computed numerically as

.. math::
         N(t_i) \approx \sigma\frac{\Delta W_i}{\sqrt{t_{i+1}-t_i}},

where :math:`\Delta W_1,\Delta W_2,\ldots` are independent normally
distributed random variables with mean zero and unit standard
deviation, and :math:`\sigma` is the strength of the noise.  The physical
feature of this problem is that :math:`N(t)` provides an excitation
containing "all" frequencies, but the oscillator is a strong filter:
with low damping one of the frequencies in :math:`N(t)` will hit the
resonance frequency :math:`\sqrt{c}/(2\pi)` which will then dominate the
output signal :math:`x(t)`.

The noise is additive in this stochastic differential equation so
there is no difference between the Ito and Stratonovich interpretations
of the equation.

The challenge with this model problem is that stochastic differential
equations do not fit with the user interface offered by Odespy,
since the right-hand side function is assumed to be dependent only
on the solution and the present time (``f(u,t)``), and additional
user-defined parameters, but for the present problem the right-hand
side function needs information about :math:`N(t)` and hence
the size of the current time step.

We can solve this issue by having a reference to the solver in the
right-hand side function, precomputing :math:`N(t_i)` for all time intervals
:math:`i`, and using the ``n`` attribute in the solver for selecting the right
force term (recall that some methods will call the right-hand side
function many times during a time interval - all these calls must use
the same value of the white noise).

The right-hand side function must do many things so a class is
appropriate:

.. code-block:: python

        class WhiteNoiseOscillator:
            def __init__(self, b, c, sigma=1):
                self.b, self.c, self.sigma = b, c, sigma
        
            def connect_solver(self, solver):
                """Solver is needed for time step number and size."""
                self.solver = solver
        
            def f(self, u, t):
                if not hasattr(self, 'N'):  # is self.N not yet computed?
                    # Compute N(t) for all time intervals
                    import numpy
                    numpy.random.seed(12)
                    t = self.solver.t
                    dW = numpy.random.normal(loc=0, scale=1, size=len(t)-1)
                    dt = t[1:] - t[:-1]
                    self.N = self.sigma*dW/numpy.sqrt(dt)
        
                x, v = u
                N = self.N[self.solver.n]
                return [v, N -self.b*v -self.c*x]

Note that :math:`N(t)` is computed on demand the first time the right-hand side
function is called. We need to wait until the ``f`` method is called since
we need access to the solver instance to compute the ``self.N`` array.

It is easy to compare different methods for solving this stochastic equation:

.. code-block:: python

        problem = WhiteNoiseOscillator(b=0.1, c=pi**2, sigma=1)
        solvers = [odespy.Heun(problem.f), odespy.RK4(problem.f),
                   odespy.ForwardEuler(problem.f)]
        for solver in solvers:
            f.connect_solver(solver)
            solver.set_initial_condition([0,0])  # start from rest
            T = 60   # with c=pi**2, the period is 1
            u, t = solver.solve(linspace(0, T, 10001))
        
            x = u[:,0]
            plot(t, x)
            hold(True)
        
        legend([str(s) for s in solvers])

All solutions are also stored in the solver objects
as attributes ``u`` and ``t``, so we
may easily extract the solution of ``RK4`` by

.. code-block:: python

        solvers[1].u, solvers[1].t

.. figure:: sode1.png
   :width: 500

   Oscillator driven by white noise

The ``Heun`` and ``RK2`` methods give coinciding solutions while
the ``ForwardEuler`` method gives too large amplitudes.
The frequency is 0.5 (period 2) as expected.

In this example the white noise force is computed only once since
the ``f`` instance is reused in all methods. If a new ``f`` is created
for each method, it is crucial that the same seed of the random
generator is used for all methods, so that the time evolution of
the force is always the same - otherwise the solutions will be
different.

The complete code is available in `sode.py <https://github.com/hplgit/odespy/blob/master/doc/src/tutorial/src-odespy/sode1.py>`__.

.. !split

Adaptive Methods
================

The solvers used in the previous examples have all employed a constant
time step :math:`\Delta t`. Many solvers available through the Odespy
interface are adaptive in the sense that :math:`\Delta t` is adjusted
throughout the solution process to meet a prescribed tolerance for
the estimated error.

Simple methods such as ``RK4`` apply time steps

.. code-block:: python

        dt = time_points[k+1] - time_points[k]

while adaptive methods will use several (smaller) time steps than ``dt``
in each ``dt`` interval to ensure that the estimated numerical error is
smaller than some prescribed tolerance. The estimated numerical error
may be a rather crude quantitative measure of the true numerical
error (which we do not know since the exact solution of the problem is
in general not known).

Some adaptive solvers record the intermediate solutions in each ``dt``
interval in arrays ``self.u_all`` and ``self.t_all``.  Examples include
``RKFehlberg``, ``Fehlberg``, ``DormandPrince``, ``CashKarp``, and
``BogackiShampine``. Other adaptive solvers (``Vode``, ``Lsode``, ``Lsoda``,
``RKC``, ``RKF45``, etc.)  do not give access to intermediate solution
steps between the user-given time points, specified in the
``solver.solve`` call, and then we only have access to the solution at
the user-given time points as returned by this call. One can run ``if
solver.has_u_t_all()`` to test if the ``solver.u_all`` and ``solver.t_all``
arrays are available.  These are of interest to see how the adaptive
strategy works between the user-specified time points.

The Test Problem
----------------

We consider the ODE problem for testing adaptive solvers:

.. math::
   :label: gaussian:ode:eq
        
        u' = -\frac{t-c}{s^2} (u-1)
        
        
The exact solution is a Gaussian function,

.. math::
        
        u(t) = 1 + \exp{\left(-\frac{1}{2}\left(\frac{t-c}{s}\right)^2\right)}
        

centered around :math:`t=c` and width characteristic width ("standard
deviation") :math:`s`. The initial condition is taken as the exact :math:`u` at :math:`t=0`.

Since the Gaussian function is significantly different from zero only in the
interval :math:`[c-3s, c+3s]`, one may expect that adaptive methods will
efficiently take larger steps when :math:`u` is almost constant and increase
the resolution when :math:`u` changes substantially in the vicinity
of :math:`t=c`. We can test if this is the case with several solvers.

Running Simple Methods
----------------------

Let us first use a simple standard method like the 2nd- and 4th-order
Runge-Kutta methods with constant step size. With the former method
(``RK2``), :math:`c=3`, :math:`s=0.5`, and :math:`41` uniformly distributed points,
the discrepancy between the numerical and exact solution in
Figure :ref:`gaussian:fig:RK2:41` is substantial. Increasing the number
of points by a factor of 10 gives a solution much closer to the
exact one, and switching to the 4th-order method (``RK4``) makes the
curves visually coincide. The problem is therefore quite straightforward
to solve using a sufficient number of points (400) and a higher-order method
such as ``RK4``.
For curiosity we can mention that the ``ForwardEuler`` method produces
a maximum value of 0.98 with 20,000 points and 0.998 with 200,000 points.

.. _gaussian:fig:RK2:41:

.. figure:: gaussian1_RK2_41.png
   :width: 500

   *2nd-order Runge-Kutta method with 41 points*

A simple program testing one numerical method goes as follows (`gaussian1.py <https://github.com/hplgit/odespy/blob/master/doc/src/odespy/src-odespy/gaussian1.py>`__).

.. code-block:: python

        import odespy, numpy as np, matplotlib.pyplot as plt
        
        center_point = 3
        s = 0.5
        
        problem = odespy.problems.Gaussian1(c=center_point, s=s)
        
        npoints = 41
        tp = np.linspace(0, 2*center_point, npoints)
        
        method = odespy.RK2
        solver = method(problem.f)
        solver.set_initial_condition(problem.U0)
        
        u, t = solver.solve(tp)
        
        method = solver.__class__.__name__
        print '%.4f  %s' % (u.max(), method)
        
        if solver.has_u_t_all():
            plt.plot(solver.t_all, solver.u_all, 'bo',
                     tp, problem.u_exact(tp))
            print '%s used %d steps (%d specified)' % \
                  (method, len(solver.u_all), len(tp))
        else:
            plt.plot(tp, solver.u, tp, problem.u_exact(tp))
        plt.legend([method, 'exact'])
        plt.savefig('tmppng'); plt.savefig('tmp.pdf')
        plt.show()

Running the Runge-Kutta-Fehlberg Method
---------------------------------------

One of the most widely used general-purpose, adaptive methods for
ODE problems is the
Runge-Kutta-Fehlberg method of order (4,5). This method is available
in three alternative implementations in Odespy: a direct Python
version (``RKFehlberg``), a specialization of a generalized
implementation of explicit adaptive Runge-Kutta methods
(``Fehlberg``), and as a FORTRAN code (``RKF45``). We can try one of
these,

.. code-block:: python

        method = odespy.Fehlberg

Figure :ref:`gaussian:fig:Fehlberg:41` shows how ``Fehlberg`` with 40 intervals produces a solution of reasonable accuracy. The dots show the actual computational points used by the algorithm (57 adaptively selected points in time).

.. _gaussian:fig:Fehlberg:41:

.. figure:: gaussian1_Fehlberg_41.png
   :width: 500

   *Adaptive Runge-Kutta-Fehlberg method with 57 points (starting with 41)*

Adaptive algorithms apply an error estimate based on considering a higher-order method as exact, in this case a method of order 5, and a method of lower order (here 4) as the numerically predicted solution. The user can specify an error tolerance. In the program above we just relied to the default tolerance, which can be printed by

.. code-block:: python

        print solver.get()

yielding a list of all optional parameters:

.. code-block:: text

        {'f_kwargs': {}, 'f_args': (),
         'max_step': 1.5000000000000036, 'verbose': 0,
         'min_step': 0.0014999999999999946,
         'first_step': 0.14999999999999999,
         'rtol': 1e-06, 'atol': 1e-08,
         'name of f': 'f', 'complex_valued': False,
         'disk_storage': False, 'u_exact': None}

The tolerances involved are of relative and absolute type, i.e.,

.. code-block:: python

        estimated_error <= tol = rtol*abs(u) + atol

is the typical test on whether the solution is sufficiently.
For very small ``u``, ``atol`` comes into play, while for large ``u``, the
relative tolerance ``rtol`` dominates.

In this particular example, running ``RK4`` with 57 equally spaced points
yields a maximum value of 1.95, while ``Fehlberg`` with 57 adaptively
selected points results in 1.98. Note that the tolerances used are
:math:`10^{-6}` while the real error is of the order :math:`10^{-2}`.

We can specify stricter tolerances and also control the minimum
allowed step size, ``min_step``, which might be too large to achieve
the desired error level (`gaussian2.py <https://github.com/hplgit/odespy/blob/master/doc/src/odespy/src-odespy/gaussian2.py>`__):

.. code-block:: python

        rtol = 1E-12
        atol = rtol
        min_step = 0.000001
        
        solver = odespy.Fehlberg(problem.f, atol=atol, rtol=rtol,
                                 min_step=min_step)

The ``Fehlberg`` solver now applies 701 points and achieves a maximum
value of 2.00005. However, ``RK4`` with the same number of (equally spaced) points
achieves the same accuracy and is much faster.

Testing More Adaptive Solvers
-----------------------------

We have already solved :eq:`gaussian:ode:eq` with sufficient accuracy, but
let us see how other methods perform, as this will most likely result in
a surprise. Below is a program (`gaussian3.py <https://github.com/hplgit/odespy/blob/master/doc/src/odespy/src-odespy/gaussian3.py>`__) that compares several famous and
widely used methods in the same plot.

.. code-block:: python

        import odespy, numpy as np, matplotlib.pyplot as plt
        
        def run(problem, tp, solver):
            method = solver.__class__.__name__
        
            solver.set_initial_condition(problem.U0)
        
            u, t = solver.solve(tp)
            solver.u_max = u.max()
            print '%.4f  %s' % (solver.u_max, method)
        
            if solver.has_u_t_all():
                plt.plot(solver.t_all, solver.u_all)
                print '%s used %d steps (%d specified)' % \
                      (method, len(solver.u_all), len(tp))
            else:
                plt.plot(solver.t, solver.u)
            legend.append(method)
            plt.hold('on')
        
        rtol = 1E-6
        atol = rtol
        s = 0.5
        npoints = 41
        center_point = 3
        problem = odespy.problems.Gaussian1(c=center_point, s=s)
        tp = np.linspace(0, 2*center_point, npoints)
        min_step = 0.0001
        
        methods = ['DormandPrince', 'BogackiShampine',
                   'RKFehlberg', 'Vode', 'RKF45', 'Lsoda']
        solvers = [eval('odespy.' + method)(
                   problem.f, atol=atol, rtol=rtol,
                   min_step=min_step)
                   for method in methods]
        # Run Vode with implicit BDF method of order 5
        solvers[1].set(adams_or_bdf='bdf', order=5, jac=problem.jac)
        
        legend = []
        for solver in solvers:
            run(problem, tp, solver)
        
        plt.plot(tp, problem.u_exact(tp))
        legend.append('exact')
        plt.legend(legend)
        plt.savefig('tmp1.png')
        
        # Plot errors
        plt.figure()
        exact = problem.u_exact(tp)
        for solver in solvers:
            plt.plot(tp, exact - solver.u)
            plt.hold('on')
        plt.legend(legend)
        plt.savefig('tmp2.png')
        plt.show()

The default discretization applies :math:`N=40` equal-sized time intervals,
but adaptive methods should be able to adjust themselves to the
desired error level :math:`10^{-6}`. Figures :ref:`gaussian3:fig1:41` and
:ref:`gaussian3:fig1err:41` show that this expected behavior is not the
case. There is substantial discrepancy between the methods!
Surprisingly, the well-known FORTRAN codes accessed by the ``Vode``
(``vode.f``) and ``Lsoda`` (from ODEPACK) methods give very inaccurate
results, despite setting ``Vode`` to use a stiff BDF solver of order 5, and
``Lsoda`` should automatically select between nonstiff and stiff solvers
(of default order 4 in this case).

.. _gaussian3:fig1:41:

.. figure:: gaussian3_41.png
   :width: 500

   *Comparison of adaptive methods with default parameters (tolerance ``E-6``)*

.. _gaussian3:fig1err:41:

.. figure:: gaussian3e_41.png
   :width: 500

   *Comparison of errors in adaptive methods with default parameters (tolerance ``E-6``)*

The program writes out the following results for the
maximum value of the solution, which should equal 2:

.. code-block:: text

        1.9976  DormandPrince
        1.6591  BogackiShampine
        1.9812  RKFehlberg
        1.0406  Vode
        1.9734  RKF45
        3.2905  Lsoda

The clearly most accurate solver among these is ``DormandPrince``  - the default
method used by MATLAB's ``ode45`` solver, which is perhaps the world's
most popular ODE solver.

The remedy to get all the tested solvers to perform well is to
choose a much stricter tolerance, say :math:`10^{-10}`.
Figure :ref:`gaussian3:fig2:41` shows
coinciding curves. Numerically, we now have

.. code-block:: text

        1.9991  DormandPrince
        1.9912  BogackiShampine
        1.9942  RKFehlberg
        2.0122  Vode
        1.9916  RKF45
        2.0056  Lsoda

For the methods ``DormandPrince``,  ``RKFehlberg``, and
``BogackiShampine`` we have information about the number of adaptive
time points used: 270, 326, and 3307, respectively.

.. _gaussian3:fig2:41:

.. figure:: gaussian3_41_Em10.png
   :width: 500

   *Comparison of adaptive methods with error tolerance `E-10`*

The lesson learned in this example is two-fold: 1) several methods should
be tested to gain reliability of the results, and Odespy makes such tests
easy to conduct, and 2) strict tolerances, far below
the default values, may be necessary for some methods, here ``Vode`` and
``Lsoda`` in particular. We remark that it is the ODE problem that causes
difficulties: changing the problem to ``odespy.problems.Logistic``
(see the file `logistic9.py <https://github.com/hplgit/odespy/blob/master/doc/src/odespy/src-odespy/logistic9.py>`__)
shows that all the curves coincide and cannot be distinguished visually.

The present test problem with a Gaussian function as solution
can be made increasingly more difficult by
increasing the value of :math:`c/s`, i.e., a more peaked function moved to
the right.

Extensive Testing
-----------------

The program `gaussian4.py <https://github.com/hplgit/odespy/blob/master/doc/src/odespy/src-odespy/gaussian4.py>`__
sets up an extensive experiments involving a lot of solvers, several
:math:`c/s` values, and several error tolerances.  The experiments clearly
demonstrate how challenging this ODE problem is for many adaptive
solvers unless :math:`c/s` is moderate and a strict tolerance (much lower
than the real accuracy level) is used. One can especially see the
fundamental difficulties that ``Vode``, ``Lsode``, and ``Lsoda`` (all stiff
and nonstiff versions) face when :math:`c/s\geq 8`: these solvers do not
manage to pick up any variation in :math:`u`. Nevertheless, for less
demanding ODEs these solvers may perform very well, but it is highly
recommended to always use the power of the unified Odespy interface
to test several different adaptive methods.

.. !split

Solving Partial Differential Equations
======================================

Let us now turn the attention to the method of lines for
partial differential equations  (PDEs) where one reduces a PDE to
a system of ODE and then applies standard methods ODEs.

We address a diffusion problem in one dimension:

.. math::
   :label: odespy:pde:diffusion
        
        \frac{\partial u(x,t)}{\partial t} = \beta \frac{\partial^{2}u(x,t)}{\partial x^2} + f(x,t), x \in \left(0,L\right),  t \in (0,T],
          
        
.. math::
   :label: odespy:pde:diffusion:x0
          
        u(0,t) = s(t),  t \in (0,T],
          
        
.. math::
   :label: odespy:pde:diffusion:xL
          
        \frac{\partial}{\partial x}u(L,t) = 0, t \in (0,T],
          
        
.. math::
   :label: odespy:pde:diffusion:I
          
        u(x,0) = I(x), x \in \left[0,L\right]
        
        \thinspace .
        

Discretization in Space
-----------------------

Discretizing the 2nd-order derivative in space with a finite difference,
on a mesh :math:`x_i=i\Delta x`, :math:`i=1,\ldots,N-1`, gives the ODEs

.. math::
        
        \frac{\partial u_i(t)}{\partial t} = \beta
        \frac{u_{i+1}(t) - 2u_i(t) + u_{i-1}(t)}{\Delta x^2} + f_i(t),\quad i=1,\ldots,N-1
        \thinspace .
        

Here we have introduced the notation :math:`u_i(t)` as an approximation to the
exact solution at mesh point :math:`x_i`.

The boundary condition on :math:`x=0`, :math:`u(0,t)=s(t)`, gives rise to the ODE

.. math::
         u_0'(t) = s'(t),\quad u_0(0)=s(0)\thinspace .

At the other end, :math:`x=L`, we use a centered difference approximation
:math:`(u_{N+1}(t)-u_{N-1}(t))/(2\Delta x) =0` and combine it with the
scheme for :math:`i=N` to obtain the modified boundary ODE

.. math::
        
        \frac{\partial u_N(t)}{\partial t} = \beta
        \frac{2u_{N-1}(t) - 2u_N(t)}{\partial x^2} + f_N(t)\thinspace .
        

To summarize, the ODE system reads

.. math::
   :label: sec:pde:diff1D:ode1:0
        
        \frac{du_0}{dt} = s'(t),
        
        
.. math::
   :label: sec:pde:diff1D:ode1:i
          
        \frac{du_i}{dt} =  \frac{\beta}{\Delta x^2}
        (u_{i+1}(t) - 2u_i(t) + u_{i-1}(t)) + f_i(t),\quad i=1,\ldots,N-1,
        
        
.. math::
   :label: sec:pde:diff1D:ode1:N
          
        \frac{du_N}{dt} =  \frac{2\beta}{\Delta x^2}
        (u_{N-1}(t) - u_N(t)) + f_i(t)\thinspace .
        
        
The initial conditions are

.. math::
        
        u_0(0) = s(0),
        

.. math::
          
        u_i(0) = I(x_i),\quad i=1,\ldots,N\thinspace .
        

We can apply any method for systems of ODEs to solve
:eq:`sec:pde:diff1D:ode1:0`-:eq:`sec:pde:diff1D:ode1:N`.

Implementation
--------------

Consider
the evolution of the temperature in a rod modeled by our diffusion problem.
At :math:`t=0`, the rod has the temperature 10 C. We then apply a heat source
at :math:`x=0` which keepes the temperature there at 50 C. This means that
:math:`I(x)=283` K, except at the end point:
:math:`I(0)=423` K, :math:`s(t) = 423` K, and :math:`f=0`.

Odespy solvers need the right-hand side function of
:eq:`sec:pde:diff1D:ode1:0`-:eq:`sec:pde:diff1D:ode1:N`:

.. code-block:: python

        def rhs(u, t, L=None, beta=None, x=None):
            N = len(u) - 1
            dx = x[1] - x[0]
            rhs = zeros(N+1)
            rhs[0] = dsdt(t)
            for i in range(1, N):
                rhs[i] = (beta/dx**2)*(u[i+1] - 2*u[i] + u[i-1]) + \ 
                         f(x[i], t)
            rhs[N] = (beta/dx**2)*(2*u[i-1] - 2*u[i]) + f(x[N], t)
            return rhs

This function requires the variables ``beta``, ``x``, ``dx``, and ``L``, which
we provide as keyword arguments and that can be transferred to ``rhs``
through the ``f_kwargs`` argument to the Odespy constructors.

We also need some helper functions

.. code-block:: python

        def s(t):
            return 423
        
        def dsdt(t):
            return 0
        
        def f(x, t):
            return 0

Parameters and initial conditions can be set as

.. code-block:: python

        N = 40
        L = 1
        x = linspace(0, L, N+1)
        f_kwargs = dict(L=L, beta=1, x=x)
        u = zeros(N+1)
        
        U_0 = zeros(N+1)
        U_0[0] = s(0)
        U_0[1:] = 283

The construction and execution of a solver is now a matter of

.. code-block:: python

        import odespy
        solver = odespy.ForwardEuler(rhs, f_kwargs=f_kwargs)
        solver.set_initial_condition(U_0)
        
        dx = x[1] - x[0]
        dt = dx**2/(2*beta) # Forward Euler limit
        N_t = int(round(T/float(dt)))
        time_points = linspace(0, T, N_t+1)
        
        u, t = solver.solve(time_points)

We can add some flexibility and set up several solvers, also
implicit methods:

.. code-block:: python

        import odespy
        solvers = {
            'FE': odespy.ForwardEuler(
                rhs, f_kwargs=f_kwargs),
            'BE': odespy.BackwardEuler(
                rhs, f_is_linear=True, jac=K,
                f_kwargs=f_kwargs, jac_kwargs=f_kwargs),
            'B2': odespy.Backward2Step(
                rhs, f_is_linear=True, jac=K,
                f_kwargs=f_kwargs, jac_kwargs=f_kwargs),
            'theta': odespy.ThetaRule(
                rhs, f_is_linear=True, jac=K, theta=0.5,
                f_kwargs=f_kwargs, jac_kwargs=f_kwargs),
            'RKF': odespy.RKFehlberg(
                rhs, rtol=1E-6, atol=1E-8, f_kwargs=f_kwargs),
            'RKC': odespy.RKC(
                rhs, rtol=1E-6, atol=1E-8, f_kwargs=f_kwargs,
                jac_constant=True),
            }
        
        try:
            method = sys.argv[1]
            dt = float(sys.argv[2])
            T = float(sys.argv[3])
        except IndexError:
            method = 'FE'
            dx = x[1] - x[0]
            dt = dx**2/(2*beta) # Forward Euler limit
            print 'Forward Euler stability limit:', dt
            T = 1.2
        
        solver = solvers[method]

The implicit solvers need the Jacobian of the right-hand side function:

.. code-block:: python

        def K(u, t, L=None, beta=None, x=None):
            N = len(u) - 1
            dx = x[1] - x[0]
            K = zeros((N+1,N+1))
            K[0,0] = 0
            for i in range(1, N):
                K[i,i-1] = beta/dx**2
                K[i,i] = -2*beta/dx**2
                K[i,i+1] = beta/dx**2
            K[N,N-1] = (beta/dx**2)*2
            K[N,N] = (beta/dx**2)*(-2)
            return K

Note that we work with *dense square matrices* while the mathematics
allows a tridiagonal matrix and corresponding solver.
However, in 1D problems, the computations
are so fast anyway so we can live with dense matrices.

Finally, some animation of the solution is desirable:

.. code-block:: python

        # Make movie
        import os
        os.system('rm tmp_*.png')  # remove old plot files
        plt.figure()
        plt.ion()
        y = u[0,:]
        lines = plt.plot(x, y)
        plt.axis([x[0], x[-1], 273, 1.2*s(0)])
        plt.xlabel('x')
        plt.ylabel('u(x,t)')
        counter = 0
        for i in range(0, u.shape[0]):
            print t[i]
            lines[0].set_ydata(u[i,:])
            plt.legend(['t=%.3f' % t[i]])
            plt.draw()
            if i % 5 == 0: # plot every 5 steps
                plt.savefig('tmp_%04d.png' % counter)
                counter += 1
            #time.sleep(0.2)

Vectorized Code
~~~~~~~~~~~~~~~

It is easy to get rid of the loops in the ``rhs`` and ``K`` functions:

.. code-block:: python

        def rhs_vec(u, t, L=None, beta=None, x=None):
            N = len(u) - 1
            dx = x[1] - x[0]
            rhs = zeros(N+1)
            rhs[0] = dsdt(t)
            rhs[1:N] = (beta/dx**2)*(u[2:N+1] - 2*u[1:N] + u[0:N-1]) + \ 
                       f(x[1:N], t)
            i = N
            rhs[i] = (beta/dx**2)*(2*u[i-1] - 2*u[i]) + f(x[N], t)
            return rhs
        
        def K_vec(u, t, L=None, beta=None, x=None):
            """Vectorized computation of K."""
            N = len(u) - 1
            dx = x[1] - x[0]
            K = zeros((N+1,N+1))
            K[0,0] = 0
            K[1:N-1] = beta/dx**2
            K[1:N] = -2*beta/dx**2
            K[2:N+1] = beta/dx**2
            K[N,N-1] = (beta/dx**2)*2
            K[N,N] = (beta/dx**2)*(-2)
            return K

Experiments
-----------

The program, `pde_diffusion.py <https://github.com/hplgit/odespy/blob/master/doc/src/odespy/src-odespy/pde_diffusion.py>`__, can be run to test different solvers and illustrate numerical
methods:

.. Note: the "Run" after the paragraph, before code, is essential for

.. latex to produce correct code (a latex, not a doconce, problem).

**Forward Euler Method.**
Run

.. code-block:: text

        python pde_diffusion.py

The graphics takes very long time in this simulation, because of the
small time step.

**Backward Euler.**
Run

.. code-block:: text

        python ode_diffusion.py BE 0.05 1.2

**Backward 2-step Method.**
Run

.. code-block:: text

        python ode_diffusion.py B2 0.05 1.2

**Crank-Nicolson Method.**
Run

.. code-block:: text

        python ode_diffusion.py theta 0.01 1.2

Observe the non-physical oscillations because of the steep initial
condition (and the lack of damping in the Crank-Nicolson scheme).

**Runge-Kutta-Fehlberg Method.**
Run

.. code-block:: text

        python ode_diffusion.py RKF 0.01 1.2

Note here that we specify a quite large :math:`\Delta t`, much larger than
what a Runge-Kutta method can work with (typically, an RK4 method
needs a :math:`\Delta t` as small as the critical step for the Forward Euler
method). However, the adaptive method figures out what it needs of
steps and produces a nice solution.

**Runge-Kutta-Chebyshev Method.**
Run

.. code-block:: text

        python ode_diffusion.py RKC 0.05 1.2

This is a promising method for the diffusion equation. It works like
an explicit method and can tolerate large time steps. This method calls
up the FORTRAN code ``rkc.f``.

.. !split

Inner Workings of the Package
=============================

There are three basic entities when solving ODEs numerically: the
definition of the *ODE system*, the *time-stepping method*, and
the *solver* that runs the "time loop" and stores results.

The information about the ODE system is made very simple: the user
provides 1) an object that can be called as a Python function ``f(u,
t)``, and 2) an array or list with the initial values.  Some users will
store this information in their own data structures, e.g., a class.

The time-stepping method and the algorithm for calling the time-stepping
are collected in a solver class. All the solver classes are
related in a class hierarchy. Each solver class initialized by the right-hand
side function (``f``) and an optional set of parameters for controlling
various parts of the solution process. The solver object is also
used to set the initial condition (``set_initial_condition``) and
to run the solution process (``solve``). The time-stepping scheme
is normally isolated in a method ``advance`` in the solver classes, but
for some schemes or external software packages the separation of
doing one step and doing the whole time integration is less feasible.
In those cases, ``solve`` will mix the time-stepping loop and the
numerical scheme.

The package does not interact with visualization tools - the array
containing the solution is returned to the user and must be further
processed and visualized in the user's code.

Below we describe how the classes in the solver hierarchy work and how
parameters are registered and initialized.

.. _odes:parameters:

Solver Parameters
-----------------

The ``solver`` module defines a global dictionary ``_parameters`` holding
all legal parameters in Odespy classes. These are parameters that
the user can adjust. Other modules imports this ``_parameters`` dict
and updates it with their own additional parameters.

For each parameter the ``_parameters`` dict stores the parameter name, a
default value, a description, the legal object type for the value of
the parameter, and other quantities if needed. A typical example
is

.. code-block:: text

        _parameters = dict(
        ...
        
        f = dict(
            help='Right-hand side f(u,t) defining the ODE',
            type=callable),
        
        f_kwargs = dict(
            help='Extra keyword arguments to f: f(u, t, *f_args, **f_kwargs)',
            type=dict,
            default={}),
        
        theta = dict(
            help='eight in [0,1] used for'\ 
                 '"theta-rule" finite difference approx.',
            default=0.5,
            type=(int,float),
            range=[0, 1])
        
        ...
        }

Each solver class defines a (static) class variable
``_required_parameters`` for holding the names of all required
parameters (in a list). In addition, each solver class defines another
class variable ``_optional_parameters`` holding the names of all the
optional parameters. The doc strings of the solver classes are
automatically equipped with tables of required and optional
parameters.

The optional parameters of a class consist of the optional parameters
of the superclass and those specific to the class. The typical
initialization of ``_optional_parameters`` goes like this:

.. code-block:: python

        class SomeMethod(ParentMethod):
            _optional_parameters = ParentMethod._optional_parameters + \ 
                                   ['prm1', 'prm2', ...]

where ``prm1``, ``prm2``, etc. are names registered in the global
``_parameters`` dictionary.

From a user's point of view, the parameters are set either at
construction time or through the ``set`` function:

.. code-block:: python

        >>> from odespy import RK2
        >>> def f(u, t, a, b=0):
        ...   return a*u + b
        ...
        >>> solver = RK2(f, f_kwargs=dict(b=1))
        >>> solver.f_kwargs
        {'b': 1}
        >>> solver.set(f_args=(3,))
        >>> solver.f_args
        (3,)
        >>> # Get all registered parameters in the method instance
        >>> solver.get()
        {'f_kwargs': {'b': 1}, 'f_args': (3,), 'complex_valued': False,
        'name of f': 'f'}

The ``set`` method sets parameters through keyword arguments and can
take an arbitrary collection of such arguments:

.. code-block:: python

        solver.set(name1=value1, name2=value2, name3=value3, ...)

The ``get`` method returns the parameters and their values as a dictionary.
We remark that the ``'f'`` key, which one might expect to appear in the
returned dictionary of parameters, are omitted because it is always
a lambda function wrapping the user's ``f`` function such that the
returned value is guaranteed to be a ``numpy`` array. Instead,
there is an entry ``'name of f'`` which reflects the name of the
user-supplied function. The same comment applies to the ``jac`` parameter
for specifying the Jacobian used in implicit methods.

Solver Classes
--------------

Each solver in this package is implemented as a class in a class hierarchy.
Basic, common functionality is inherited from super classes, and the
actual solver class implements what is specific for the method in question.

The Inherited Superclass Constructor
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Class ``Solver`` is the super class of the hierarchy. Subclasses
normally just inherit their constructor from class ``Solver``. This
constructor requires one mandatory argument: the right-hand side of
the ODE, :math:`f(u,t)`, coded as a Python function ``f(u, t)``.
Some solvers implemented in FORTRAN allows ``f`` to be a wrapper of
a FORTRAN function defining the right-hand side, but such a wrapper
is specified through the ``f_f77`` argument and using ``None`` for ``f``.
The constructor accepts a range of
additional keyword arguments for setting
parameters of the solver. Which keyword arguments that are
available depends on what the subclass has registered as legal
parameters in ``_optional_parameters`` and ``_required_parameters``.

The constructor performs a set of tasks that are common to all
the subclass solvers:

1. The set of optional and required parameters of a particular solver
   is loaded into ``self._parameters`` such that this dictionary
   can be used to look up all parameters of the solver.

2. The solver-specific method ``adjust_parameters`` is called to allow
   the programmer of a solver to manipulate ``self._parameters``.
   For example, some parameters may be modified or set
   according to the value of others.

3. Entries in ``self._parameters`` are mirrored by class
   attributes. The computations and the ``set`` and ``get`` methods will
   make use of the attributes rather than the ``self._parameters`` dict
   to extract data.  For example, the value of
   ``self._parameters['myvar']`` becomes available as ``self.myvar`` and
   in the algorithms we use ``self.myvar``, perhaps with a test
   ``hasattr(self, 'myvar')`` or a ``try-except`` clause (catching
   an ``AttributeError``).

4. The ``set`` method is called with all keyword arguments given to the
   constructor, which then modifies the default values of the
   parameters and sets the corresponding attributes.

5. The ``f`` function is wrapped in a lambda function such that
   ``f(u, t)`` is guaranteed to return an array (in case the user
   returns a list or scalar for convenience). The same is done with
   the Jacobian (``jac``) and other user-defined callback functions.

6. The ``initialize`` method is called to finalize the tasks in
   the constructor. The most common use of this method in subclasses
   is to import extension modules that the solver depends on and
   provide an error message if the extension modules are not available.
   If they are, the modules are normally stored through an attribute
   of the subclass.

Useful Methods
~~~~~~~~~~~~~~

Let ``solver`` be some instance of a subclass in the hierarchy. The
following methods are sometimes useful:

 * ``repr(solver)``: return the subclass name along with all
   registered parameters and their values. This string provides
   complete information on the initialization of a solver.

 * ``str(solver)``: return a short pretty print string reflecting
   the name of the method and the value of parameters that
   must be known to uniquely define the numerical method.
   This string, or the class name as given by ``solver.name()``,
   is useful for the legend in a plot or as method identifier in a table.

 * ``solver.get_parameter_info()``: return or print all registered
   parameters for the current solver and all properties for
   each parameter.

7. ``solver.switch_to(name)``: return a new solver of type ``name``,
   initialized with all
   parameters of the current solver that are legal in the new solver.
   The method is useful when trying out a range of solvers for a
   problem.

The Solve Method
~~~~~~~~~~~~~~~~

After the constructor is called, ``solver.set_initial_condition`` is
called to set the initial condition, and then ``solve`` is called.
The ``solve`` method features the following steps:

1. Call ``initialize_for_solve`` (implemented in subclasses) to
   precompute whatever is needed before the time loop.
   The super class allocates storage for the solution and
   loads the initial condition into that data structure.
   Any subclass implementation of ``initialize_for_solve`` must therefore
   also call this method in its super class.

2. Call ``validate_data`` to check if the data structures are consistent
   before starting the computations. Subclass implementations of
   this method must call the super class' version of the method.

3. Run a loop over all time levels ``n`` and call ``advance`` (implemented
   in subclasses) at each level to advance the solution from
   time ``t[n]`` to ``t[n+1]``. Also call ``terminate`` so that the
   user code can analyze, work with the solution, and terminate the
   solution process.

Some subclasses will override the ``solve`` method and provide their own,
but most subclasses just inherits the general one and implements
the ``advance`` method.

Solver Attributes
~~~~~~~~~~~~~~~~~

All classes have a set of attributes:

1. ``users_f``: holds the user's function for :math:`f(u, t)`.
   Implicit solvers may have a corresponding ``users_jac`` for
   the user's Jacobian.

2. One attribute for each parameter in the class.

3. ``u``: 1D ``numpy`` array holding the solution for a scalar ODE and
   a 2D array in case of a system of ODEs. The first index
   denotes the time level.

4. ``t``: the time levels corresponding to the first index in the ``u`` array.

5. ``quick_description``: a short one-line description of the method (this
   variable is static in the class, i.e., declared outside any method).

Most classes will also define two additional static variables,
``_required_parameters`` and ``_optional_parameters`` as explained
in the section :ref:`odes:parameters`.

Other Superclasses
~~~~~~~~~~~~~~~~~~

There are superclasses ``SolverImplicit`` for implicit methods,
``Adaptive`` for adaptive methods, ``RungeKutta1level`` for general,
explicit 1-level Runge-Kutta methods, ``RungeKutta2level`` for
general, explicit, adaptive 2-level Runge-Kutta methods,
``ode_scipy`` for interfaces to ODE solvers
in ``scipy``, and ``Odepack`` for interfaces to the ODEPACK family
of solvers.

A Very Simple Subclass
----------------------

To implement a simple explicit scheme for solving a scalar ODE or a system
of ODEs, you only need to write a subclass of ``Solver`` with an
``advance`` method containing the formula that updates the solution from
one time level to the next. For example, the Forward Euler scheme
reads

.. math::
         u_{n+1} = u_n + \Delta t f(u_n, t_n),

where subscript :math:`n` denotes the time level, and :math:`\Delta t = t_{n+1}-t_n` is
the current time step.
The implementation goes like

.. code-block:: python

        class ForwardEuler(Solver):
            """
            Forward Euler scheme::
        
                u[n+1] = u[n] + dt*f(u[n], t[n])
            """
            quick_description = 'The simple explicit (forward) Euler scheme'
        
            def advance(self):
                u, f, n, t = self.u, self.f, self.n, self.t
                dt = t[n+1] - t[n]
                unew = u[n] + dt*f(u[n], t[n])
                return unew

Remarks:

1. The ``quick_description`` string is necessary for the class to appear in the
   automatically generated overview of implemented methods
   (run ``pydoc odespy`` to see this table).

2. Extracting class attributes in local variables (here ``u``, ``f``, etc.)
   avoids the need for the ``self`` prefix so that the implemented formulas
   are as close to the mathematical formulas as possible.

Almost equally simple schemes, like explicit Runge-Kutta methods and Heun's
method are implemented in the same way (see `solvers.py <https://github.com/hplgit/odespy/tree/master/odespy/odespy/solvers.py>`__).

A Subclass with More Code
-------------------------

A 2nd-order Adams-Bashforth scheme is a bit more complicated since it
involves three time levels and therefore needs a separate method for
the first step. We should also avoid unnecessary evaluations of :math:`f(u,t)`.
The user can specify a parameter ``start_method`` for the name of the
solver to be used for the first step. This solver is initialized
by the ``switch_to`` method in class ``Solver``. Basically,

.. code-block:: python

        new_solver = solver.switch_to(solver_name)

creates a new solver instance ``new_solver``, of the class implied by
``solver_name``, where all relevant parameters from ``solver`` are copied
to ``new_solver``.

An implementation of a subclass for the
2nd-order Adams-Bashforth scheme can then look as follows.

.. code-block:: python

        class AdamsBashforth2(Solver):
            """
            Second-order Adams-Bashforth method::
        
                u[n+1] = u[n] + dt/2.*(3*f(u[n], t[n]) - f(u[n-1], t[n-1]))
        
            for constant time step dt.
        
            RK2 is used as default solver in first step.
            """
            quick_description = "Explicit 2nd-order Adams-Bashforth method"
        
            _optional_parameters = Solver._optional_parameters + \ 
                                  ['start_method',]
        
            def initialize_for_solve(self):
                # New solver instance for first step
                self.starter = self.switch_to(self.start_method)
                Solver.initialize_for_solve(self)
        
            def validate_data(self):
                if not self.constant_time_step():
                    print '%s must have constant time step' % self.__name__
                    return False
                else:
                    return True
        
            def advance(self):
                u, f, n, t = self.u, self.f, self.n, self.t
        
                if n >= 1:
                    dt = t[n+1] - t[n]  # must be constant
                    self.f_n = f(u[n], t[n])
                    unew = u[n] + dt/2.*(3*self.f_n - self.f_n_1)
                    self.f_n_1 = self.f_n
                else:
                    # User-specified method for the first step
                    self.starter.set_initial_condition(u[n])
                    time_points = [t[n], t[n+1]]
                    u_starter, t_starter = self.starter.solve(time_points)
                    unew = u_starter[-1]
                    self.f_n_1 = f(u[0], t[0])
        
                return unew

Three features are worth comments: 1) we extend the set of optional
parameters; 2) we must initialize a separate solver for the first
step, and this is done in the ``initialize_for_solve`` method that will
be called as part of ``solve`` before the time stepping; and 3) we
extend ``validate_data`` to check that the time spacing given by the
``time_points`` argument to ``solve`` is constant. The utility method
``constant_time_step`` provided in super class ``Solver`` carries out the
details of the check.

More advanced implementations of subclasses can be studied
in the `solvers.py <https://github.com/hplgit/odespy/tree/master/odespy/odespy/solvers.py>`__ and `RungeKutta.py <https://github.com/hplgit/odespy/tree/master/odespy/odespy/RungeKutta.py>`__ files.

A Simple Example of an Implicit Method
--------------------------------------

Class ``SolverImplicit`` acts as superclass for the implementation of
implicit methods. This class provides some basic functionality for
solving the system of nonlinear equations that normally arises in
implicit methods by Picard or Newton iteration.
The parameter ``nonlinear_solver`` can take the values ``Picard`` or
``Newton``. The user must in case of Newton's method provide
a ``jac`` parameter for a function evaluating the Jacobian of :math:`f(u,t)`
with respect to :math:`u`: :math:`J_{i,j} = \partial f_i/\partial u_j`.

Instead of implementing an ``advance`` method in subclasses, one provides
a method ``Picard`` and/or ``Newton`` to define key quantities in these
methods. The superclass implements ``advance``, which will run a Picard
or Newton iteration. The ``Picard`` method returns all the terms
on the right-hand side of the discrete equation when only ``u[n+1]`` is
on the left-hand side. ``Newton`` returns the right-hand side and the
Jacobian of the system to be solved in each Newton iteration.

Here is an example showing the complete code of the Backward Euler method.

.. code-block:: python

        class BackwardEuler(SolverImplicit):
            """
            Implicit Backward Euler method::
        
               u[n+1] = u[n] + dt*f(t[n+1], u[n+1])
        
            The nonlinear system is solved by Newton or Picard iteration.
            """
            quick_description = "Implicit 1st-order Backward Euler method"
        
            def Picard_update(self, ukp1):
                u, f, n, t = self.u, self.f, self.n, self.t
                dt = t[n+1] - t[n]
                return u[n] + dt*f(ukp1, t[n+1])
        
            def Newton_system(self, ukp1):
                u, f, n, t = self.u, self.f, self.n, self.t
                dt = t[n+1] - t[n]
                F = ukp1 - (u[n] + dt*f(ukp1, t[n+1]))
                J = np.eye(self.neq) - dt*self.jac(ukp1, t[n+1])
                return F, J

.. !split

Troubleshooting
===============

Constructor takes exactly two arguments, 5 given
------------------------------------------------

Constructors in the ``Solver`` hierarchy take only the ``f`` function
as positional argument. All other parameters to the constructor
must be keyword arguments.

**Remark.**
This document was written with aid of the
`DocOnce <https://github.com/hplgit/doconce>`__ tool, which allows
output in many different formats.