1 General linear multistep methods

Euler’s method and the trapezium method are both special cases of a wider class of so-called linear multistep methods. The following Key Point gives the most general situation that we will look at.

Key Point 8

The general k -step linear multistep method is given by

α k y n + k + + α 1 y n + 1 + α 0 y n = h β k f n + k + + β 1 f n + 1 + β 0 f n

or equivalently

j = 0 k α j y n + j = h j = 0 k β j f n + j .

It is always the case that α k 0 . Also, at least one of α 0 and β 0 will be non-zero.

A linear multistep method is defined by the choice of the quantities

k , α 0 , α 1 , , α k , β 0 , β 1 , , β k

  • If β k = 0 the method is called explicit . (Because at each step, when we are trying to find the newest y n + k , there is no appearance of this unknown on the right-hand side.)
  • If β k 0 the method is called implicit . (Because y n + k now appears on both sides of the equation (on the right-hand side it appears through f n + k = f ( ( n + k ) h , y n + k ) , and we cannot, in general, rearrange to get an explicit formula for y n + k .)

The next Example shows one such choice.

Example 5

Write down the linear multistep scheme defined by the choices k = 1 , α 0 = 1 , α 1 = 1 , β 0 = β 1 = 1 2 .

Solution

Here k = 1 so that

α 1 y n + 1 + α 0 y n = h ( β 1 f n + 1 + β 0 f n )

and substituting the values in for the four coefficients gives

y n + 1 y n = h 1 2 f n + 1 + 1 2 f n

which, as we know, is the trapezium method.

Task!

Write down the linear multistep scheme defined by the choices k = 1 , α 0 = 1 , α 1 = 1 , β 0 = 1 and β 1 = 0 .

Here k = 1 and we have

α 1 y n + 1 + α 0 y n = h ( β 1 f n + 1 + β 0 f n )

and substituting the values in for the four coefficients gives

y n + 1 y n = h f n

which, as we know, is Euler’s method.

Task!

Write down the linear multistep scheme defined by the choices k = 2 , α 0 = 0 , α 1 = 1 , α 2 = 1 , β 2 = 0 , β 1 = 3 2 and β 0 = 1 2 .

Here k = 2 (so we are looking at a 2-step scheme) and we have

α 2 y n + 1 + α 1 y n + 1 + α 0 y n = h ( β 2 f n + 2 + β 1 f n + 1 + β 0 f n )

Substituting the values in for the six coefficients gives

y n + 2 y n + 1 = h 2 3 f n + 1 f n

which is an example of a scheme that is explicit (because β k = β 2 is zero).

In the preceding Section we saw several examples implementing the Euler and trapezium methods.

The next Example deals with the explicit 2-step that was the subject of the Task above.

Example 6

A numerical scheme has been used to approximate the solution of

d y d t = t + y , y ( 0 ) = 3

and has produced the following estimates, to 6 decimal places,

y ( 0.4 ) 4.509822 , y ( 0.45 ) 4.755313

Now use the 2-step, explicit linear multistep scheme

y n + 2 y n + 1 = h 1.5 f n + 1 0.5 f n

to approximate y ( 0.5 ) .

Solution

Evidently the value h = 0.05 will serve our purposes and we seek y 10 y ( 0.5 ) . The values we will need to use in our implementation of the 2-step scheme are y 9 = 4.755313 and

f 9 = f ( 0.45 , y 9 ) = 5.205313 f 8 = f ( 0.4 , y 8 ) = 4.909822

to 6 decimal places since f ( t , y ) = t + y . It follows that

y 10 = y 9 + 0.05 × 1.5 f 9 0.5 f 8 = 5.022966

And we conclude that y ( 0.5 ) 5.022966 , where this approximation has been given to 6 decimal places.

Notice that in this implementation of a 2-step method we needed to use the values of the two y values preceding the one currently being sought. Both y 8 and y 9 were used in finding y 10 .

Similarly, a k -step method will use, in general, k previous y values at each time step.

This means that there is an issue to be resolved in implementing methods that are 2- or higher-step, because when we start we are only given one starting value y 0 . This issue will be dealt with towards the end of this Section. The following exercise involves a 2-step method, but (like the example above) it does not encounter the difficulty relating to starting values as it assumes that the numerical procedure is already underway.

Task!

A numerical scheme has been used to approximate the solution of

d y d t = t y y ( 0 ) = 2

and has produced the following estimates, to 6 decimal places,

y ( 0.24 ) 2.013162 , y ( 0.26 ) 2.015546

Now use the 2-step, explicit linear multistep scheme

y n + 2 1 2 y n + 1 1 2 y n = 3 2 h f n + 1

to approximate y ( 0.28 ) .

Evidently the value h = 0.02 will serve our purposes and we seek y 14 y ( 0.28 ) . The values we will need to use in our implementation of the 2-step scheme are y 13 = 2.015546 , y 12 = 2.013162 and

f 13 = f ( 0.26 , y 13 ) = 0.128997

to 6 decimal places since f ( t , y ) = t y . It follows that

y 14 = 1 2 y 13 + 1 2 y 12 + 0.02 × 3 2 f 13 = 2.018224

And we conclude that y ( 0.28 ) 2.018224 , to 6 decimal places.

1.1 Zero stability

We now begin to classify linear multistep methods. Some choices of the coefficients give rise to schemes that work well, and some do not. One property that is required if we are to obtain reliable approximations is that the scheme be zero stable . A scheme that is zero stable will not produce approximations which grow unrealistically with t .

We define the first characteristic polynomial

ρ ( z ) = α 0 + α 1 z + α 2 z 2 + α k z k

where the α i are the coefficients of the linear multistep method as defined in Key Point 8 (page 21). This polynomial appears in the definition of zero stability given in the following Key Point.

Key Point 10

The linear multistep scheme

j = 0 k α j y n + j = h j = 0 k β j f n + j .

is said to be zero stable if the zeros of the first characteristic polynomial are such that

  1. none is larger than 1 in magnitude
  2. any zero equal to 1 in magnitude is simple (that is, not repeated)

The second characteristic polynomial is defined in terms of the coefficients on the right-hand side (the β j ), but its use is beyond the scope of this Workbook.

Example 7

Find the roots of the first characteristic polynomial for each of the examples below and determine whether or not the method is zero stable.

  1. y n + 1 y n = h f n
  2. y n + 1 2 y n = h f n
  3. y n + 2 + 3 y n + 1 4 y n = h 2 f n + 2 + f n + 1 + 2 f n
  4. y n + 2 y n + 1 = 3 2 h f n + 1
  5. y n + 2 2 y n + 1 + y n = h ( f n + 2 f n )
  6. y n + 2 + 2 y n + 1 + 5 y n = h f n + 2 f n + 1 + 2 f n
Solution
  1. In this case ρ ( z ) = z 1 and the single zero of ρ is z = 1 . This is a simple (that is, not repeated) root with magnitude equal to 1, so the method is zero stable.
  2. ρ ( z ) = z 2 which has one zero, z = 2 . This has magnitude 2 > 1 and therefore the method is not zero stable.
  3. ρ ( z ) = z 2 + 3 z 4 = ( z 1 ) ( z + 4 ) . One root is z = 4 which has magnitude greater than 1 and the method is therefore not zero stable.
  4. Here α 2 = 1 , α 1 = 1 and α 0 = 0 , therefore

    ρ ( z ) = z 2 z = z ( z 1 )

    which has two zeros, z = 0 and z = 1 . These both have magnitude less than or equal to 1 and there is no repeated zero with magnitude equal to 1, so the method is zero stable.

  5. ρ ( z ) = z 2 2 z + 1 = ( z 1 ) 2 . Here z = 1 is not a simple root, it is repeated and, since it has magnitude equal to 1, the method is not zero stable.
  6. ρ ( z ) = z 2 + 2 z + 5 and the roots of ρ ( z ) = 0 can be found from the quadratic formula. In this case the roots are complex and are equal to Zero-stability requires that the absolute values have magnitude less than or equal to 1. Consequently we conclude that the method is not zero stable.
Task!

Find the roots of the first characteristic polynomial for the linear multistep scheme

y n + 2 2 y n + 1 + y n = h f n + 2 + 2 f n + 1 + f n

and hence determine whether or not the scheme is zero stable.

The first characteristic polynomial is

ρ ( z ) = α 2 z 2 + α 1 z + α 0 = z 2 2 z + 1

and the roots of ρ ( z ) = 0 are both equal to 1. In the case of roots that are equal, zero-stability requires that the absolute value has magnitude less than 1. Consequently we conclude that the method is not zero stable.

At this stage, the notion of zero stability is rather abstract, so let us try using a zero unstable method and see what happens. We consider the simple test problem

d y d t = y , y ( 0 ) = 1

which we know to have analytic solution y ( t ) = e t , a quantity which decays with increasing t . Implementing the zero unstable scheme

y n + 1 2 y n = h f n

on a spreadsheet package with h = 0.05 gives the following results

n t = n h y n y ( n h ) ̲ ̲ ̲ 0 0.00 1.00000 1 0.05 1.95000 2 0.10 3.80250 3 0.15 7.41488 4 0.20 14.45901 5 0.25 28.19506 6 0.30 54.98037 7 0.35 107.21172 8 0.40 209.06286 9 0.45 407.67258 10 0.50 794.96153 11 0.55 1550.17499 12 0.60 3022.84122 13 0.65 5894.54039 14 0.70 11494.35376 15 0.75 22413.98982

where 5 decimal places have been given for y n . The dramatic growth in the values of y n is due to the zero instability of the method. (There are in fact other things than zero instability wrong with the scheme y n + 1 2 y n = h f n , but it is the zero instability that is causing the large numbers.)

1.2 Consistency and order

A scheme that is zero stable will produce approximations that do not grow in size in a way that is not present in the exact, analytic solution. Zero stability is a required property, but it is not enough on its own. There remains the issue of whether the approximations are close to the exact values.

The truncation error of the general linear multistep method is a measure of how well the differential equation and the numerical method agree with each other. It is defined by

τ j = 1 β c 0 h y ( j h ) + c 1 y ( j h ) + c 2 h y ( j h ) + c 3 h 2 y ( j h ) + = 1 β h p = 0 c p h p y ( p ) ( j h )

where β = β j is a normalising factor.

It is the first few terms in this expression that will matter most in what follows, and it helps us that there are formulae for the coefficients which appear

c 0 = α j , c 1 = ( j α j β j ) , c 2 = j 2 2 α j j β j , c 3 = j 3 3 ! α j j 2 2 β j

and so on, the general formula for p 2 is c p = j p ( p ) ! α j j p 1 ( p 1 ) ! β j .

Recall that the truncation error is intended to be a measure of how well the differential equation and its approximation agree with each other. We say that the numerical method is consistent with the differential equation if τ j tends to zero as h 0 . The following Key Point says this in other words.

Key Point 11

The linear multistep scheme is said to be consistent if c 0 = 0 and c 1 = 0 .

Example 8

Show that Euler’s method ( y n + 1 = y n + h f n ) is consistent.

Solution

In this case α 1 = 1 , α 0 = 1 , β 1 = 0 and β 0 = 1 . It follows that

c 0 = α j = 1 1 = 0 and c 1 = j α j β j = 1 α 1 ( β 0 + β 1 ) = 1 ( 1 + 0 ) = 0

and therefore Euler’s method is consistent.

Task!

Show that the trapezium method ( y n + 1 = y n + h 2 ( f n + 1 + f n ) ) is consistent.

In this case α 1 = 1 , α 0 = 1 , β 1 = 1 2 and β 0 = 1 2 . It follows that

c 0 = α j = 1 1 = 0 and c 1 = j α j β j = 1 α 1 ( β 0 + β 1 ) = 1 ( 1 2 + 1 2 ) = 0

and therefore the trapezium method is consistent.

Task!

Determine the consistency (or otherwise) of the following 2-step linear multistep schemes

  1. y n + 2 2 y n + 1 + y n = h ( f n + 2 f n )
  2. y n + 2 y n + 1 = h ( f n + 1 2 f n )
  3. y n + 2 y n + 1 = h ( 2 f n + 2 f n + 1 )
  1. c 0 = α 2 + α 1 + α 0 = 1 2 + 1 = 0 , c 1 = 2 α 2 + 1 × α 1 + 0 × α 0 ( β 2 + β 1 + β 0 ) = 2 ( 1 ) + 1 ( 2 ) + 0 ( 1 1 ) = 0 . Therefore the method is consistent.
  2. c 0 = 1 1 + 0 = 0 , c 1 = 2 1 ( 1 2 ) = 2 so the method is inconsistent.
  3. This method is consistent, because c 0 = 1 1 = 0 and c 1 = 2 1 ( 2 1 ) = 0 .

(Notice also that the first characteristic polynomial ρ ( z ) , defined on page 6 of this Section, evaluated at z = 1 is equal to α 0 + α 1 + + α k = c 0 . It follows that a consistent scheme must always have z = 1 as one of the roots of its ρ ( z ) .)

Assuming that the method is consistent, the order of the scheme tells us how quickly the truncation error tends to zero as h 0 . For example, if c 0 = 0 , c 1 = 0 , c 2 = 0 and c 3 0 then the first non-zero term in τ j will be the one involving h 2 and the linear multistep method is called second-order . This means that if h is small then τ j is dominated by the h 2 term (because the h 3 and subsequent terms will be tiny in comparison) and halving h will cause τ j to decrease by a factor of approximately 1 4 . The decrease is only approximately known because the h 3 and other terms will have a small effect.

We summarise the general situation in the following Key Point.

Key Point 12

A linear multistep method is said to be of order p if

c 0 = c 1 = c 2 = = c p = 0 and c p + 1 0

Combining the last two Key Points gives us another way of describing consistency: “A linear multistep method is consistent if it is at least first order".

Example 9

Find the order of

  1. Euler’s method
  2. The trapezium method.
Solution
  1. We have already found that c 0 = c 1 = 0 so the first quantity to calculate is

    c 2 = j 2 2 α j j β j = 1 2 α 1 β 1 = 1 2

    which is not zero and therefore Euler’s method is of order 1. (Or, in other words, Euler’s method is first order.)

  2. We have already found that c 0 = c 1 = 0 so the first quantity to calculate is

    c 2 = j 2 2 α j j β j = 1 2 α 1 β 1 = 1 2 1 2 = 0

    this is equal to zero, so we must calculate the next coefficient

    c 3 = j 3 3 ! α j j 2 2 β j = 1 6 α 1 1 2 β 1 = 1 6 1 4 = 1 12

    which is not zero. Hence the trapezium method is of order 2 (that is, it is second order).

This finally explains some of the results we saw in the first Section of this Workbook. We saw that the errors incurred by the Euler and trapezium methods, for a particular test problem, were roughly proportional to h and h 2 respectively. This behaviour is dictated by the first non-zero term in the truncation error which is the one involving c 2 h for Euler and the one involving c 3 h 2 for trapezium.

We now apply the method to another linear multistep scheme.

Example 10

Find the order of the 4-step, explicit linear multistep scheme

y n + 4 y n + 3 = h 24 55 f n + 3 59 f n + 2 + 37 f n + 1 9 f n

Solution

In the established notation we have α 4 = 1 , α 3 = 1 , α 2 = 0 , α 1 = 0 and α 0 = 0 . The β terms similarly come from the coefficients on the right hand side (remembering the denominator of 24).

Now

c 0 = α j = 0 and c 1 = j α j β j = 0

from which we conclude that the method is consistent.

We also find that

c 2 = 1 2 j 2 α j j β j = 0 , c 3 = 1 6 j 3 α j 1 2 j 2 β j = 0 , c 4 = 1 24 j 4 α j 1 6 j 3 β j = 0 c 5 = 1 120 j 5 α j 1 24 j 4 β j = 0.348611 to 6 d.p.

(The exact value of c 5 is 251 720 .)

Because c 5 is the first non-zero coefficient we conclude that the method is of order 4.

So the scheme in Example 10 has the property that the truncation error will tend to zero proportional to h 4 (approximately) as h 0 . This is a good thing, as it says that the error will decay to zero very quickly, when h is decreased.

Task!

Find the order of the 2-step linear multistep scheme

y n + 2 y n + 1 = h 12 f n + 2 + 8 f n + 1 f n

In the established notation we have α 2 = 1 , α 1 = 1 and α 0 = 0 . Also β 2 = 5 12 , β 1 = 2 3 and β 0 = 1 12 . Now

c 0 = α j = 1 1 + 0 = 0 and c 1 = j α j β j = 2 α 2 + α 1 ( β 2 + β 1 + β 0 ) = 0

from which we conclude that the method is consistent.

We also find that

c 2 = 1 2 j 2 α j j β j = 1 2 ( 4 α 2 + α 1 ) ( 2 β 2 + β 1 ) = 0 c 3 = 1 6 j 3 α j 1 2 j 2 β j = 1 6 ( 8 α 2 + α 1 ) 1 2 ( 4 β 2 + β 1 ) = 0 c 4 = 1 24 j 4 α j 1 6 j 3 β j = 1 24 ( 16 α 2 + α 1 ) 1 6 ( 8 β 2 + β 1 ) = 1 24                                                   

so that the method is of order 3.

1.3 Convergence

The key result concerning linear multistep methods is given in the following Key Point.

Key Point 13

The numerical approximation to the initial value problem converges to the actual solution as h 0 if

1. the scheme is zero stable

2. the scheme is consistent

The proof of this result lies beyond the scope of this Workbook. It is worth pointing out that this is not the whole story. The convergence result is useful, but only deals with h as it tends to zero. In practice we use a finite, non-zero value of h and there are ways of determining how big an h it is possible to “get away with" for a particular linear multistep scheme applied to a particular initial value problem.

If, when implementing the methods described above, it is found that the numerical approximations behave in an unexpected way (for example, if the numbers are very large when they should not be, or if decreasing h does not seem to lead to results that converge) then one topic to look for in further reading is that of “absolute stability".