7/19 - 7/23

7/19-7/23 notes

7/19

Numerical Differentiation

Limit of the Difference Quotient

ƒ'(x) = limit(h→0) [ƒ(x+h)-ƒ(x)]/h

We approximate derivative by choosing a small h

How small does h need to be?

-depends on ƒ(x)

we start with a large h and make it smaller and smaller until we see that D_k(the approximation) is loosing precision.

Example: | D_k-D_k-1| < ε

D_k = [ ƒ(x+h_k) - ƒ(x )] / h_k

Example: ƒ(x)=e^x, we want to approximate ƒ'(x)

D_k = (e^(x+h_k)-e^x) / h_k

k        h_kD_k

0        .1              2.8588

1        .01            2.7319

2        .001          2.7196

3        .0001        2.7184

4        .00001      2.7183

5        .000001    2.7183

we stop at k = 5, h_k = .000001

The iterative approach is slow. There are alternatives:

Central Differences Formulas

-Centered Formula of order O(h²)

start with Taylor expansion

x-x₀ → x=x₀+h → x-x₀=h

1. ƒ(x+h) = ƒ(x) + ƒ'(x)h + ƒ''(x)h²/2! + ƒ'''(c₁)h³/3! ← truncation error

2. ƒ(x-h) = ƒ(x) - ƒ'(x)h + ƒ''(x)h²/2! - ƒ'''(c₂)h³/3! ← truncation error

subtracting 2 from 1

ƒ(x+h) - ƒ(x+h) = 2 ƒ'(x)h + [ƒ'''(c₁)+ƒ'''(c₂)]h³/3!

ƒ(x) = 1/2h[ ƒ(x+h) - ƒ(x-h) - [ƒ'''(c₁)+ƒ'''(c₂)]h³/3! ]

= [ƒ(x+h)-ƒ(x-h)]/2h - [ƒ'''(c₁)+ƒ'''(c₂)]h²/ (2*3!) ← error O(h²)

ƒ'(x) ≈ [ƒ(x+h)-ƒ(x-h)]/2h

Example: ƒ(x) = e^x , h=.0001

ƒ'(1) = [e^(1+.0001)-e^(1-.0001)] / [2(.0001)] = 2.7183

-Centered Formula of order O(h⁴)

Do the Taylor expansion

ƒ(x+h) = ƒ(x) + ƒ'(x)h + ƒ''(x)h²/2! + ƒ'''(x)h³/3! + ƒ⁴(x)h⁴/4! + ƒ⁵(c₁)h⁵/5!

ƒ(x+h) = ƒ(x) - ƒ'(x)h + ƒ''(x)h²/2! - ƒ'''(x)h³/3! + ƒ⁴(x)h⁴/4! - ƒ⁵(c₁)h⁵/5!

1. ƒ(x+h) - ƒ(x-h) = 2ƒ'(x)h + 2ƒ^3(x)h³/3! + [ƒ⁵(c₁)+f⁵(c₂)]h⁵/5!

using step size 2h instead of h in 1 we get

2. ƒ(x+2h) - ƒ(x-2h) = 4ƒ'(x)h + 16ƒ^3(x)h³/3! + [ƒ⁵(c₁)+ƒ⁵(c₂)]32h⁵/5!

now multiply 1 by 8 and 2

8[ƒ(x+h) - ƒ(x-h)] - [ƒ(x+2h) - ƒ(x-2h)] = 8ƒ(x+h) - 8ƒ(x-h) - ƒ(x+2h) + ƒ(x-2h)

= 12ƒ'(x)h + kƒ⁵(c₃)h³ ← error (k constant)

so:

ƒ'(x) = [ -ƒ(x+2h) + 8ƒ(x+h) - 8ƒ(x-2h) + ƒ(x+h) ] / 12h + [ kƒ⁵(c₃)h⁴] / 12 ← error O(h⁴)

Example: let ƒ(x)=sin(x), h=.001 Centered formula O(h⁴)

ƒ'(π/3) = [ -sin(π/3+.002) + 8sin(π/3+.001) - 8sin(π/3-.002) + sin(π/3-.001) ] / 12(.001) = .5

centered formula O(h²)

ƒ'(π/3) = [ sin(π/3+.001) - sin(π/3-.001) ] / 2(.001) = .499999917

using difference quotient

ƒ'(π/3) = [ sin(π/3+.001) - sin(π/3) ] / .001 = .499566904

7/20

Numerical Integration

Trapezoidal Rule: approximates area under the curve with trapezoids

A₁ = (ƒ₀+ƒ₁)h/2

A₂ = (ƒ₁+ƒ₂)h/2

A₃ = (ƒ₂+ƒ₃)h/2

A₁+A₂+A₃= (ƒ₀+ƒ₁)h/2 + (ƒ₁+ƒ₂)h/2 + (ƒ₂+ƒ₃)h/2

= h [ ƒ₀/2 + ƒ₁/2 + ƒ₁/2 + ƒ₂/2 + ƒ₂/2 + ƒ₃/2 ]

= h [ ƒ₀/2 + ƒ₁ + ƒ₂ + ƒ₃/2 ]

In general: ∫_a→b ƒ(x)dx = h[ (ƒ(a)+ƒ(b))/2 + ∑_k=1→M-1ƒ(x_k) ] for M+1 points x₀...x_M

Example: ∫_-1→1e^{^}(-x²)dx , M=4, 4areas

h = (b-a)/M = [1-(-1)]/4 = 1/2

∫_-1→1e^{^}(-x²)dx ≈ 1/2 [ [(e^-(-1)² + e^-(1)²]/2 + e^-(-.5)² + e^-(0)² + e^-(.5)²]

= 1.4627

Example: ∫_0→π/2sin(x)dx , M=3, 3areas

h = π/6

∫_0→π/2sin(x)dx ≈ π/6 [ [sin(0)+sin(π/2)]/2 + sin(π/6) + sin(π/3) ]

= .9770486

7/21

Simpson Rule

-It approximates the integral using the area under a parabola every 3 points.

-The quadratic polynomial gives a better approximation than the line used in the trapezoidal rule.

x₁-x₀= h

x₂-x₀= 2h

x₂-x₁= h

Using Lagrange Polynomials we can obtain P₂(x)

P₂(x) = ƒ₀ ((x-x₁)(x-x₂)) / ((x₀-x₁)(x₀-x₂)) + ƒ₁ ((x-x₀)(x-x₂)) / ((x₁-x₀)(x₁-x₂)) + ƒ₂ ((x-x₀)(x-x₁)) / ((x₂-x₀)(x₂-x₁))

∫_x0→x2P₂(x)dx = ∫_x0→x2[ ƒ₀ ((x-x₁)(x-x₂)) / ((-h)(-2h)) + ƒ₁ ((x-x₀)(x-x₂)) / ((h)(-h)) + ƒ₂ ((x-x₀)(x-x₁)) / ((2h)(h)) ]

To do the integral, we do the substitution x = x₀+ht

→ t = (x-x₀)/h , dx = hdt

t₀ = (x₀-x₀)/h = 0

t₂ = (x₂-x₀)/h = 2

∫_x0→x2P₂(x)dx = ∫_0→2[ ƒ₀ ((x₀+ht-x₁)(x₀+ht-x₂)) / 2h² + ƒ₁ ((x₀+ht-x₀)(x₀+ht-x₂)) / -h² + ƒ₂ ((x₀+ht-x₀)(x₀+ht-x₁)) / 2h² ] hdt

= ∫_0→2[ ƒ₀ ((ht-h)(ht-2h)) / 2h² + ƒ₁ ((ht)(ht-2h)) / -h² + ƒ₂ ((ht)(ht-h)) / 2h² ] hdt

= ∫_0→2[ƒ₀/2 *(t-1)(t-2)] hdt + ∫_0→2[ƒ₁/-1 *(t)(t-2)] hdt + ∫_0→2[ƒ₂/2 *(t)(t-1)] hdt

= ƒ₀h/2 ∫_0→2(t²-3t+2) dt - ƒ₁h ∫_0→2(t²-2t) dt + ƒ₂h/2 ∫_0→2(t²-t) dt

= ƒ₀h/2 *(2/3) - ƒ₁h *(-4/3) + ƒ₂h/2 *(2/3)

= (ƒ₀+4ƒ₁+ƒ₂ )h/3

This equation is only for 3 points. If the interval [a,b] is subdivided into 2M intervals [x_k,x_k+1] of equal width we can use Simpson Rule every 3 points.

Example: 2M=6, M=3

2M intervals

2M+1 points

M parabolas

A = A₁ + A₂ + A₃

= h/3(ƒ₀+4ƒ₁+ƒ₂) + h/3(ƒ₂+4ƒ₃+ƒ₄) + h/3(ƒ₄+4ƒ₅+ƒ₆)

= h/3( ƒ₀+ 4ƒ₁+ 2ƒ₂+ 4ƒ₃+ 2ƒ₄+ 4ƒ₅+ ƒ₆)

In general (Simpson Rule):

A = h/3( ƒ₀+ 4ƒ₁+ 2ƒ₂+ 4ƒ₃+ 2ƒ₄+ ... + 4ƒ_2M-1+ ƒ_2M)

Example: ∫_0→π/2sin(x)dx , 2M=2, M=1

h = π/4

A = π/4/3 [ sin(0) + 4sin(π/4) + sin(π/2) ] = 1.002279878

Example: ∫_0→π/2sin(x)dx , 2M=4, M=2

h = π/8

A = π/8/3 [ sin(0) + 4sin(π/8) + 2sin(π/4) + 4sin(π3/8) + sin(π/2) ] = 1.000134585

Exact solution: ∫_0→π/2sin(x)dx = 1

7/22

Numerical Optimization

-Minimization of a function f(x)

let I=[a,b] an interval

-Local maximum value at x=p, in the interval I=[a,b]

if ƒ(x)≤ƒ(p) for all x є I

-Local minimum value at x=p, in the interval I=[a,b]

if ƒ(x)≥ƒ(p) for all x є I

-A function is increasing in the interval I = [a,b]

if x₁< x₂ and ƒ(x₁) < ƒ(x₂) for all x є I

if ƒ'(x)>0 for all x є I then ƒ(x) is increasing in the interval I

-A function is decreasing in the interval I = [a,b]

if x₁< x₂ and ƒ(x₁) > ƒ(x₂) for all x є I

if ƒ'(x)<0 for all x є I then ƒ(x) is decreasing in the interval I

-If there is a maximum of a minimum value at x=p then ƒ'(p) = 0

If ƒ''(p)<0 then ƒ(p) is a local maximum If ƒ''(p)>0 then ƒ(p) is a local minimum

Minimization Using Derivatives

Assume we want to minimize ƒ(x) and it has a unique minimum at p, a<p<b

We can solve the function ƒ'(x)=0 using any method for non-linear equations: Newthon, Secant, Bisection, Regula-Falsi

ƒ'(x) can be computed numerically using the centered formulas.

Golden Ratio

Assume ƒ(x) is unimodal (one minimum) in the interval [a,b]

we divide the interval in three subintervals a<c<d<b

Then we evaluate ƒ(x) in c and d

-if ƒ(c) < ƒ(d) then the new interval to search will be reduced to [a,d]

-if ƒ(c) > ƒ(d) then the new interval to search will be reduced to [c,b]

Why? if ƒ(c) < ƒ(d) then the minimum is in the left side of d

if ƒ(c) > ƒ(d) then the minimum is in the right side of c

c and d could be any two values such that a<c<d<b

We choose c and d in such a way that we reduce the number of computations done.

we choose c = a+(1-r)(b-a)

d = a+r(b-a)

we want:

(1-r)/r = r/1 → 1-r = r²

r²+r = 0

Golden Ratio: r = (-1±√(1+4))/2

= (√5 - 1)/2

By using the Golden Ratio to choose c,d we reduce the number of operations that we need to do

when ƒ(c) < ƒ(d)

the new interval becomes [a,d]

the old d will be the new b

d → b

the old c becomes the new d

c → d

the new c will have to be recomputed as well as ƒ(c)

when ƒ(c) > ƒ(d)

the new interval becomes [c,b]

the old c will be the new a

c → a

the old d becomes the new c

d → c

the new d will have to be recomputed as well as ƒ(d)

So at every iteration, we need to computer only one value either of ƒ(c) or ƒ(d) instead of two

Example: ƒ(x)=x²+1, r = (√5 - 1)/2 = .61803, start interval [-2,1]

i=1 a₁=-2 b₁=1

    c₁ = -2+(1-.61803)(1-(-2)) = -.8541

    d₁ = -2+.61803(1-(-2)) = -.14589

    ƒ(a₁)=5

    ƒ(b₁)=2

    ƒ(c₁)=1.72949

    ƒ(d₁)=1.02128

ƒ(c₁) > ƒ(d₁) → new interval [c₁,b₁]

7/23

continue example:

i=2 a₂=-.8541 b₂=1

    c₂ = -.8541+(1-.61803)(1-(-.8541)) = -.14589 (equal to d₁)

    d₂ = -.8541+.61803(1-(-.8541)) = .291789

    ƒ(a₂)=1.72949    →ƒ(c₁)

    ƒ(b₂)=2    →ƒ(b₁)

    ƒ(c₂)=1.02128    →ƒ(d₁)

    ƒ(d₂)=1.08514    →new value

ƒ(c₂) < ƒ(d₂) → new interval [a₂,d₂]

i=3 a₃=-.8541 b₃=.291789

c₃ = -.8541+(1-.61803)(.291789-(-.8541)) = -.4164

d₃ = c₂ = -.14589

ƒ(a₃)=1.72949    →ƒ(a₂)

ƒ(b₃)=1.08514    →ƒ(d₂)

ƒ(c₃)=1.17339    →new value

ƒ(d₃)=1.02128    →ƒ(c₂)

ƒ(c₃) > ƒ(d₃) → new interval [c₃,b₃]

Steepest Descent of Gradient Method:

-Assume we want to minimize ƒ(X) of N variables where X = (x₁,x₂,...x_n)

-The gradient ƒ(X) is a vector defined as:

Δƒ(X) = (dƒ(X)/dx₁, dƒ(X)/dx₂, ... dƒ(X)/dx_N)

From the concept of fradient we know that the gradient vector points in the direction of the greatest increase of ƒ(X)

Then -Δƒ(X) will point to the direction of greatest decrease.

To obtain the minimum using the gradient method, starting at point P₀ we have to move along the line in the direction of greatest decrease that is -Δƒ(X)

P₁= P₀- G₀h where G₀ is gradient at P₀ and h is a smaller constant

P_k+1= P_k- G_kh

Example: y = x²+1

y' = 2x, G = 2x

let h = .1

x₀ = -1

G₀ = 2(-1) = 2 x₁ = x₀-G₀h

= -1-(-2)(.1) = -.8

G₁ = 2(x₁) = -1.6 x₂ = x₁-G₁h

= -.8-(-1.6)(.1) = -.64

G₂ = 2(-.64) = -1.28 x₃ = -.64-(-1.28)(.1) = -.512

G₃ = -1.024 x₃ = -.4096

G₄ = -.8192 x₄ = -.32768

... ...

G_∞ = 0 x_∞ = 0

Numerical Methods for Differential Equations

Consider the differential equation y=g(t,y)

dy/dt = ky →solution y=f(t)

∫ dy/y = ∫ kdt

ln(y) = k₁t+k₂→solution y=e^(k₁t+k₂)

Some differential equations do not have an analytical solution so they have to be approximated using numerical methods.

Euler's Method

Let [a,b] be the interval over which we want to find the solution y'=f(t,y)