
Lecture Notes On Numerical Methods


ITERATIVE METHODS FOR SOLVING SYSTEM OF LINEAR EQUATIONS

1.1 Vectors and Vector Norms

1.1.1 Vectors
Definition 1.1 A vector is an ordered array of elements. It is customary to represent
vectors by small underlined or bolded letters. The elements may be listed in parentheses
separated by commas, expressed in terms of the directions i, j and k, or written as a
column enclosed by parentheses or square brackets.

Example 1.1

v = 3i + 5j + 3k = (3, 5, 3) = [3, 5, 3]^T
1.1.2 Vector Norms

The size of a vector is measured by a vector norm. The norm of a vector v is denoted
by ||v||. The commonly used vector norms are the 2-norm ||.||_2 and the infinity-norm
||.||_inf; another norm is the one-norm ||.||_1.

If v = (x1, x2, ..., xn)^T, then

||v||_2 = sqrt(x1^2 + x2^2 + ... + xn^2)

||v||_1 = Σ_{i=1}^{n} |xi|

||v||_inf = max{ |xi| : i = 1, ..., n }

Example 1.2

Given that x = (3, -4, 2)^T, find (i) ||x||_1 (ii) ||x||_2 (iii) ||x||_inf.

Solution:

(i) ||x||_1 = Σ_{i=1}^{3} |xi| = |3| + |-4| + |2| = 3 + 4 + 2 = 9

(ii) ||x||_2 = sqrt(x1^2 + x2^2 + x3^2) = sqrt(3^2 + (-4)^2 + 2^2) = sqrt(29)

(iii) ||x||_inf = max{ |xi| : i = 1, 2, 3 } = max{ |3|, |-4|, |2| } = max{3, 4, 2} = 4
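These hand computations are easy to check directly. The short Python sketch below, using only the standard library, reproduces the three norms of Example 1.2:

```python
import math

# The three common vector norms for x = (3, -4, 2)^T from Example 1.2.
x = [3, -4, 2]

one_norm = sum(abs(xi) for xi in x)            # |3| + |-4| + |2| = 9
two_norm = math.sqrt(sum(xi**2 for xi in x))   # sqrt(9 + 16 + 4) = sqrt(29)
inf_norm = max(abs(xi) for xi in x)            # max{3, 4, 2} = 4

print(one_norm, two_norm, inf_norm)
```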

1.2 Properties of Vector Norms

Let x and y be vectors in R^n and α be a real number. Then

1. ||x|| >= 0
2. ||x|| = 0 if and only if x = 0
3. ||αx|| = |α| ||x||
4. ||x + y|| <= ||x|| + ||y||   (the triangle inequality)

1.3 Matrices and Matrix Norms


1.3.1 Matrices
Definition 1.2 A matrix is a rectangular array of elements. Matrices are denoted by
capital letters and their elements by the corresponding small letters. The elements of a
matrix are enclosed by square brackets [ ] or parentheses ( ). A matrix has an order
m x n, where m is the number of rows and n is the number of columns. A
3 x 3 matrix A is written as

    [a11 a12 a13]
A = [a21 a22 a23]
    [a31 a32 a33]

This can be written briefly as A = [aij], 1 <= i <= 3, 1 <= j <= 3.

e.g. A = [2 -1]   or   A = (2 -1)
         [3  4]            (3  4)

1.3.2 Matrix Operations

Transposition: The transpose of a matrix A is the matrix obtained by interchanging the
rows and columns of A. It is denoted by A' or A^T.

Example 1.3

Find A^T, given that A = [2 3  4]
                         [1 0 -2]

Solution:

      [2  1]
A^T = [3  0]
      [4 -2]

Addition and Subtraction

We can add or subtract two or more matrices only if they are of the same order.
Addition or subtraction is performed by adding or subtracting the corresponding
elements of the matrices.

If A = [a11 a12]  and  B = [b11 b12] , then
       [a21 a22]           [b21 b22]

A + B = [a11+b11  a12+b12]   and   A - B = [a11-b11  a12-b12]
        [a21+b21  a22+b22]                 [a21-b21  a22-b22]

Multiplication

Two matrices can be multiplied only if they are compatible. Since matrix
multiplication is not commutative (i.e. AB ≠ BA in general), the order of multiplication
is important.
Compatibility means that the number of columns of the first matrix must equal the
number of rows of the second matrix.

Suppose we need to perform the multiplication AB, where A is m x p and B is p x n. Then
the resulting matrix C = AB is of order m x n. Matrix multiplication is performed by
multiplying the elements of each row of the first matrix by the corresponding elements
in the columns of the second matrix and summing the resulting products.

Example 1.4

Given that A = [1 3 2]  and  B = [-1 3]
               [6 4 1]           [ 0 4]
                                 [ 2 6]

Find (i) AB (ii) BA.

Solution

(i)  AB = [1(-1)+3(0)+2(2)   1(3)+3(4)+2(6)] = [-1+0+4   3+12+12] = [ 3 27]
          [6(-1)+4(0)+1(2)   6(3)+4(4)+1(6)]   [-6+0+2  18+16+6 ]   [-4 40]

(ii) BA = [-1+18  -3+12  -2+3] = [17  9  1]
          [ 0+24   0+16   0+4]   [24 16  4]
          [ 2+36   6+24   4+6]   [38 30 10]
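Since AB is 2 x 2 while BA is 3 x 3, the example also illustrates that AB ≠ BA. A quick NumPy check of the two products (assuming NumPy is available):

```python
import numpy as np

# Verifying Example 1.4: AB is 2x2 while BA is 3x3, so AB != BA.
A = np.array([[1, 3, 2],
              [6, 4, 1]])
B = np.array([[-1, 3],
              [ 0, 4],
              [ 2, 6]])

AB = A @ B   # the @ operator performs matrix multiplication
BA = B @ A
print(AB)    # [[ 3 27] [-4 40]]
print(BA)    # [[17  9  1] [24 16  4] [38 30 10]]
```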

1.3.3 Properties of Matrix Operators

(a) (A + B)^T = A^T + B^T
(b) (AB)^T = B^T A^T
(c) (A^T)^T = A

1.3.4 Special Types of Matrices

(a) Square Matrix: This is a matrix whose number of rows is equal to the number of
columns.

e.g. A = [1  9 7]
         [5 -6 0]   is a 3 x 3 square matrix.
         [4  6 2]

(b) Diagonal Matrix: This is a square matrix in which all of its off-diagonal elements
are zero, with at least one non-zero element on the main diagonal. It is
denoted by D. The elements of D can be obtained from the following expression

d_ij = d_ii if i = j,   0 if i ≠ j

e.g. D = [2  0 0]          [2 0 0]
         [0 -3 0]   or D = [0 0 0]
         [0  0 7]          [0 0 0]

(c) Identity Matrix: This is a diagonal matrix whose main-diagonal elements are all 1.
It is denoted by capital I.

e.g. I = [1 0 0]
         [0 1 0]
         [0 0 1]

(d) Symmetric Matrix: This is a square matrix whose transpose is equal to the matrix
itself, i.e. a matrix A is said to be symmetric iff A^T = A.

e.g. A = [1  2  3]         [1  2  3]
         [2  5 -1] , A^T = [2  5 -1] = A
         [3 -1  4]         [3 -1  4]

(e) Zero Matrix: This is a matrix of any order whose elements are all zero. It
is denoted by capital O.

e.g. O = [0 0 0]
         [0 0 0]
         [0 0 0]
(f) Strictly Diagonally Dominant Matrix: The n x n matrix A is said to be strictly
diagonally dominant if

|a_ii| > Σ_{j=1, j≠i}^{n} |a_ij|,   i = 1, ..., n.

e.g. The matrix A = [7  2 0]
                    [1 -3 1]   is strictly diagonally dominant, since
                    [4 -3 8]

|7| = 7 > |2| + |0| = 2
|-3| = 3 > |1| + |1| = 2
|8| = 8 > |4| + |-3| = 7
(g) Positive Definite Matrix: A symmetric n x n matrix A is called positive definite if
x^T A x > 0 for every non-zero vector x.

Theorem 1.1 If A is a positive definite matrix, then

(a) A is non-singular;
(b) a_ii > 0 for each i = 1, ..., n;
(c) max_{1<=k,j<=n, k≠j} |a_kj| <= max_{1<=i<=n} |a_ii|;
(d) (a_ij)^2 < a_ii a_jj for each i, j = 1, ..., n with i ≠ j.

(h) Tridiagonal Matrix: A tridiagonal matrix is one whose non-zero entries all lie on
the three main diagonals (the main diagonal and the diagonals directly above and below
it), and the rest of the entries are zeros.
Consider the following matrices:

A = [4 3 0 0 0]       B = [4 3 0 0 0]
    [1 2 2 0 0]           [1 2 2 0 0]
    [0 2 4 1 0]           [0 2 4 1 0]
    [0 0 1 7 3]           [0 0 1 7 3]
    [0 0 0 2 5]           [1 0 0 2 5]

Matrix A is a tridiagonal matrix while matrix B is not, since its entry b_51 = 1 lies
outside the three diagonals.

1.3.5 Matrix Norms

The sizes of matrices are also measured by norms. Matrix norms can be defined in terms
of vector norms. Let A be any non-zero matrix and x a non-zero vector; then the norm of
the matrix A is given by

||A|| = max_{x ≠ 0} ||Ax|| / ||x||

Two commonly used matrix norms are ||.||_1 and ||.||_inf, where

||A||_1 = max_{1<=j<=n} Σ_{i=1}^{m} |a_ij|   (maximum absolute column sum),

||A||_inf = max_{1<=i<=m} Σ_{j=1}^{n} |a_ij|   (maximum absolute row sum).

Example 1.5 Given A = [3 6 -1]
                      [4 5  3]
                      [0 2  8]

Find (i) ||A||_inf (ii) ||A||_1.

Solution

(i)  ||A||_inf = max{ |3|+|6|+|-1|, |4|+|5|+|3|, |0|+|2|+|8| }
              = max{10, 12, 10} = 12

(ii) ||A||_1 = max{ |3|+|4|+|0|, |6|+|5|+|2|, |-1|+|3|+|8| }
            = max{7, 13, 12} = 13
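NumPy evaluates both of these induced norms directly, which gives a quick check of the hand computation in Example 1.5:

```python
import numpy as np

# Checking Example 1.5 with NumPy's induced matrix norms.
A = np.array([[3, 6, -1],
              [4, 5,  3],
              [0, 2,  8]])

inf_norm = np.linalg.norm(A, np.inf)  # maximum absolute row sum
one_norm = np.linalg.norm(A, 1)       # maximum absolute column sum
print(inf_norm, one_norm)             # 12.0 13.0
```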

Matrix norms satisfy one further axiom:

||AB|| <= ||A|| ||B||

Definition 1.3 The condition number K(A) of a non-singular matrix A is defined by

K(A) = ||A^(-1)|| ||A||

The condition number is a useful indicator of the sensitivity of a problem to small
changes in its data. If the condition number is large, the problem is said to be
ill-conditioned, whereas if it is small the problem is called well-conditioned. The main
limitation in computing the condition number is that it requires the inverse of the
matrix. If the matrix is large, the computation requires the use of a computer package
such as MATLAB or Maple.
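As a sketch of the definition, the following computes K(A) in the infinity norm; the matrix of Example 1.5 is reused here purely for illustration. NumPy's `np.linalg.cond` evaluates the same product directly:

```python
import numpy as np

# Condition number K(A) = ||A^(-1)|| ||A||, here in the infinity norm,
# for the matrix of Example 1.5 (chosen just for illustration).
A = np.array([[3.0, 6.0, -1.0],
              [4.0, 5.0,  3.0],
              [0.0, 2.0,  8.0]])

K = np.linalg.norm(np.linalg.inv(A), np.inf) * np.linalg.norm(A, np.inf)
print(K)
print(np.linalg.cond(A, np.inf))  # same value, computed by NumPy
```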

Many mathematical and physical problems can be solved by the use of matrix methods.
Eigenvalues and eigenvectors are very useful components of solution to these problems.

1.4 Eigenvalues and Eigenvectors

Definition 1.4 An eigenvalue of a square matrix A is a number λ such that
det(A - λI) = 0, where I is the identity matrix of the same order as A.

To find the eigenvalues of a given matrix we simply solve the characteristic equation
det(A - λI) = 0, with λ as the only unknown.

Definition 1.5 An eigenvector corresponding to an eigenvalue λ is any non-zero vector
x such that

Ax = λx, or (A - λI)x = 0

Example 1.6
Find all eigenvalues and eigenvectors of A, given that A = [4 -1]
                                                           [2  1]
Solution
To find the eigenvalues we solve the characteristic equation det(A - λI) = 0:

det(A - λI) = | 4-λ   -1  | = 0
              |  2   1-λ  |

(4 - λ)(1 - λ) + 2 = 0
λ^2 - 5λ + 6 = 0
(λ - 2)(λ - 3) = 0
λ = 2 or λ = 3

Thus the eigenvalues of the matrix A are 2 and 3.

The eigenvector corresponding to λ = 2 is a non-zero vector x such that (A - λI)x = 0.

Let x = (x1, x2)^T. Then

(A - 2I)x = [2 -1][x1] = [0]
            [2 -1][x2]   [0]

so x2 = 2x1, and we may take

x = [1]
    [2]

The eigenvector corresponding to λ = 3 is a non-zero vector y such that (A - λI)y = 0.

Let y = (y1, y2)^T. Then

(A - 3I)y = [1 -1][y1] = [0]
            [2 -2][y2]   [0]

so y1 = y2, and we may take

y = [1]
    [1]

Thus the eigenvectors corresponding to the eigenvalues 2 and 3 are respectively (1, 2)^T
and (1, 1)^T.

Definition 1.6 The spectral radius ρ(A) of a matrix A is defined by

ρ(A) = max |λ_i|, where the λ_i are the eigenvalues of A.

Note that if λ is complex, say λ = a + bi, then |λ| = sqrt(a^2 + b^2).

Theorem 1.2 If A is an n x n matrix, then

(i) ||A||_2 = sqrt(ρ(A^T A));
(ii) ρ(A) <= ||A||, for any natural norm ||.||.

Definition 1.7 The n x n matrix A is said to be convergent if, for each 1 <= i, j <= n,

lim_{k→∞} (A^k)_ij = 0.
Example 1.7
Determine whether or not the following matrix is convergent.

A = [1/2   0 ]
    [1/4  1/2]

Solution
By computing powers of A we obtain

A^2 = [1/4   0 ]    A^3 = [1/8    0 ]                      A^k = [(1/2)^k       0   ]
      [1/4  1/4] ,        [3/16  1/8] ,  and in general          [k/2^(k+1)  (1/2)^k]

Since lim_{k→∞} (1/2)^k = 0 and lim_{k→∞} k/2^(k+1) = 0, we conclude that A is
convergent.
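Numerically, the powers of A shrink toward the zero matrix, and the spectral radius (which is 1/2 < 1 here) confirms convergence, consistent with Proposition 1.1 below:

```python
import numpy as np

# Powers of the matrix from Example 1.7 shrink toward the zero matrix,
# which is the defining property of a convergent matrix.
A = np.array([[0.5,  0.0],
              [0.25, 0.5]])

A50 = np.linalg.matrix_power(A, 50)
print(np.abs(A50).max())  # every entry is essentially zero by k = 50

# The spectral radius confirms it: rho(A) = 0.5 < 1.
rho = max(abs(np.linalg.eigvals(A)))
print(rho)
```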

Proposition 1.1 If an n x n matrix A is convergent, then

(a) ρ(A) < 1;
(b) lim_{n→∞} ||A^n|| = 0, for any natural norm;
(c) lim_{n→∞} A^n x = 0, for any vector x.
1.5 Iterative Methods
These are methods that use numerical techniques to approximate the solution of a
system of linear equations. A system Ax = b can be written in the form x = Tx + c for
some matrix T, where x and c are vectors.
We are going to discuss three iterative methods: Jacobi, Gauss-Seidel and SOR.

1.5.1 Jacobi Iterative Method

The method performs iterations k = 1, ..., N, provided that an initial solution of the
system is given. Before the iteration process starts, the original system must be
written in an appropriate form. For Jacobi, the k-th iteration is given in general by

x_i^(k) = (1/a_ii) [ b_i - Σ_{j=1, j≠i}^{n} a_ij x_j^(k-1) ]

Thus, to perform a given iteration we need only the solution of the previous iteration.
In matrix form, writing A = D - L - U (where D is the diagonal part of A, and -L and -U
are its strictly lower- and upper-triangular parts), the Jacobi iteration is

x^(k) = D^(-1)(L + U) x^(k-1) + D^(-1) b

We denote the matrix D^(-1)(L + U) by T_J and the vector D^(-1) b by c_J.
Therefore, for Jacobi we have

x^(k) = T_J x^(k-1) + c_J

Example 1.8
Use the Jacobi method to approximate the solution of the system

x1 + x2 - 2x3 + x4 = 8
2x1 + 2x2 - 3x3 + 3x4 = 20
x1 - x2 + x3 = -2
-x1 - x2 + 4x3 + 3x4 = 4

Perform only three iterations, given the initial solution (2, 2, 2, 2).

Solution
The system is first written in the appropriate form as follows:

x1^(k) = 8 - x2^(k-1) + 2x3^(k-1) - x4^(k-1)
x2^(k) = (1/2)(20 - 2x1^(k-1) + 3x3^(k-1) - 3x4^(k-1))
x3^(k) = -2 - x1^(k-1) + x2^(k-1)
x4^(k) = (1/3)(4 + x1^(k-1) + x2^(k-1) - 4x3^(k-1))

Now, when k = 1 (first iteration):

x1^(1) = 8 - 2 + 2(2) - 2 = 8.000
x2^(1) = 0.5(20 - 2(2) + 3(2) - 3(2)) = 8.000
x3^(1) = -2 - 2 + 2 = -2.000
x4^(1) = (1/3)(4 + 2 + 2 - 4(2)) = 0.000

The first iteration gives x^(1) = (8.000, 8.000, -2.000, 0.000).

When k = 2 (second iteration):

x1^(2) = 8 - 8 + 2(-2) - 0 = -4.000
x2^(2) = 0.5(20 - 2(8) + 3(-2) - 3(0)) = -1.000
x3^(2) = -2 - 8 + 8 = -2.000
x4^(2) = (1/3)(4 + 8 + 8 - 4(-2)) = 28/3 = 9.333

The second iteration gives x^(2) = (-4.000, -1.000, -2.000, 9.333).
And so on.
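The Jacobi update can be sketched in a few lines of plain Python; the coefficient matrix below encodes the system of Example 1.8, and each component update uses only values from the previous iteration:

```python
# A small Jacobi iteration for the system of Example 1.8.
A = [[ 1,  1, -2,  1],
     [ 2,  2, -3,  3],
     [ 1, -1,  1,  0],
     [-1, -1,  4,  3]]
b = [8, 20, -2, 4]

x = [2.0, 2.0, 2.0, 2.0]  # initial solution
n = len(b)

for k in range(2):  # two iterations, as in the worked example
    x_new = []
    for i in range(n):
        # Sum over all off-diagonal terms, using the PREVIOUS iterate only.
        s = sum(A[i][j] * x[j] for j in range(n) if j != i)
        x_new.append((b[i] - s) / A[i][i])
    x = x_new

print([round(v, 3) for v in x])  # [-4.0, -1.0, -2.0, 9.333]
```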

1.5.2 Gauss-Seidel Iterative Method

This is an improvement of the Jacobi method. It uses both the approximate solutions of
the previous iteration and those components of the current iteration that are already
available. The general method is given by

x_i^(k) = (1/a_ii) [ b_i - Σ_{j=1}^{i-1} a_ij x_j^(k) - Σ_{j=i+1}^{n} a_ij x_j^(k-1) ]

In matrix form this can be written as

x^(k) = T_g x^(k-1) + c_g

where T_g = (D - L)^(-1) U and c_g = (D - L)^(-1) b.
Example 1.9
Use Gauss-Seidel to find the approximate solution of the following system. Perform
only two iterations, using x^(0) = (1, 1, 1).

3x1 - 2x2 + x3 = 7.5
4x1 + 3x2 = 14.5
2x1 - x2 + 3x3 = 12.5

Solution
Firstly we put our equations in the appropriate form as follows:

x1^(k) = (1/3)(7.5 + 2x2^(k-1) - x3^(k-1))
x2^(k) = (1/3)(14.5 - 4x1^(k))
x3^(k) = (1/3)(12.5 - 2x1^(k) + x2^(k))

Now, when k = 1, with x^(0) = (1, 1, 1) we have

x1^(1) = (1/3)(7.5 + 2(1) - 1) = 8.5/3 = 2.833
x2^(1) = (1/3)(14.5 - 4(2.833)) = 3.168/3 = 1.056
x3^(1) = (1/3)(12.5 - 2(2.833) + 1.056) = 7.89/3 = 2.630

Thus x^(1) = (2.833, 1.056, 2.630).

When k = 2, we have

x1^(2) = (1/3)(7.5 + 2(1.056) - 2.630) = 6.982/3 = 2.327
x2^(2) = (1/3)(14.5 - 4(2.327)) = 5.192/3 = 1.731
x3^(2) = (1/3)(12.5 - 2(2.327) + 1.731) = 9.577/3 = 3.192

Thus x^(2) = (2.327, 1.731, 3.192), as required.
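A Gauss-Seidel sketch for the system of Example 1.9 needs only one change relative to Jacobi: each component is updated in place, so freshly computed values are used immediately. (The printed values may differ from the hand computation in the last digit, since the worked example rounds its intermediate results.)

```python
# Gauss-Seidel for the system of Example 1.9.
A = [[3, -2, 1],
     [4,  3, 0],
     [2, -1, 3]]
b = [7.5, 14.5, 12.5]

x = [1.0, 1.0, 1.0]  # initial guess x^(0)
n = len(b)

for k in range(2):  # two iterations, as in the worked example
    for i in range(n):
        s = sum(A[i][j] * x[j] for j in range(n) if j != i)
        x[i] = (b[i] - s) / A[i][i]  # in-place update = Gauss-Seidel

print([round(v, 3) for v in x])
```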

Lemma 1.1 If ρ(T) < 1, then (I - T)^(-1) exists, and

(I - T)^(-1) = I + T + T^2 + ... = Σ_{s=0}^{∞} T^s.

Theorem 1.3 For any x^(0) ∈ R^n, the sequence {x^(k)}, k >= 0, defined by

x^(k) = T x^(k-1) + c, for k >= 1,

converges to the unique solution of x = Tx + c if and only if ρ(T) < 1.

1.5.3 Successive Over-Relaxation (SOR) Method

This is an improvement of the Gauss-Seidel method. It introduces a relaxation
parameter ω, with 1 < ω < 2, which tends to increase the speed of convergence of the
Gauss-Seidel method. The general method is given by

x_i^(k) = (1 - ω) x_i^(k-1) + (ω/a_ii) [ b_i - Σ_{j=1}^{i-1} a_ij x_j^(k) - Σ_{j=i+1}^{n} a_ij x_j^(k-1) ]

In matrix form we have

x^(k) = T_ω x^(k-1) + c_ω

where T_ω = (D - ωL)^(-1) [(1 - ω)D + ωU] and c_ω = ω(D - ωL)^(-1) b.

Example 1.10
Repeat Example 1.9 by applying the SOR method with ω = 1.25.

Solution
The system is written in the appropriate form as follows:

x1^(k) = -0.25 x1^(k-1) + (1.25/3)(7.5 + 2x2^(k-1) - x3^(k-1))
x2^(k) = -0.25 x2^(k-1) + (1.25/3)(14.5 - 4x1^(k))
x3^(k) = -0.25 x3^(k-1) + (1.25/3)(12.5 - 2x1^(k) + x2^(k))

Now, when k = 1, with x^(0) = (1, 1, 1) we have

x1^(1) = -0.25(1) + (1.25/3)(7.5 + 2(1) - 1) = -0.25 + 3.542 = 3.292
x2^(1) = -0.25(1) + (1.25/3)(14.5 - 4(3.292)) = -0.25 + 0.555 = 0.305
x3^(1) = -0.25(1) + (1.25/3)(12.5 - 2(3.292) + 0.305) = -0.25 + 2.592 = 2.342

Thus x^(1) = (3.292, 0.305, 2.342).

When k = 2, we have

x1^(2) = -0.25(3.292) + (1.25/3)(7.5 + 2(0.305) - 2.342) = -0.823 + 2.403 = 1.580
x2^(2) = -0.25(0.305) + (1.25/3)(14.5 - 4(1.580)) = -0.076 + 3.408 = 3.332
x3^(2) = -0.25(2.342) + (1.25/3)(12.5 - 2(1.580) + 3.332) = -0.586 + 5.280 = 4.695

Thus x^(2) = (1.580, 3.332, 4.695).
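In code, SOR is the Gauss-Seidel update blended with the previous iterate through the relaxation parameter; the sketch below repeats Example 1.10 with ω = 1.25. (As before, rounding of intermediates in the hand computation can shift the last digit.)

```python
# SOR for Example 1.10: Gauss-Seidel blended with the previous iterate.
A = [[3, -2, 1],
     [4,  3, 0],
     [2, -1, 3]]
b = [7.5, 14.5, 12.5]
omega = 1.25

x = [1.0, 1.0, 1.0]  # initial guess x^(0)
n = len(b)

for k in range(2):  # two iterations, as in the worked example
    for i in range(n):
        s = sum(A[i][j] * x[j] for j in range(n) if j != i)
        # (1 - omega) keeps part of the old value; omega scales the
        # Gauss-Seidel correction.
        x[i] = (1 - omega) * x[i] + omega * (b[i] - s) / A[i][i]

print([round(v, 3) for v in x])
```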

Theorem 1.4 (Ostrowski-Reich theorem) If A is a positive definite matrix and
0 < ω < 2, then the SOR method converges for any choice of the initial guess x^(0).

Theorem 1.5 If A is a positive definite and tridiagonal matrix, then
ρ(T_g) = [ρ(T_J)]^2 < 1, and the optimal choice of ω for the SOR method is

ω = 2 / (1 + sqrt(1 - [ρ(T_J)]^2)).

With this choice of ω, we have ρ(T_ω) = ω - 1.

Example 1.11
Given the positive definite and tridiagonal matrix

A = [4  3  0]
    [3  4 -1]
    [0 -1  4]

obtain the optimal relaxation parameter ω, and hence find ρ(T_g) and ρ(T_ω).

Solution
We have

D = [4 0 0]               [1/4  0   0 ]
    [0 4 0] , so D^(-1) = [ 0  1/4  0 ]
    [0 0 4]               [ 0   0  1/4]

L = [ 0 0 0]          [0 -3 0]
    [-3 0 0]  and U = [0  0 1]
    [ 0 1 0]          [0  0 0]

Then

T_J = D^(-1)(L + U) = [ 0    -0.75   0  ]
                      [-0.75   0    0.25]
                      [ 0     0.25   0  ]

T_J - λI = [ -λ    -0.75   0  ]
           [-0.75   -λ    0.25]
           [ 0     0.25   -λ  ]

So

det(T_J - λI) = -λ(λ^2 - 0.625)

The solutions of det(T_J - λI) = 0 are λ1 = 0, λ2 = sqrt(0.625), λ3 = -sqrt(0.625).
Thus

ρ(T_J) = sqrt(0.625) ≈ 0.7906

Then we have

ω = 2 / (1 + sqrt(1 - [ρ(T_J)]^2)) = 2 / (1 + sqrt(1 - 0.625)) ≈ 1.24

Hence

ρ(T_g) = [ρ(T_J)]^2 = 0.625 and ρ(T_ω) = ω - 1 = 1.24 - 1 = 0.24

The example shows that ρ(T_ω) < ρ(T_g) < ρ(T_J) < 1, which implies that SOR converges
faster than Gauss-Seidel, and Gauss-Seidel converges faster than Jacobi.
Therefore the fastest-converging of the three iterative methods is SOR.
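The spectral radii of Example 1.11 can be confirmed numerically. The sketch below builds the splitting A = D - L - U in the convention used in these notes, forms the three iteration matrices, and computes their spectral radii:

```python
import numpy as np

# Numerically confirming Example 1.11 for the tridiagonal matrix A.
A = np.array([[4.0,  3.0,  0.0],
              [3.0,  4.0, -1.0],
              [0.0, -1.0,  4.0]])

D = np.diag(np.diag(A))
L = -np.tril(A, -1)   # A = D - L - U splitting, as in these notes
U = -np.triu(A, 1)

def rho(M):
    """Spectral radius: largest eigenvalue modulus."""
    return max(abs(np.linalg.eigvals(M)))

TJ = np.linalg.inv(D) @ (L + U)                      # Jacobi
Tg = np.linalg.inv(D - L) @ U                        # Gauss-Seidel
w = 2 / (1 + np.sqrt(1 - rho(TJ) ** 2))              # optimal omega
Tw = np.linalg.inv(D - w * L) @ ((1 - w) * D + w * U)  # SOR

print(rho(TJ), rho(Tg), rho(Tw), w)  # approx 0.7906, 0.625, 0.2404, 1.2404
```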
