In the linear system Ax = b suppose the coefficient matrix A is symmetric and positive definite. Solving Ax = b is equivalent to minimizing the quadratic function .
Example: The following linear system of the form Ax = b arises
in connection with the Laplace Finite Difference equation on a 3x3 square
grid.
For this example, the function Q(x) is given by
From calculus, we know that if Q(x) has a minimum at some point x, then all the partial derivatives of Q must be zero at that point. Calculating the partial derivatives of Q with respect to each xi, and setting each partial derivative to zero results in the linear system Ax - b = 0 or Ax = b. From this we see that it is correct to view Ax - b as the gradient of Q and write .
In the method of steepest descent, we make an initial estimate of the solution x to Ax = b and then proceed down gradients of Q to arrive at the minimum of Q and thus arrive at the solution to Ax = b. In this connection, define the residual ri associated with the approximate x i to be
We define our next estimate to the solution of Ax = b to be down gradient of xi by setting
Now it remains to pick a value of that determines how far we go in the down gradient direction defined by ri. We choose so that we minimize
The rest of the steepest descent method follows from calculus: find and set By the chain rule,
By setting ,we can see that for computing xi+1 is given by
The method of steepest descent is easy to program, but it often converges slowly. The main reason for its slow convergence is that the method may spend time minimizing Q along parallel or nearly parallel search directions.