Quick description
Nonlinear equations are ubiquitous in mathematics, and we need practical methods to solve such equations. Methods vary in their reliability and robustness as well as their speed and ease of implementation, as well as applicability. For example, the Newton-Raphson method can be applied to systems of nonlinear equations
and can converge very fast, while bisection is extremely reliable, but is much slower and can only be used for a single scalar equation in one variable.
Prerequisites
Calculus.
Example 1
Bisection is extremely robust but relatively slow for solving
once we have
where
and
have opposite signs.
One simply computes
and determines the sign of
. One then either updates
or
and repeats the process depending on the sign of
to ensure that we again have
and
of opposite signs. Of course, if
, we stop. Normally we stop when
, a pre-determined tolerance.
As long as
is continuous, this method will converge thanks to the intermediate value theorem.
Speed can be improved using so-called bracketing methods such as Dekker's or Brent's method.
Example 2
The so-called Newton-Raphson method is based on the linear approximation
for
. Solving this linearized equation
gives a new estimate
This can be repeated to give further improvements:
The Newton-Raphson method can be guaranteed to converge provided the Jacobian matrix
is non-singular at the solution
, and
is "sufficiently close" to
. When it does converge to
and
is non-singular with
sufficiently smooth, convergence is quadratic; that is,
\to constant
\epsilon
\|f(x_k)\|<\epsilon
O(\log\log(1/\epsilon)
d_k=-\nabla f(x_k)f(x_k)
x_{k+1}\gets x_k+\alpha_k d_k
0<\alpha_k\leq 1
\|f(x_{k+1})\|<\|f(x_k)\|.
This indicates that the Newton-Raphson method is essentially a local method; some sort of "global...f(x)=0
g(x)=0
h(x,\lambda)
h(x,1)=f(x)
h(x,0)=g(x)
h(x,\lambda)=0
(x,\lambda)\in\R^n\times[0,1]
g(x)=0
f(x)=0
h(x,\lambda)=\lambda f(x)+(1-\lambda)(x-a)
a. Provided no solutions "escape to \infty" on this curve, and the curve is smooth, we can use local techniques (such as a modified version of...a
a
x^T f(x)>0
\|x\|=R$) are typically used to ensure that curves do not escape to infinity.
There are also piecewise affine versions of this idea, and in fact this is related to simplex-tableau type algorithms for finding solutions to nonlinear equations and (for example) linear complementarity problems.
Tricki