Quick description
If
is a random variable that takes non-negative values, then the probability that
is at most
This simple result is called Markov's inequality. Clever uses of Markov's inequality can have surprisingly powerful consequences. The main trick is to choose an increasing function
and apply Markov's inequality to the random variable
instead. If
is rapidly growing but
is not too large, then one can prove that the probability that
, which equals the probability that
, is very small.
Prerequisites
A familiarity with basic concepts in probability such as random variables and expectation.
Example 1
Let us begin with a proof of Markov's inequality itself. By the law of total probability we know that
Since
(by hypothesis), this is at least
and from this the result follows.
Tricki
Comments
a suggestion for improvement
Wed, 27/01/2010 - 08:06 — alexHere is a point which I feel needs to be emphasized more in Example 3.
If we try to make the argument work without leaving
to be optimized later, but by, say, initially setting
, the argument will fail: we will get a useless bound. The reason: we need to pick a function
which will grow rapidly while
will grow slowly. In this case,
grows pretty fast with
.
What kind of functions
work for this? Well,
works: out of the
terms in
, almost all have expectation
. Similarly,
works fine as well: in the sum
, we will be able to easily cancel many terms because they have expectation
.
The central insight behind the proof in Example 3 is that picking
for really, really small
works. The reason: the average of
and
is really close to
. That is: near one, inversion is about the same as subtraction. This insight can be pushed a little further to argue that if
has mean
, the quantity
should be pretty close to
. Thus one might hope that
grows slowly - after all, it is "approximately"
.