Tricki

What makes some equations so much easier to solve than others?

Quick description

Solving an equation can frequently be thought of as determining given that , for some function and some number . For example, we might want to find when , or when . If you think about these two examples, you will see that the first is much easier than the second. One sign that it is easier is that you can work out on a calculator using no memory and inputting only once: after putting in, all you have to do is multiply it by 3, add 5, and square it. (This assumes your calculator has an button.)

Example 1

Suppose you are asked to solve the equation . It is rather easy to do: taking square roots tells you that , subtracting 5 gives , and dividing by 3 gives .

Now suppose you are asked to solve the equation . It isn't nearly so easy, and you have to use the full theory of quadratic equations to solve it.

General discussion

What is it that makes the first equation easier? It's that we can see quickly how to undo the operation of turning into . That operation naturally splits into three stages: multiply by 3, add 5, and take the square. So if that gives us the number 12, then we can recover by reversing each of the three stages of the calculation, starting with the last and ending with the first. So we take the square root, subtract 5, and divide by 3.

If we try to do the same thing with , we find we cannot. That is because replacing by doesn't naturally split up into simple stages in the same way, so we can't just do the reverse process to 2.

Perhaps you would like to dispute this. For example, here is a way that one might calculate . We could take multiply it by , add , multiply by 3 again, multiply by , and add 15. Why isn't this a simple process?

It is quite simple (and is in fact quite a useful way of evaluating polynomials) but there is a big difference: to do this process we had to input twice. So if you had a complicated number and wanted to work it out on a calculator with no memory, then you would have to write down and key it in at two points in the calculation.

Thus, the answer to the question in the title of this article is as follows. Some equations are particularly easy because they are of the form , where to work out you start with and do a succession of simple operations on it without inputting again. So if you are told that then you can work out by applying the inverse operations to in the reverse order.

Example 2: two more examples

Here is another example of an easy equation to solve. If , then , so , so . (This uses the fact that for any .)

The same basic principle applies to other sorts of equations too. Here, for example, is a differential equation where the unknown is a function . We are told that . From this it follows that , so , so . The reason this was easy is that to obtain the left-hand side we started with , took its logarithm, squared the result, and differentiated. So to solve the equation we just applied the inverse operations in reverse order to the function .

By contrast, if we want to solve the equation then we will have a much harder problem, because this time we have had to "input twice".