Induction is used to prove a sequence of
statements ,
,
, .... There may be finitely many
statements, but often there are infinitely many.
For example, consider the statement
This is actually an infinite set of statements, one for each integer
:
You might think of proving this result by induction --- and in fact, I'll do so below.
On the other hand, this statement is also an infinite set of statements:
However, there is one for each real number. You are unlikely to prove this by induction.
The fact that a statement involves integers does not mean induction is appropriate. For example, consider the statement
"The sum of two even numbers is even."
It's true that this is a statement about an infinite set of pairs of integers.
However, the problem with using induction here is that there isn't an obvious way to arrange the statements in a sequence.
Likewise, consider the statement
"If n is an integer, then is odd."
While it's conceivable that you could give an induction proof, it
would be overkill: expresses
as 2 times something plus 1, so it must be odd.
How does induction work? Suppose you have a sequence of statements
,
, ...,
, ..., one for each
positive integer. (There's no harm in starting your numbering with 0
if it's convenient.) In the simplest case, to give a proof by
induction:
1. Prove .
2. Let , and assume
is
true.
3. Using the assumption that is true, prove that
must be true.
If you can complete these steps, you can conclude that
is true for all
, by induction.
The assumption that is true is often called the induction hypothesis, or the
inductive assumption.
Why does it work? Induction follows from a fundamental fact about the positive integers called the Well-Ordering Axiom.
Well-Ordering Axiom. Every nonempty subset of the positive integers has a smallest element.
The Well-Ordering Axiom is an axiom for the positive integers, so there's no question of proving it.
Example. Verify the Well-Ordering Axiom for the following subsets of positive integers:
(a) The positive even integers: .
(b) The subset of the positive integers which consists of positive integers whose decimal representations begin and end with "1". For example, 101, 128391, and 1651 are elements of this set.
(a) The smallest element of this set is 2.
(b) The smallest element of this set is 1.
Theorem. (Principle of
Mathematical Induction) Let ,
, ... be a set of statements indexed by the positive integers.
Suppose that:
(a) is true.
(b) For every , if
is true, then
is true.
Then is true for all
.
The intuitive idea is illustrated below:
Assumption (a) says that is true. Assumption (b) says that
if a statement is true, the next statement below it is true. I've
represented this with a "C-shaped" arrow connector from the
known true statement to the next one. As you move the connector down,
you find that
being true makes
true,
being true makes
true, and so on. Thus, all the statements
are true.
Proof. I'll prove the result by contradiction.
Assume that is true, and for every
, if
is true, then
is true. Suppose that it
isn't true that
is true for all
.
Consider the set of integers such that
is false. Since
is not true for all
, there must be some n for which
is false. Hence, the set of integers
such that
is false is nonempty.
Since this is a nonempty subset of the positive integers, by
Well-Ordering it contains a smallest element MIN. Thus, is false, and
is not false (i.e.
is true) for all
.
Now MIN can't be 1, because is true by assumption. Thus,
. So
is a positive integer, and
must be true.
But now my second assumption --- if a statement in the sequence is
true, the following statement is true --- shows that must be true. This contradicts
the fact that
is false.
This contradiction proves that is true for all
.
Remark. You can also do the induction step by
assuming the result for n, and proving it for , and I will often do this. Usually, there is no
difference in the difficulty, so you should use whichever approach
you find comfortable.
Also, some inductions begin with a statement numbered "0" --- or "42". The numbering doesn't matter.
Example. Prove that
The formula is true for , since the left side is 1 and the
right side is
Suppose the formula holds for n. This means that
I want to prove that the formula holds for .
Suggestion: You might want to write the
formula for out on scratch paper so you know what
you're trying to prove. You get it by replacing "n" with
"
" in the formula for n:
Don't start with this equation! --- that would be assuming what you want to prove. Instead, start with the left side and work until you get the right side.
This proves the formula for . Hence, the result is true for
all
, by induction.
Notice that when I wrote out the left side of the formula for I put in both the final term "
" and the term before it, which is
"n". In that way, I could see the expression "
" which is the left side of my
induction assumption --- so I knew to begin by substituting. This
trick is often useful in the induction step.
Example. Prove that for
The formula is true for , since the left side is 14 and
the right side is
Suppose the formula holds for n. This means that
I need to prove the formula for .
On scratch paper, I write out the formula for ; it is
Remember not to start with this equation --- start with the
left side! Notice that on the left side I wrote out the final term
( ), then wrote out the previous
term (
). This helps me recognize the left side of the
induction assumption.)
Continuing with our proof, here's the proof of the formula for . I won't write out justifications for steps this
time, as most of it is basic algebra.
This proves the result for , so the result is true for all
by induction.
Are you wondering how I made the final step?
The idea is that I knew from my scratch work that the final answer
had to be . So I wrote it down,
then checked that it worked by multiplying out
to see if I got
. Nobody
"knows" how you got the final step --- the question is
whether it's true!
Example. Let ,
. Prove that
You're allowed to "begin" an induction with --- or any other integer --- if it's convenient to do
so.
For ,
The result is true for .
(For variety, I'll do this induction step by assuming the result for
and proving it for n. The amount of work is about the
same as assuming it for n and proving it for
.)
Assume that , and that the result holds for
:
I want to show that the result holds for n.
This proves the result for n, so the result is true for all , by induction.
Example. Let ,
. Prove that
You may have seen this idea when you learned about infinite series; it's called telescoping:
The "argument" is that the terms cancel in pairs, leaving
.
Arguments where you say "and so on" are often justified rigorously by induction. That's what I'll do here.
For ,
The result is true for .
Assume that the result is true for n:
I want to show that the result holds for .
This proves the result for , so the result is true for all
, by induction.
In some situations, it's convenient to give an induction proof in the following form:
1. Prove .
2. Let , and assume
is true for
all
.
3. Using the assumption that is true for all
, prove that
must be true.
In other words, you can take as your induction hypothesis the truth
of all the statements prior to the statement, which you are trying to prove. This is
sometimes called generalized induction or strong induction.
Example. Define a sequence of integers by
Prove that
To get the induction started, I have to verify the formula for both
and
. The reason is that the recursion
formula doesn't kick in till
.
For , the formula gives
For , the formula gives
The formula holds for and
.
Now assume that and the formula holds for all
, i.e. for all numbers less than n. In particular,
since
and
are less than n, I have
The fact that I need both of these formulas to do the induction step
is why I'm using a generalized or strong induction. If I did a
standard induction where I just assumed the result for , I would not have it for
.
I have to show that the formula holds for n. I start with the
recursion formula that defines , substitute the
results for
and
, and do some algebra:
This proves the result for n, so the result holds for all by induction.
Here are some remarks about the algebra I did. My target expression
has a power of 7 and a power of
-2. So I began by grouping terms into powers of 7 and powers of -2.
Next, to try to make what I had look like what I wanted, I factor out
and
. Then
simplifying what was left produced the powers (
and
) that I needed to finish. Perhaps it looks like
magic, but as we've seen in other proofs, if your steps are correct,
it has to work --- you know what the final expression should be.
In the examples that follow, I use induction to prove inequalities and a divisibility result. The general setup for the inductions remains the same --- it's just the specific work that differs.
Example. Prove that
For ,
The result holds for .
Assume that the result holds for n:
I'll prove it for . To get the proof started, I make a
"
" term appear using algebra, so I can use the
induction assumption.
(In the inequality which gave the second-to-the-last line, I used the
facts that and
to replace the existing terms
so I could match the target.)
This proves the result for , so the result is true for all
by induction.
Example. Prove that for ,
In this case, the basis step is , because the result is to be
proved for
. (Try some numbers and you'll find the
can be bigger than
for some numbers
smaller than 6 .)
For ,
The result is true for .
Assume that the result holds for n:
I have to prove the result for .
Thus, I have
On scratch paper, I see that my target inequality is
So my strategy will be to break up the that I have so far
into 4 pieces, each of which will be greater than or equal to one of
the 4 pieces of
. I will do this in steps.
Since and
, I have
Again, since , multiplying both sides by
, I get
Since , it follows that
. So taking the last inequality and adding another
term, I have
Finally, gives
Adding the inequalities (1), (2), (3), and (4), I have
Combine this with the inequality I proved earlier:
This proves the result for , so the result is true for all
by induction.
There are other ways to prove the key inequality . For example, you could use
calculus: After some algebra, you get the inequality
. Take
. Note that
, and show that
for
, so the function increases.
This proof is a little involved, so for starters try to understand why each step is correct before trying to grasp the whole. Don't feel bad if you think that you wouldn't have figured this out by yourself! There are math problems that stump even the best mathematicians. The more experience you get, the more you will be able to do by yourself.
For the next example, remember that if m and n are integers, m
divides n (in symbols, ) if
for some integer
n.
Example. Prove that for ,
You can do this proof directly, without induction. But I'll give an induction proof by way of example.
For , I have
Since , the result holds for
.
Assume that the result holds for n:
I can write this as for some integer k, or
I must prove the result for . As usual, the first thing I'll
do is to work on the given expression to get (part of) the induction
assumption to appear.
I have the " " which is part of the induction
assumption. Now
means
for some integer k, or
Substituting this into the expression above, I get
All together,
Therefore, . This proves the result for
, so the result holds for
by induction.
Here's a sketch of a direct proof: Write . Then use the formula
Lines and regions in the plane. This is an extended example which illustrates how you might "guess" a formula, then confirm it using induction.
Suppose we draw lines in the plane, one after another:
Each line that we draw should satisfy the following conditions:
1. It should not be parallel to any of the previous lines.
2. It should intersect all of the previous lines.
3. It should not pass through the intersection point of any two of the previous lines.
The lines in the picture above satisfy these conditions.
Question: Into how many regions do the lines divide the plane?
Before we even consider this question, we should be careful that we can actually do what we require. In particular, how do we know we can satisfy conditions 2 and 3?
Try it yourself (maybe with a drawing program on a computer). It starts to become a little tricky to get the next line to cross all of the previous lines. Imagine that you had 100 lines and you were trying to draw one more!
I'll prove this is possible by induction.
Proposition. Suppose there are n lines in the plane, each of which intersects all the others, and such that no three lines intersect. Then there is a line which intersects all n lines, and does not go through any of their intersections.
Proof. From geometry, two lines are parallel if and only if they have the same slope. So if two lines have different slopes, they aren't parallel, and they must intersect. We'll use this to give an induction proof.
We can get started and draw the first line wherever we want.
Then suppose we've draw lines ,
, and so on, all the
way up to
. We want to draw the
line so
that it intersects the n previous lines.
Suppose the slope of is
, the slope of
is
, and so on. We have n slopes:
These are n different numbers (where a "number" could include the designation "undefined" if one of the lines is vertical). The numbers are different because these n lines intersect, and these numbers are their slopes.
Pick a number which is different from all of these m's.
To make the arguments that follow easier, I'll also assume
is not "undefined". I can pick such a
number
because there are infinitely many real
numbers, so there are real numbers different from the previous m's
and "undefined".
Consider a line with slope
. Since
is different from all the other
m's, the line
must intersect the n previous lines.
How can we ensure that this new line does not go through the
intersections of any of the previous lines? The n previous lines
intersect in a finite number of points. Consider the lines with slope
which go through these intersection points.
In the picture above, the dots represent the intersection point of
the previous lines, and the lines I've drawn show the lines with
slope which pass through them.
Each of those lines has a y-intercept, so there are finitely many such y-intercepts.
Let's say the y-intercepts are ,
, and so on. Because there are only finitely many b's, I can choose a
number b different from all of them.
Consider the line . It has slope
, so it intersects all the previous lines. Its
y-intercept is different from the y-intercepts of all the
intersection point lines, so it does not pass through any of the
intersection points.
This proves the result for n lines, so it's true for any number of
lines, by induction.
At this point, we know our question makes sense, so we can think about answering it. Let's see if we can see a pattern.
We go from 2 regions to 4 regions, and . We go from 4
regions to 7 regions, and
. If you draw a picture,
you'll find that with 4 lines there are 11 regions, and
. Let's set up some notation. Let
be the number of regions made by n lines. It appears that
The formula seems to work for .
Let's see if we can verify this with a picture. Suppose I have n
lines. When I draw the line, it must cross each of
the previous n lines. Schematically:
As the new line crosses , it splits the region on the left
of
, and one new region is added. As the new line
crosses
, it splits the region between
and
, and one new region is added. And so on. All
together,
new regions are added. That is,
regions are added to
to get
.
This recursion isn't bad, but it would be nice to have a
"closed-form" expression which gives
in terms of n. We'll show how to go from the recursion to such an
expression in the next theorem.
Theorem. Suppose , and a sequence
is defined recursively by
, and
Then for ,
Proof. Method 1. This method doesn't use induction, but I'll point out afterward some details which it omits.
Write out the recursion for
, then add the equations:
I subtract from both sides:
Now
Also, . Plugging these into the equation for
, I get
Method 2. For , the formula gives
The result holds for .
Assume that the result is true for n:
I will prove it for :
This proves the result for , so the result is true for all
by induction.
The first method may seem pretty appealing --- you may have seen similar techniques used in computing values of infinite series ("telescoping"). Depending on what you're willing to assume, this may be adequate as a proof. Let's note a couple of things.
First, what does " " mean?
Maybe it's obvious to you: It means "add up the numbers
,
, and so on up to
. However, addition is a binary
operation: It tells you how to add 2 numbers at a time, not 100
numbers or
numbers. Adding more than two numbers at a
time actually requires a definition.
Well, you might say: Just add the numbers two at a time. And in fact,
you can define a sum of more than 2 numbers recursively in this way:
If you already know how to add n numbers, then to add numbers, you can make the following definition:
The right side is a sum of two numbers, so this works.
But a second issue arises, and that is that the definition above picks a particular way of grouping the numbers when you add. Now the associative law for addition tells you that you get the same thing when you group 3 numbers for addition in two different ways:
However, when I write " "
I'm implying that if I add any number of numbers, I get the
same result regardless of how I group the additions. For example,
Try deriving the last line using associativity for just three numbers. We're saying that the same thing works for any grouping, and any number of numbers. And it's true! How do you prove it? You use induction! I won't give the proof here, but you might try it yourself as a challenge.
In other math courses, you didn't worry about these issues, and things work the way you'd expect. But as you've already noticed, proving things is a lot harder than simply know or believing that they're true.
Copyright 2019 by Bruce Ikenaga