Why is mathematical induction a valid proof technique? [duplicate]

Consider the following definition of mathematical induction (adapted from David Gunderson's book Handbook of Mathematical Induction):


Principle of mathematical induciton: For some fixed integer $b$, and for each integer $n\geq b$, let $S(n)$ be a statement involving $n$. If

  1. $S(b)$ is true, and
  2. for any integer $k\geq b, S(k)\to S(k+1)$,

then for all $n\geq b$, the statement $S(n)$ is true.


Mathematical induction's validity as a valid proof technique may be established as a consequence of a fundamental axiom concerning the set of positive integers (note: this is only one of many possible ways of viewing induction--see the addendum at the end of this answer). The following statement of this axiom is adapted from John Durbin's book Modern Algebra, wherein it is called the Least Integer Principle, but it is often referred to as the Well-Ordering Principle or WOP. The principle is as follows:

Well-Ordering Principle: Every nonempty set of positive integers contains a least element.

The validity of mathematical induction, in this context where we are using the WOP to prove the validity of mathematical induction, is established by using a proof by contradiction. This proof will contain several "steps" or "parts." Before giving all of the steps to the proof of mathematical induction, it may be useful to reformulate the definition of the proof technique in terms of the notation that will be used throughout the sequence of steps in the explanation (for consistency and facilitated understanding):


Reformulated principle of mathematical induction: For some fixed integer $1$, and for each integer $n\geq 1$, let $S(n)$ be a statement involving $n$. If

  1. $S(1)$ is true, and
  2. for any integer $k\geq 1, S(k)\to S(k+1)$,

then for all $n\geq 1$, the statement $S(n)$ is true.

Let 1 and 2 above be denoted by $(\dagger)$ and $(\dagger\dagger)$, respectively.


Steps of the proof that mathematical induction is a consequence of the WOP:

  1. Start by supposing that $S(1)$ is true and that the proposition $S(k) \rightarrow S(k+1)$ is true for all positive integers $k$, i.e., where $(\dagger)$ and $(\dagger\dagger)$ hold as indicated above.

  2. The goal is to verify whether or not $S(n)$ is true for all $n \geq 1$ if $S(1)$ and $S(k) \rightarrow S(k+1)$ are true. The statement of mathematical induction above indicates that $S(n)$ will logically follow if $S(1)$ and $S(k) \rightarrow S(k+1)$ are true, but does $S(n)$ really follow if $(\dagger)$ and $(\dagger\dagger)$ are true? If yes, then mathematical induction is a valid proof technique. If not, then it is mere rubbish.

  3. We are skeptics, and we think that mathematical induction is a sham (hint: a proof by contradiction is about to take place). Our skepticism induces us to assume that there is at least one positive integer for which $S(n)$ is false [keep in mind that we are assuming that $(\dagger)$ and $(\dagger\dagger)$ are true, albeit we are disputing whether or not $S(n)$ really follows from their truthfulness]. Surely there is at least one positive integer for which $S(n)$ is false even though $S(1)$ and $S(k) \rightarrow S(k+1)$ are true.

  4. Let $\mathcal{P}$ denote the set of all positive integers for which $S(n)$ is false. Is this set empty? We think not---after all, we are assuming that there is at least one positive integer, say $\ell$, for which $S(n)$ is false; that is, the assumption is that $S(\ell)$ is false, where $\ell \in \mathcal{P}$. Are there other positive integers in $\mathcal{P}$? Perhaps, but we cannot say for certain at the moment. We can, however, declare with certainty that $\mathcal{P}$ has a least element by the well-ordering principle. We let the least element of $\mathcal{P}$ be $\ell$ without loss of generality.

  5. Since $S(1)$ is true, we know that $\ell \neq 1$, and because $\ell$ is positive and greater than $1$, we also know that $\ell -1$ must be a positive integer. Moreover, because $\ell -1$ is less than $\ell$, it should be clear that $\ell -1$ cannot be in $\mathcal{P}$ [this is because $\ell$ is the least element in $\mathcal{P}$, meaning that any lesser positive integer cannot be in $\mathcal{P}$]. Notationally, we have it that $\ell \in \mathcal{P}$ and $(\ell - 1) \notin \mathcal{P}$. What does this mean? Since $(\ell - 1) \notin \mathcal{P}$ and $\ell -1$ is a positive integer, and because $\mathcal{P}$ is the set of all positive integers for which $S(n)$ is false, it must be that $S(\ell-1)$ is true.

  6. Finally, recall that we maintained that $(\dagger\dagger)$ was true; that is, $S(k) \rightarrow S(k+1)$ is true for any integer $k \geq 1$. Since $\ell$ and $\ell -1$ are both positive integers, we may let $k = \ell -1$ and $k+1 = \ell$. Substituting these values into the implication that we assume to be true, we get that $S(\ell-1) \rightarrow S(\ell)$. Do you see the problem now (and hence the conclusion of the proof by contradiction)?

  7. By supposing that $(\dagger)$ and $(\dagger\dagger)$ held while also supposing that $S(n)$ was false for some positive integer $\ell$, we deduced through a series of steps that $S(\ell-1) \rightarrow S(\ell)$ [by $(\dagger\dagger)$ where $k = \ell -1$ and $k+1 = \ell$]. What is wrong with this? Simply consider the following three assertions that occur within the proof:

    • $S(\ell-1)\to S(\ell)\qquad$ [True---by supposition $(\dagger\dagger)$]
    • $S(\ell)\qquad$ [False---because of the assumption that $S(n)$ was false for $\ell\in\mathbb{Z^+}$]
    • $S(\ell-1)\qquad$ [True---by the well-ordering principle]

    The logical issue should now be apparent. We know $S(\ell-1) \rightarrow S(\ell)$ is true by our original supposition, but this implication cannot be true if $S(\ell)$ is false. Why? Because an implication of the form $p \rightarrow q$ is only false when the hypothesis, $p$, is true and the conclusion, $q$, is false. In our own case, since $S(\ell-1)$ is true, the implication $S(\ell-1) \rightarrow S(\ell)$ is only true when $S(\ell)$ is also true, thus contradicting the choice of $\ell$. Consequently, $S(n)$ must be true for every positive integer $n$.


Addendum: It may be of interest to other students taking discrete mathematics courses that the form of induction proved above (often referred to simply as "induction") is actually equivalent to both strong induction and the WOP. This may be surprising, but there is a good paper about the Equivalence of Three Variations on Inducton for readers who are interested.

The basic idea behind the equivalence proofs is as follows:

  1. Strong induction implies Induction.
  2. Induction implies Strong Induction.
  3. Well-Ordering of $\mathbb{N}$ implies Induction [This is the proof outlined in this answer but with much greater detail]
  4. Strong Induction implies Well-Ordering of $\mathbb{N}$.

Equivalence of Induction, Strong Induction, and Well-Ordering on $\mathbb{N}$ follows after having proved the four implications outlined above (the paper linked to contains details of the proof(s)).

The answer I provided takes care of (3) above, but you can explore the other three to show equivalence if desired.


Your question seems somewhat unclear to me, as it stands, but I'll answer the one in the title, and if the question is updated, I'll address that too.

Mathematical induction can be taken as its own axiom, independent from the other (though, as comments point out, it can be proven as a theorem in common systems like ZF). That is to say, it is more akin to statements like: $$x\cdot (y+1)=x\cdot y + x$$ which are often taken to be part of the definition of multiplication and may not be derivable from more basic statements. The point behind such definitions is to capture some intuitive idea - the above starts to capture the distributive property of multiplication, which we know from intuition to be a reasonable idea. Yet, it is possible that we could not prove the above statement without taking it as an axiom.

Induction basically says every natural number can be "reached" from zero. That is, if we have proven the statements:

  • $P(1)$ is true
  • $P(n)\rightarrow P(n+1).$

Then, we can build a proof for every natural number. For instance, if we wanted to know that $P(5)$ was true, we could infer that, since $P(1)$ is true and $P(1)\rightarrow P(2)$, we have $P(2)$. But $P(2)\rightarrow P(3)$ and $P(3)\rightarrow P(4)$ and $P(4)\rightarrow P(5)$, thus we can deduce $P(5)$ too. Induction merely says that $P(n)$ must be true for all natural numbers because we can create a proof like the one above for every natural. Without induction, we can, for any natural $n$, create a proof for $P(n)$ - induction just formalizes that and says we're allowed to jump from there to $\forall n[ P(n)]$.