What are the applications of the Mean Value Theorem?

I'm going through my first year of teaching AP Calculus. One of the things I like to do is to impress upon my students why the topics I introduce are interesting and relevant to the big picture of understanding the nature of change.

That being said, while I know that the Mean Value Theorem is one of the central facts in the study of calculus, I'm not really clear on why. I feel that it's a bit like IVT for the derivatives of a continuous and differentiable function. But I feel that the only thing I did with it when I studied Calc is to identify the point where the tangent was parallel to the secant of the endpoints. If the class had been a little smaller or I had been a little bolder, I might have raised my hand and asked the professor "So what?" But I didn't, so here we are.

To be clear, I am not at all arguing that MVT is not critical, so I don't plan on the answers being opinion-based. But can you discuss some uses of MVT that justify the lofty place it has in the curriculum?


Solution 1:

When I teach the MVT in a Calculus class, I do three things:

a) Show the one real-world example I know and which everyone gets: Police has two radar controls at a highway, say at kilometre $11$ and at kilometre $20$. Speed limit is $70$ km/h. They measure a truck going through the first control, at 11.11am, at $65$ km/h, and going through the second control at 11.17am, at $67$ km/h. They issue a speeding ticket. Why?

Let the class think about this. Every time I've taught this, someone realised after a short while that the truck passed $9$ km in $6$ minutes, so its average speed was $90$ km/h. Then someone says something like: you cannot go at an average speed of $90$ km/h without ever going at a speed of $90$ km/h (and certainly not without ever going more than $70$ km/h). This is totally common sense, but also it is exactly the MVT. Draw a graph of the position function, realise that the numbers $65$ and $67$ were just red herrings (tangent slopes at the endpoints, irrelevant to the argument), discuss whether there is some way out: Can the function have discontinuites? Well, a jump discontinuity would be a wormhole the truck fell through, or more realistically some shortcut off-highway which would be illegal too. Points where the derivative does not exist? Actually yes, if the truck braked somewhere, but it cannot have done that more than finitely many times, and then we break down the problem into subintervals. Turns out: No, even sharp braking cannot create a "sharp turn" of the function under standard assumptions of physics, see comments by users @leftaroundabout and @llama.

b) Mention that aside from that, it is a "workhorse theorem" which we never see but which makes the entire curve sketching routine work. How do you prove Positive derivative means increasing function: with the MVT. How do you prove Derivative $0$ on an interval means constant: with the MVT. Of course we never think of the proofs of those, we just use them as "well-known", but without MVT, they would not be there.

c) Related to b, it comes up crucially in the Fundamental Theorem later, compare Arturo Magidin's answer. I point it out when I'm there.

Added: As this answer seems to get a lot of attention, I want to put in one more thing part of which I try to get across in class when the MVT is up.

d) The derivative is a cool thing because it carries a lot of information about the original function, but in a subtle way. To the non-initiated, the graphs of an $f$ and $f'$ would most often look totally unrelated. But the initiated, i.e. your calculus class, at this point should already "get" intuitively "hmm, $f'$ is very negative around here, so $f$ should decrease with a steep slope in this neighbourhood". Now the MVT is the one theorem which attaches actual numbers to this intuition, it is the first result which gives an explicit (albeit subtle) relation between values of $f$ and values of $f'$. That is why it underlies the proofs of all the fancy machinery that, later, gives seemingly much stronger relations between $f$ and $f'$, like Curve Sketching, the Fundamental Theorem, Taylor Series, and even L'Hôpital's rules (thanks @JavaMan for pointing out this one). They get all the limelight, but in a way, they all are refined versions of repeated applications of the MVT plus special conditions.


Further update: Since the "speeding" application of the MVT keeps getting mentioned everywhere (and of course I don't even remember where I got it from originally), I googled a little and see that it's been around for quite a while. This educational video of the MAA's from 1966 is almost of historical value (although I hardly understand the voice-over due to its very American accent). As for the question whether this is actually done, thanks to User Bracco23 for providing one source from Italy in a comment. Here is another one from Scotland: http://news.bbc.co.uk/2/hi/uk_news/scotland/4681507.stm The internet has more hearsay and debates: 1 2 3. Cf. also this answer.

Solution 2:

In terms of why students are taught the Mean Value Theorem in Calculus 1, I think it is often obscured because they are often not shown a proof of the Fundamental Theorem of Calculus. But the Mean Value Theorem is a key step in the proof of the first part of the Fundamental Theorem of Calculus. If you want to prove the first part of the Fundamental Theorem of Calculus, the simplest way is to use the MVT:

Namely, to calculate the integral $\int_a^b f'(x)\,dx$, pick a partition of the interval $[a,b]$, $a=x_0\lt x_1\lt\cdots\lt x_n=b$. We want to select points $x_i^*$, $x_{i-1}\leq x_i^*\leq x_i$ to do the Riemann sum $$\sum_{i=1}^{n}f'(x_i^*)(x_i-x_{i-1}).$$ By the Mean Value Theorem, there exists a point $x_i^*$ in $[x_{i-1},x_i]$ such that $f'(x_i^*)(x_{i}-x_i) = f(x_i)-f(x_{i-1})$. So pick those points so that the Riemann sum becomes $$\sum_{i=1}^n f'(x_i^*)(x_i-x_{i-1}) = \sum_{i=1}^n (f(x_i)-f(x_{i-1}))$$ which is a telescoping sum that equals $f(x_n)-f(x_0) = f(b)-f(a)$.

Since every Riemann sum can be selected to equal $f(b)-f(a)$, the limit as the mesh size goes to zero is $f(b)-f(a)$, proving that $$\int_a^b f'(x)\,dx = f(b)-f(a),$$ the first part of the FTC.

(There are other ways of proving the first part of the FTC, e.g., using the second part, but this is still a major highway to the FTC.)

In terms of "motivation", there is the classic one: if you drive at an average speed of 60 mph, is there necessarily an instant where your instantaneous speed is exactly 60 mph? In other words, is this notion of "instantaneous rate of change" reasonable when compared to our much more sensible notion of "average rate of change"?

All in all, if one is not going to prove the FTC, then there is an argument for not emphasizing (or going over) the MVT; though I confess that not having taken the relevant physics course, I am not aware if there is a need for the MVT somewhere in the parallel physics curriculum of dynamics.

Solution 3:

I speak as a person that worked with maths with the typical rule-of-thumb approach of engineers for many years, before deciding to get back to maths and study it thoroughly by myself. What was illutimating to me was actually the following "chain"

  1. Axiom of Completeness
  2. Extreme Value Theorem
  3. Rolle's Theorem

I sort of "see" the Mean Value Theorem as a direct consequence of Rolle's Theorem, in the sense that if $f(x)$ satisfies MVT's hypotheses in $[a,b]$, then $g(x)=f(b) -f(x)-\frac{f(b)-f(a)}{b-a}(b-x)$ satisfies Rolle's Theorem hypotheses in $[a,b]$, and hence its demonstration. Use instead $h(x) = f(b)-f(x)-\frac{f(b)-f(a)}{g(b)-g(a)}(g(b)-g(x))$, and you'll demonstrate Cauchy's generalization of MVT.

As in other comments and answers, it is fascinating that a repeated use of MVT brings to such powerful an instrument as Taylor polynomial.