Understanding the Frechet derivative

The only difference is that you've moved everything to one side of the equation.

$$ f'(a) = \lim\limits_{h \to 0} \frac{f(a+h)-f(a)}{h}$$

becomes

$$ 0 = \lim\limits_{h \to 0} \frac{f(a+h)-f(a)}{h} - \frac{hf'(a)}{h} $$

so that

$$ 0 = \lim\limits_{h \to 0} \frac{f(a+h)-f(a)-f'(a)h}{h}$$

chucking in some absolute values doesn't change anything

$$ 0 = \lim\limits_{h \to 0} \frac{|f(a+h)-f(a)-f'(a)h|}{|h|}$$

The utility of defining derivatives this way is that it extends to situations other than that of functions of one variable. Let's recast this once more and instead of sending $h$ to $0$ we can equivalently send $x$ to $a$ ($h=x-a$). Then we get

$$ 0 = \lim\limits_{x \to a} \frac{|f(x)-f(a)-f'(a)(x-a)|}{|x-a|}$$

Now if you replace $x$ and $a$ with vectors, $f$ with a function from vectors to vectors and think of the absolute value as the length of a vector, we have a perfectly reasonable definition for a derivative. Well...except for this "$f'(a)(x-a)$" business.

We need to replace the "number" $f'(a)$ with a linear operator (or a matrix = Jacobian) and then everything makes sense.

By the way, this is my preferred way of presenting derivatives in multivariable calculus. We see the derivative as being a linearization which well approximates our function: $f(x) \approx f(a)+f'(a)(x-a)$ (the tangent). When $f$ is a scalar-valued function, $f'(a)$ is just the gradient. Also, we get that approaching this multivariate limit along coordinate axes reduces to partial derivatives. This then explains why partials (and in fact all directional derivatives) can exist at a point even when a function is not differentiable (a limit can exist along all lines but still fail to exist).


It should be $h$, not $1$.

When we have function from $\Bbb R$ to itself, we have $f(x+h)=f(x)+f'(x)h+o(h)$ whereas in the general case, $f(x+h)=f(x)+D_x(f)(h)+o(\|h\|)$. So $D_x(f)(h)=f'(x)h$ which equals $1\times h=h$ in your case.


The Frechet derivative is the linear operator $h\mapsto f'(x)h$. So in your example it is the operator $h\mapsto h = 1\cdot h$. The Frechet derivative is therefore the identity operator. It now depends on how you want to describe the identity. You could say it is "$h$" but I think it's better to say the derivative is the linear map that multiplies a given number $h$ by $1$. The relevant information in that phrase is the number $1$, so I would say the derivative is $1$.