What did Alan Turing mean when he said he didn't fully understand dy/dx?
Alan Turing's notebook has recently been sold at an auction house in London. In it he says this:
Written out:
The Leibniz notation $\frac{\mathrm{d}y}{\mathrm{d}x}$ I find extremely difficult to understand in spite of it having been the one I understood best once! It certainly implies that some relation between $x$ and $y$ has been laid down e.g. \begin{equation} y = x^2 + 3x \end{equation}
I am trying to get an idea of what he meant by this. I imagine he was a dab hand at differentiation from first principles so he is obviously hinting at something more subtle but I can't access it.
- What is the depth of understanding he was trying to acquire about this mathematical operation?
- What does intuitive differentiation notation require?
- Does this notation bring out the full subtlety of differentiation?
- What did he mean?
See here for more pages of the notebook.
Solution 1:
I'm guessing too, but my guess is that it has something to do with the fact that beyond his initial introduction to calculus Turing (in common with many of us) thinks of a function as the entity of which a derivative is taken, either universally or at a particular point. In Leibnitz's notation, $y$ isn't explicitly a function. It's something that has been previously related to $x$, but it actually is a variable, or an axis of a graph, or the output of the function, not the function or the relation per se.
Defining $y$ as being related to $x$ by $y = x^2 + 3x$, and then writing $\frac{\mathrm{d}y}{\mathrm{d}x}$ to be "the derivative of $y$ with respect to $x$", quite reasonably might seem unintuitive and confusing to Turing once he's habitually thinking about functions as complete entities. That's not to say he can't figure out what the notation refers to, of course he can, but he's remarking that he finds it difficult to properly grasp.
I don't know what notation Turing preferred, but Lagrange's notation was to define a function $f$ by $f(x) = x^2 + 3x$ and then write $f'$ for the derivative of $f$. This then is implicitly with respect to $f$'s single argument. We have no $y$ that we need to understand, nor do we need to deal with any urge to understand what $\mathrm{d}y$ might be in terms of a rigorous theory of infinitesimals. The mystery is gone. But it's hard to deal with the partial derivatives of multivariate functions in that notation, so you pay your money and take your choice.
Solution 2:
I'll try my hand and give one possible rationale behind Turing's confusion over the notation $ \tfrac{\mathrm{d}y}{\mathrm{d}x} $. The short answer is that he appears to take issue with the notation on the grounds that differentiation is a mapping between two function spaces but $y$ looks like a variable.
To answer the first part of your question regarding depth and his meaning, I base my answer on the discussion in the linked webpage. Based on the dating of the notes and the discussion in the pictures, I doubt that his contention is regarding actual interpretation in terms of differentials $\mathrm{d}x, \mathrm{d}y$ and is instead more pedantic in nature. Earlier in the discussion, he talks about indeterminates and the difference between an indeterminate and a variable. Later, he writes "What is the way out? The notation $\tfrac{\mathrm{d}}{\mathrm{d}x} f(x, y)_{x=y,y=x}$ hardly seems to help in this difficult case". From this I gander that it's primarily the fact that he doesn't like the use of $y$ as something being differentiated. He states that $y = x^2 + 3x$ as if to say that you could alternatively just rearrange the equation in terms of $x$; however, in taking the derivative, you get another function - that is, you have a function $f(x)$ that becomes $g(x)$ as a result of differentiation $D:f(x) \rightarrow g(x)$. Thus, $y$ can't be a variable but rather a function if we want differentiation to be well-defined. Think of it as something akin to abuse of notation.
Regarding intuition and subtlety, Leibnitz's notation does indeed provide both although in truth $$ \dfrac{\mathrm{d}}{\mathrm{d}x} f(x) = g(x)$$ is clearer than using $y$. From a more intuitive point of view, you can think of the derivative as a variation in $f$ with respect to $x$ and indeed it's this notion that is more important than the one regarding secants and tangents. The subtlety of the notation is readily apparent when people are using the chain rule, even though you're not really cross-multiplying (for instance it doesn't work with second order derivatives). The subtlety becomes more apparent when you abstract even further to vector calculus or even further to differentiable manifolds.