Thurston's 37th way of thinking about the derivative
Zev, I honestly think Thurston's tongue was firmly implanted in his cheek when he wrote this. So the key point is that a connection on a vector bundle gives you (a) a means of differentiating sections (generalizing the covariant derivative for a Riemannian manifold as a connection on the tangent bundle) and (b) a notion of parallelism (generalizing the notion of parallel transport of tangent vectors).
As you suggested, the differential of $f\colon D\to\mathbb R$ gives you a $1$-form, hence a section of the cotangent bundle $T^*D$. With the standard symplectic structure on $T^*D$, Lagrangian sections (i.e., ones that pull back the symplectic $2$-form to $0$) are precisely closed $1$-forms. [This is tautological: If $q_i$ are coordinates on $D$, a $1$-form on $D$ is given by $\omega = \sum p_i\,dq_i$ for some functions $p_i$. By definition, $d\omega = \sum dp_i\wedge dq_i$, and this is (negative of) the pullback by the section $\omega$ of the standard symplectic form $\sum dq_i\wedge dp_i$ (with canonical coordinates $(q_i,p_i)$ on $T^*D$).]
Now, a connection form on a rank $k$ vector bundle $E\to M$ is a map $\nabla\colon \Gamma(E)\to\Gamma(E\otimes T^*M)$ (i.e., a map from sections to one-form valued sections) that satisfies the Leibniz rule $\nabla(gs) = dg\otimes s + g\nabla s$ for all sections $s$ and functions $g$. In general, one specifies this by covering $M$ with open sets $U$ over which $E$ is trivial and giving on each $U$ a $\mathfrak{gl}(k)$-valued $1$-form, i.e., a $k\times k$ matrix of $1$-forms; when we glue open sets these matrix-valued $1$-forms have to transform in a certain way in order to glue together to give a well-defined $\nabla$.
OK, so Thurston takes the trivial line bundle $D\times\mathbb R$. A connection is determined by taking the global section $1$ and specifying $\nabla 1$ to be a certain $1$-form on $D$. The standard flat connection will just take $\nabla 1 = 0$ and then $\nabla g = dg$. I'm now going to have to take some liberties with what Thurston says, and perhaps someone can point out what I'm missing. Assume now that our given function $f$ is nowhere $0$ on $D$. We can now define a connection by taking $\nabla 1 = -df/f$. Then the covariant derivative of the section given by the function $f=f\otimes 1$ [to which he refers as the graph of $f$] will be $\nabla(f\otimes 1) = df - f(df/f) = 0$, and so this section is parallel.
Slightly less tongue-in-cheek, parallelism is the generalization of constant (in a vector bundle, we cannot in general say elements of different fibers are equal), and covariant derivative $0$ is the generalization of $0$ derivative.