Notation: subscript vs. superscript for coordinate vector fields
Some books write the coordinate vector fields with a subscript as $$\frac{\partial}{\partial x_i}$$ while some write it with a superscript as $$\frac{\partial}{\partial x^i}.$$
Is there a conceptual reason for this distinction? I.e. in some texts I have seen, a supercript is to indicate the components of covectors, and a subscript for vectors.
This is part of a notational convention that evolved with J.J. Sylvester (1853) in his study of algebraic invariant theory.
The distinctions represented by the use of superscripted or subscripted indices only become important when there is more than one coordinate system in play.
In fact: "when there is a fixed coordinate basis (or when not considering coordinate vectors), one may choose to use only subscripts"
In cases where there are multiple coordinate systems under simultaneous consideration, there will typically be a reference axis (often this has the standard basis), and a coordinate axis (often this has a non-orthogonal basis, e.g. an oblique coordinate system).
The conventions for a vector (array of elements) and their indexing depend on how their values are obtained, as the following image (from Mathview's animation) illustrates:
CONTRAvariant components (superscripts) of a vector are obtained for example by parallel projection of the vector onto the coordinate axes. So in this example, contravariant components would be the familiar coordinate values. CONTRAavariant components, by definition, change as the coordinate axes change. They can be remembered as superscripted by considering the third letter 'n' whose rounded edge can be taken as pointing up.
COVariant components (subscripts) are by obtained for example by orthogonal projection of the vector onto the coordinate axes when these are themselves taken as unit vectors given in the reference frame. This is the usual dot-product of vectors against unit vectors in the coordinate directions. The components can be remembered as being subscripted by considering the third letter 'v' to point to its use. COVariant components, by their definition, change as the reference axes change, since that changes the values of the unit direction vectors of the coordinate axes which are being dot producted against.
In abstract mathematics terminology, the covariant vector is a linear functional that lives in the space dual to the contravariant vector, and, when used in an inner product with a vector (primal space) generates a scalar. In physics, this is expressed in the famous Einstein notation convention for representing length invariantly regardless of coordinate systems by taking the inner product of a covariant and contravariant vector (called contraction).
References:
- Wikipedia: indexing conventions
- Good introduction to covariance / contravariance
- Wikipedia explanation of covariant / contravariant
- Mathview video lecture on Contravariant and Covariant Components of a Vector (Part 2)
Usually one wants lower subscripts to denote vectors, probably coming from the common practice of denoting the standard basis of $\mathbb R^n$ as $e_1,\ldots,e_n$. Then to use Einstein summation convention you would want the coordinate functions $x^i$ on $\mathbb R^n$ to have upper indices since they represent covectors: any vector on $\mathbb R^n$ is written as $$ v = x^i(v)e_i. $$ In $\frac{\partial}{\partial x^i}$, the $i$ is a lower index (consistent with vectors having lower indices) and arguably this is the better notation. The main advantage, in my opinion, of using $x_i$ instead of $x^i$ is that you don't confuse the index with an exponent and writing a power of a coordinate function is cleaner.