What do cones have to do with quadratics? Why is $2$ special?

I've always been nagged about the two extremely non-obviously related definitions of conic sections (i.e. it seems so mysterious/magical that somehow slices of a cone are related to degree 2 equations in 2 variables). Recently I came across the following pages/videos:

  • This 3B1B video about ellipses, which reignited my desire to understand conics
  • Why are quadratic equations the same as right circular conic sections?, which offers a very computational approach to trying to resolve this question
  • Another 3B1B video on visualizing Pythagorean triples (i.e. finding the rational points of a circle)
  • and Manjul Bhargava's lecture on the Birch-Swinnerton-Dyer Conjecture, where minutes ~10-15 discuss the complete solution to the problems of rational points on conics.

While 3B1B's video makes a lot of sense and is very beautiful from a geometric standpoint, it does not talk about any of the other conics, or discuss the relationship with "degree 2". Moreover, the 2nd 3B1B video I linked and then Bhargava's lecture highlights "degree 2" as something we understand well, compared to higher degrees (reminds me a little bit of Fermat's last theorem and the non-existence of solutions for $n>2$).

So, I suppose my questions are as follows:

  1. Why, from an intuitive standpoint, should we expect cones to be deeply related to zero-sets of degree 2 algebraic equations?

and more generally:

  1. Is there some deep reason why "$2$" is so special? I've often heard the quip that "mathematics is about turning confusing things into linear algebra" because linear algebra is "the only subject mathematicians completely understand"; but it seems we also understand a lot of nice things about quadratics as well -- we have the aforementioned relationship with cones, a complete understanding of rational points, and the Pythagorean theorem (oh! and I just thought of quadratic reciprocity).

Also interesting to note that many equations in physics are related to $2$ (the second derivative, or inverse square laws), though that may be a stretch. I appreciate any ideas you share!

$$\rule{5cm}{0.4pt}$$

EDIT 3/12/21: was just thinking about variances, and least squares regression. "$2$" is extremely special in these areas: Why square the difference instead of taking the absolute value in standard deviation?, Why is it so cool to square numbers (in terms of finding the standard deviation)?, and the absolutely mindblowing animation of the physical realization of PCA with Hooke's law: Making sense of principal component analysis, eigenvectors & eigenvalues.

In these links I just listed, seems like the most popular (but still not very satisfying to me) answer is that it's convenient (smooth, easy to minimize, variances sum for independent r.v.'s, etc), a fact that may be a symptom of a deeper connection with the Hilbert-space-iness of $L^2$. Also maybe something about how dealing with squares, Pythagoras gives us that minimizing reconstruction error is the same as maximizing projection variance in PCA. Honorable mentions to Qiaochu Yuan's answer about rotation invariance, and Aaron Meyerowitz's answer about the arithmetic mean being the unique minimizer of sum of squared distances from a given point. As for the incredible alignment with our intuition in the form of the animation with springs and Hooke's law that I linked, I suppose I'll chalk that one up to coincidence, or some sort of SF ;)


Solution 1:

A cone itself is a quadratic! Just in three variables rather than two. More precisely, conical surfaces are "degenerate hyperboloids," such as

$$x^2 + y^2 - z^2 = 0.$$

Taking conic sections corresponds to intersecting a cone with a plane $ax + by + cz = d$, which amounts to replacing one of the three variables with a linear combination of the other two plus a constant, which produces a quadratic in two variables. The easiest one to see is that if $z$ is replaced by a constant $r$ then we get a circle $x^2 + y^2 = r^2$ (which is how you can come up with the above equation; a cone is a shape whose slice at $z = \pm r$ is a circle of radius $r$). Similarly if $x$ or $y$ is replaced by a constant we get a hyperbola.

I don't know that I have a complete picture to present about why quadratics are so much easier to understand than cubics and so forth. Maybe the simplest thing to say is that quadratic forms are closely related to square (symmetric) matrices $M$, since they can be written $q(x) = x^T M x$. And we have lots of tools for understanding square matrices, all of which can then be brought to bear to understand quadratic forms, e.g. the spectral theorem. The corresponding objects for cubic forms is a degree $3$ tensor which is harder to analyze.

Maybe a quite silly way to say it is that $2$ is special because it's the smallest positive integer which isn't equal to $1$. So quadratics are the simplest things that aren't linear and so forth.

Solution 2:

What is a cone?

It is a solid so that every cross section perpendicular to its center axis is a circle, and the radii of the these cross section circles a proportional to the the distance from the cone's vertex.

And that's it. the surface of the cone are the points $(x,y,z)$ where $z = h= $ the height of the the cross-section $= r = $ the radius of the cross section. And $(x,y)$ are the points of the circle with radius $r = h = z$.

As the equation of a circle is $\sqrt{x^2 +y^2} = r$ or $x^2 + y^2 = r^2$ the equation of a cone is $x^2 + y^2 = z^2$.

Every conic section is a matter intersecting the cone with a plane. A plane is a restriction of the three variable to be related by restraint $ax +by + cz= k$ and that is a matter of expressing any third variable as a linear combination of the other two.

So the cross section of a plane and cone will be a derivation of the 2 degree equation $x^2 = y^2 = z^2$ where one of the variables will be linear combination of the other two. In other words a second degree equation with two variables.

And that's all there is to it.

Of course the real question is why is the equation of a circle $x^2 + y^2 =r^2$? and why is that such an important representation of a second degree equation?

And that is entirely because of the Pythagorean theorem. If we take any point $(x,y)$ on a plane and consider the three points $(x,y), (x,0)$ and $(0,0)$ they for the three vertices of a right triangle. The legs of this triangle are of lengths $x$ and $y$ and therefore by the Pythagorean theorem the hypotenuse will have length $\sqrt{x^2 + y^2}=h$ and that is the distance of $(x,y)$ to $(0,0)$.

Now a circle is the collection of points where the distance from $(x,y)$ to $(0,0)$ is the constant value $r = h$. And so it will be all the points $(x,y)$ where $\sqrt{x^2 + y^2} =r$.

And that's it. That's why: distances are related to right triangles, right triangles are related to 2nd degree equations, circles are related to distances, cones are related to circles and all of them are related to 2nd degree equations.

That's it.