What is the correct way to think about quotient sets and equivalence relations?

Perhaps there is not a correct way to think about it but I would want to know how others think about it. Here are my problems/questions, after my definitions:

Definition 1. Let $X$ be a set and $\sim$ be an equivalence relation on $X$. Then $[x]:=\{y \in X \mid y \sim x\}$ and $X/{\sim} := \{[x] \mid x \in X\}$.

My question could be summarized to "How should I think about $X/{\sim}$?". Consider $\mathbf{Z}/{\sim}$ with $z_1 \sim z_2$ $\iff$ $z_1-z_2$ is even. One then obtains $\mathbf{Z}/{\sim} = \{[0],[1]\}=\{\{...,-4,-2,0,2,4,...\},\{...,-5,-3,-1,1,3,5,...\}\}.$

The way I think about the set of all equivalence classes is that one collects all equivalent elements into one set for all elements and obtains the set on the very right in the example. Then one picks a "name" for each of those sets, calling it by one of its members. In the example one has the canonical choices of $[0],[1]$. If I now pick an arbitrary element $a \in \mathbf{Z}/{\sim}$, then there exists a $z \in \mathbf{Z}$ such that $a=[z]$. This is because I can simply call the set $a$ by one of its representatives, in this case $z$ or in the example above $[0]$ or $[1]$. When defining a function it then suffices to define it on all the "names" $[z]$ because I can give each object in $\mathbf{Z}/{\sim}$ one. The function being well defined then comes down to showing that it is independent of the name each object has been given. Is this a valid way to think about this concept or are there other, perhaps better ways to do so? I am not sure if I am satisfied with the way I would explain it to myself since the "giving it a name" does not really sound that rigorous. I guess one could also view this as a sort of assignment which assigns to every set of equivalent elements a member of it (which is not well defined) and then assigns to it a value such that this process is well defined.

Edit: The following is still not entirely clear to me. When defining a function from a quotient set to another set, one usually defines this in the following way: $$f: X/{\sim} \to A, \ [x] \mapsto a(x).$$ How should I think about this? Do I first choose a (arbitrary) complete system of representatives, define this function for them and then show that it is not dependent of the choice of the complete system, or do I map all $[x]$, $ x \in X$ and then realize that the images of equivalent elements are the same, meaning that the function is well defined?

Solution 1:

Frankly, the way you have worded all this seems very sound to me.

The one issue is that one cannot generally expect to be able to pick a "canonical" element. So we generally are satisfied with the ambiguity of the "name" $[x]$ for the equivalence class that contains $x$. Formally, one simply uses that the statements $[x]=[y]$ and $x \sim y$ are logically equivalent, and one remembers (just as you say) to check that all definitions based on such "names" are well-defined.

In fact, it is even somewhat problematical to assert the existence of an assignment, to each equivalence class, of a member of that class. There is a special set theory axiom devoted to the assertion that such assignments exist in complete generality: the Axiom of Choice. However, there are plenty of situations where one can construct a choice assignment by hand without that axiom, and there are plenty of situations where one does not need to bother with a choice assignment.

Regarding your latest edit, in this situation it is not necessary to choose representatives. To define a function $f : X/\!\sim \, \to A$, you can first define a function $F : X \to A$, and then you prove that $F$ has the following property:

For all $x,y \in X$, if $x \sim y$ then $F(x)=F(y)$.

You may then write down the formula $$f([x])=F(x) $$ and you are guaranteed that this formula gives a well-defined function $f : X / \! \sim \, \to A$.

Solution 2:

The way I think about the set of all equivalence classes is that one collects all equivalent elements into one set for all elements and obtains the set on the very right in the example. Then one picks a "name" for each of those sets, calling it by one of its members.

Yes, this is exactly what we are doing.

The function being well defined then comes down to showing that it is independent of the name each object has been given.

Yes, again.

However, I suspect that you might be missing/misunderstanding the whole point of this exercise. Our intention is not just to give a name to an equivalence class that we have produced. The whole point of defining an equivalence relation, is to rigorize the concept of identifying different elements of the set.

There might be conditions where we want to consider two elements of a set as 'same', even when they are not actually equal in the normal sense. For example, in Geometry, consider triangles on a plane. Two triangles, with different vertices are not the same, in the strictest sense, as their vertices are different points. However, we can consider two congruent triangles to be the 'same', and this way of thinking might actually prove to be useful.

Similarly, in your case, we need to classify the integers just based on whether they are odd or even (and say that two integers are 'same' if they have the same parity). An equivalence relation is a way to rigorize this, as it divides our elements in to two different classes, with each class consisting of elements that we needed to be the same. Thus, $\{n | n \text{ is even} \}$ is now just a single element of your set, namely $[0]$. We do not care about the fact that it is the set of even numbers, anymore. Similarly, for odd numbers also.

Shortly, an equivalence relation generalises the concept of what it means for two elements to be equal. The quotient set just represents the original set, however, with a different notion of equality than before.

Solution 3:

The idea is to view the structure through a lens which discards (or ignores for the moment) certain information. Indeed, even in colloquial language, mathematicians are prone to say things like, “Modding out by (in other words, ignoring) details like X, Y, and Z, the big story is this…”

Human language does this when we abstract, losing some information about particular dogs when we subsume them into the category dog. (This would be something like “Animals mod ‘same species’”.)

This generalizes to arrows in a category…typically you can view the image of a morphism (a subobject of the codomain) as a sort of low-resolution version of the domain. This is the content of the first isomorphism theorem in groups, rings, etc., but you can manage to do similar things in lots of categories, if you’re lucky. See Tom Leinster’s introductory book on category theory for this perspective.

It’s sort of the only trick we have in mathematics…in order to understand X, look at morphisms out of X (“shadows of X”) and morphisms into X (“things of which part of X is a shadow”).

Solution 4:

I think of it as the image of a map with the original set as the domain. For every quotient $Z/\sim$ the map $Z\to Z/\sim$ which maps every element of $Z$ to its equivalence class is such a map. And for every surjective map $Z\to Y$, we can define an equivalence relation $\sim$ where the preimage of every element of $Y$ is an equivalence class. And then $Y$ and $Z/\sim$ have the same cardinality.

This kind of thinking extends to all kinds of quotient structures. Quotients of groups, rings, vector spaces, topological spaces, etc. all come to mind. But "same cardinality" can be replaced by "isomorphic". Essentially, the quotients of a set/group/ring/... are exactly all the images of the corresponding morphisms (maps for sets, homomorphisms for groups, continuous maps for topological spaces, etc.), up to isomorphism.

Solution 5:

$X/{\sim}$ is a partition of $X$. $\sim$ is the relation between two elements in the same piece of the partition. An equivalence relation is a relation that appears in this way from a partition.

In general, you should not think of equivalence classes in "names" so much: in an abstract setting, there may be no obvious way of choosing them. In fact, being able to simultaneously choose a representative for all classes of any equivalence relation is (quite straightforwardly) equivalent to the axiom of choice. In some cases (like for integers), this is easier, but even then, the choice is never really "canonical" in any meaningful manner.

Instead, you should think of equivalence classes as subsets of the domain. They require no special names, they name themselves. It may be useful to call the odd and even integers using $1$ and $0$, but that is no better than simply calling them for what they are: the set of odd integers, and the set of even integers.

Of course, for an arbitrary set of integers, we usually have no name at all (and even less so for an arbitrary subset of an arbitrary set). But that is fine. We have no nice names for most numbers between $0$ and $1$, either.

Re: edit: when we define a map $X/{\sim}$ via a formula of the form $[x]_\sim \mapsto f(x)$ for some function of $x\in X$, it is not usual (in my experience) to think of a complete set of representatives. I can think of two more natural ways of interpreting what happens in those cases, the first one a bit more concrete, the second perhaps a bit more advanced, but also more explicit IMO:

Take any $c\in X/{\sim}$, observe that it is of the form $[x]_\sim$ for some $x$, and then prove that the result $f(x)$ does not depend on the choice of $x$, i.e. the formula $\bar f(c)=f(x)$ is well-defined.
Show that if $x_1\sim x_2$, then $f(x_1)=f(x_2)$, and then use the fact that the quotient map $X\to X/{\sim}$ is universal in the sense that for any map $f\colon X\to Y$ with the property that $f(x_1)=f(x_2)$ whenever $x_1\sim x_2$, there is a unique vertical arrow making the following diagram commute: $$\require{AMScd} \def\diaguparrow#1{\smash{ \raise.6em\rlap{\scriptstyle #1} \lower.3em{\mathord{\diagup}} \raise.52em{\!\mathord{\nearrow}} }} \begin{CD} && Y\\ & \diaguparrow{ f} @A\bar fAA \\ X @>>> X/{\sim} \end{CD}$$

Of course, they are all the same thing, just viewed (slightly) differently.