Should every group be a monoid, or should no group be a monoid?

Question: What is more convenient/useful? Writing mathematics as if every group is a monoid, or as if these two classes are disjoint?


Additional discussion. Define a monoid as follows.

Defn 1. A monoid is a triple $(X,*,e)$ such that $*$ is an associative binary operation on $X$, and $e \in X$, and $e$ has the property that for all $x \in X$ it holds that $x * e = e * x = x$.

From here, there's at least two ways of defining a group.

Defn 2. A group is a monoid $(X,*,e)$ such that for all $x \in X$ there exists $y \in X$ such that $x*y=e$.

Defn 2'. A group is a quadruple $(X,*,e,i)$ such that $(X,*,e)$ is a monoid, and $i$ is a function $X \rightarrow X$, and for all $x \in X$ it holds that $x * i(x) = e$.

Now I understand that minimalism favors Defn 2, while practitioners of universal algebra favor Defn 3. However, this is not my question.

My question is not: which is preferable, Defn 2 or Defn 2'?

Rather, my question is: which is preferable, definitions like Defn 2 such that every group is a monoid, or definitions like Defn 2' such that no group is monoid? So just to clarify, I want to know: what is more convenient/useful? Writing mathematics as if every group is a monoid, or as if these two classes are disjoint?


In my opinion Definition 2' is the correct definition (and Definition 2 is not a definition, but rather a characterization), because it follows the general principles how algebraic structures are defined in universal algebra (by operations satisfying equations), and therefore it also directly generalizes to other categories, for example topological spaces. A topological group is not just a topological monoid such that every object has an inverse. We also need that the inverse map is a continuous map (it is an interesting fact that for Lie groups this is automatic). So it is useful when the inverse map belongs to the data.

There are several other reasons why this is important: The subgroup generated by a subset $X$ of a group has underlying set $\{a_1^{\pm 1} \cdots \dotsc \cdots a_n^{\pm 1} : a_i \in X\}$. When you view groups as special monoids, you may forget the $\pm 1$ here.

Different categories should always be considered as disjoint. This helps to organize mathematics a lot. Unfortunately, forgetful functors between these categories are usually ignored, treated as if they were identities, but of course they are not. For example, we have the forgetful functor $U : \mathsf{Grp} \to \mathsf{Mon}$. It turns out that it is fully faithful (in many texts group homomorphisms are defined that way, of course again this is not conceptually correct). It has a left adjoint (Grothendieck construction) as well as a right adjoint (group of units). In particular it preserves all limits and colimits. These properties of $U$ show that often (but not always!) there is no harm when you identitfy $G$ with $U(G)$.

By the way, Definition 2' is conceptually not complete yet. You have to assume $x \cdot i(x) = i(x) \cdot x = e$. This becomes important when you study group objects in non-cartesian categories, aka Hopf monoids, for example Hopf algebras. The axiom for the antipode then contains two diagrams.


Previous answers are very complete. Let me just add a model theoretic point of view. (I did myself ask that kind of questions recently.)

For the model theorist, your question can be stated as : considering $\mathcal L_1$ the language $\{\ast, e\}$ and $\mathcal L_2$ the language $\{\ast, e, i\}$, should a group be defined as a $\mathcal L_1$-structure (satisfying some $\mathcal L_1$-theory $T_1$) or as a $\mathcal L_2$-structure (satisfying some $\mathcal L_2$-theory $T_2$) ?

If your goal is to talk about group, you should prefer $\mathcal L_2$. For example, considering a $\mathcal L_2$-model $\mathfrak M \models T_2$ of base set $M$, the substructure generated by some $A \subseteq M$ is the subgroup $\langle A \rangle$ of $M$. But taking a $\mathcal L_1$-model $\mathfrak N \models T_1$ makes the substructure generated by $A \subseteq N$ only the monoid generated by $A$ in the implicit monoid $N$ and hence the substructure generated by $A$ is not necessarily model of $T_1$. It shows that the language $\mathcal L_1$ is more suitable to talk about monoids than groups.

We can even go a little further. Consider the language $\mathcal L_0 = \{\ast\}$ . You have enough to state a theory $T_0$ of groups : $$ \begin{aligned} \forall x \forall y \forall z, &\ (x \ast y) \ast z = x \ast (y \ast z) \\ \exists e \forall x, &\ x \ast e = x = e \ast x \\ \forall x \exists y, &\ x \ast y = e = y \ast x . \end{aligned} $$ But because of the second statement, the theory is not $\forall\exists$ (i.e. not every sentence of the theory $T_0$ is equivalent to a sentence $\forall x_1 \dots \forall x_n \exists y_1 \dots \exists y_m, \varphi_0(x_1,\dots,x_n,y_1,\dots,y_m)$ with $\varphi_0$ without quantifier). It has for result that a increasing union of groups isn't necessarily a group (here group in the sense a $\mathcal L_0$-model of $T_0$). That is, groups in that sense does not have the simplest induction limit as a group. Actually, what could go wrong is that the union could have no neutral. As you can imagine, the language $\mathcal L_0$ is then more suitable for associative magmas than for monoids or groups.


First, a group is a monoid where every element has a two-sided inverse, not just a one-sided inverse.

Second, whether you define a group to be a monoid where every element has a two-sided inverse, or you define a group to be a quadruple $(G,\circ, e, i)$, where $\circ $ is an associative binary operation on $G$, $e$ is a twos-sided identity for the operation, and $i$ is a unary operation assigning to every element a two-sided inverse, is irrelevant to the question whether or not a group is a monoid. The particularities of a definition are of little importance as to what an actual thing is. The reason that every group is a monoid is that there is a fully faithful forgetful functor from $Grp$ to $Mon$ (the category of groups to the category of monoids). That means that there is a way of seeing every group as giving rise to a monoid. Exactly how that is done may depend on the particularities of the given definition of groups and monoids. But, no matter which definitions you'll choose for the concepts of group and monoid, there will result categories $Grp_1$, $Grp_2$, $Mon_1$, and $Mon_2$, with $Grp_1 \cong Grp_2$ and $Mon_1 \cong Mon_2$, and there will be the corresponding forgetful functors.

So, you may choose any particular way to define something. Category theory will then (if that thing you defined gives rise to a category, which most often is the case (but not always)) give you a precise way to saying when two definitions actually define the same thing (strongly the same means the categories will be isomorphic, weakly the same (which is the kind of sameness we care about in maths) means that categories will be equivalent). And then, whether every blool is a gradnal boils down to the existence of a certain nice functor from $Blool$ to $Gradnal$. This is independent of the choice of definition that gave rise to these categories, as long as different categories of blools (resp. gradnals) are equivalent.

In short: a concept lies not in a definition, but rather in the (too large to exist) equivalence class of equivalent categories. In some cases, categories need to be replaced by $\infty$-categories, making things much more difficult and interesting (e.g., this is in a sense modern algebraic topology: spaces are the same as simplicial sets, as long as one considers the relevant $\infty $-categories).


Note that Def. 2' is a standard type of universal algebraic definition, which means that it requires only equational logic.

On the other hand Def. 2 requires both types of quantification, i.e. stronger logic than Def. 2' (namely: predicate logic).

So Def. 2' appears to be more economical, it needs less of expressive power. At the same time it is not cumbersome or too long. In fact, both definitions are of quite the same length.

Therefore, imho, Def. 2' is better.