Are surjective functions a pointless concept? [duplicate]

What is the point of the concept of surjective or onto functions if you can just restrict the co-domain of your function to its image? Injectivity or 1-to-1ness is actually the defining property of bijections (sometimes called 1-to-1 correspondences), while all the question of surjectivity does is derail the argument into checking whether the image equals the co-domain.

Am I wrong in thinking this? What am I missing? Only situation where this concept could be marginally useful that I could think of would be some function for which it is easier to find an element of the co-domain for which no pre-image exists than to actually find the image itself.


Solution 1:

If you are given one function $f$, it is indeed true that asserting that it is surjective is equivalent to asserting that its image is equal to its codomain. But suppose that you have several functions. Then asserting that some of them are surjective whereas others are not becomes more natural, since it is not natural to restrict the codomain of some of them and not to restrict the codomain of the other ones.

Solution 2:

If you are given a function $f:A\to B$, you are right that injectivity is "intrinsic" to the function, in the sense that it only depends on the graph of the function; while any function is surjective "onto its image".

Others have already explained why it can be unilluminating to look at the image of the function : sometimes (often) it's very hard to describe said image, and the question of surjectivity onto $B$ just becomes a question of equality : is $\mathrm{im} (f) = B$ ? As such, you're not solving the question by saying "oh it's surjective onto $\mathrm{im}(f)$", and this is the point I want to make : when asking for surjectivity of a function, you're often not interested in whether it's surjective somewhere, rather your real interest lies in the set $B$.

In other words, asking whether $f$ is surjective is not asking something only about $f$ (and its graph more specifically), it's asking whether the equation $f(x)=b$ always has a solution for $b\in B$. From that perspective, you can see why we're interested in surjections : they're the maps such that any equation is solvable.

Let me give you a couple of examples where the concept of surjectivity is interesting :

-Suppose you have a field $k$ (you can think $k=\mathbb{R,C,Q}$ if you don't know a lot about fields) and a polynomial function $P\in k[x]$. Then $P:k\to k$ and you may ask whether $P$ is surjective. Of course it's surjective onto its image, but that's not really what you want to know. Being even more specific, take $P(x) = x^2$, then asking whether $P$ is surjective is asking "does every element of $k$ have a square root in $k$ ?" Now that is quite clearly an interesting question (that lead to the discovery of $\mathbb{C}$ !), and obviously it's the same as "is $\mathrm{im}(P) = k$?", but again, phrasing it like this doesn't really help, and doesn't remove the interest of the question.

-If you know Cantor's theorem, then you know that for any set $X$ there is no surjection $X\to \mathcal{P}(X)$. Now without the notion of surjection this result isn't even expressible, whereas it's a very important statement. Of course any function $f:X\to \mathcal{P}(X)$ is surjective onto its image : but who cares ? what we're really interested in is whether every element of $\mathcal{P}(X)$ is attained.

In summary, surjectivity is an interesting notion when you're actually interested in the codomain, not only in the function : it indeed happens that sometimes you don't really care about $B$, you mainly care about $f$ and $A$, and in these cases you just say "corestrict to the image of $f$ and we're good"; but sometimes you're also interested in $B$, in which case the notion becomes relevant.

Let me end by noting that once you've asked the question of surjectivity, and, say, got a negative answer, the quest does not end here, because again, as you said, $f$ is always surjective onto $\mathrm{im}(f)$ : so if $f$ is not surjective onto $B$, it means that the equation $f(x)=b$ does not have a solution for all $b$, and so you enter a somehow more nuanced question, which is "for which $b$ does it have a solution ?" (which is of course the same question as "what is $\mathrm{im}(f)$ ?"; but perhaps phrasing it in terms of equations makes it clearer)

Solution 3:

A concept is useful if it helps us say what we want to say easier than if we didn't have the concept.

You're right that if we're given some function out of the blue as a set of ordered pairs $(x,f(x))$, then it doesn't make much sense to worry about whether it is surjective -- that is just a question of which codomain to choose for it. But getting a function without any context is essentially never what actually happens when we do mathematics.

It is much more common that we don't start out with a particular function in mind, but with a list of conditions, and then we ask "is there any function that satisfies our conditions?", or "is such-and-such true about all functions that meet our conditions?" The conditions come before we have fixed a particular function to apply them to. Therefore we need vocabulary for speaking about those conditions much more than (or at least in addition to) speaking about individual, concrete, functions.

As it turns out, we relatively often need to speak about conditions of the form

The range of the function is exactly the set $B$.

for some already-known $B$ that comes from whatever we happen to be doing. This happens so often that it is convenient to have a shorter way of saying it.

In older language this could be expressed by saying that we're thinking about a function "onto $B$" rather than a function "to $B$" (which merely requires that the range of the function is a subset of $B$). That certainly works -- it is undoubtedly short -- but mathematics educators have generally found it more instructive and explicit to express the condition as "the function $A\to B$ is surjective". Some advantages of that are:

  • "Onto" is almost too short -- it is relatively easy to miss the difference between "to" and "onto", especially for students who may not appreciate the significance of the distinction.

  • Because "surjective" is an adjective, we can use it in contexts such as "because $f$ is surjective..." or "we now prove that $g$ is surjective". In the older language we have to say "because $f$ is onto", which is pretty suspect grammatically -- "onto" is a preposition, so neither it nor "onto $B$" ought to be a predicate.

  • The notation "$A\to B$" is a convenient and memorable way to specify both the domain and codomain of the function you're thinking about. But then if you need to speak about the range being exactly $B$, there's not room to do that by putting a word in front of $B$, because that's where the $\to$ goes.

The downside of the usage is that the wording "$f$ is surjective" doesn't really make formal sense unless we imagine that $f$ is something that inherently knows what its codomain is supposed to be. This is not the case for the set-theoretic formalization of functions as sets of pairs. Some authors will explicitly define a "function" for this purpose as something like a triple of domain, codomain, and pairs, in which case "is surjective" is unproblematic. Others treat "is surjective" as an abbreviation for "has codomain $B$" and leave it to the reader to remember which $B$ from the preceding text it makes sense to understand it as.


By the way, in actual mathematical usage the main use (though not the only one) of "surjective" is as one half of the definition of "bijective". Injective functions are certainly a useful concept in its own right; so are bijections. It makes didactic sense to teach "bijective" as a combination of "injective" (which we already need to know about) and an additional condition which turns out to be "surjective".

Solution 4:

Well, consider the exponential function and its counterpart, the logarithm function. You need to know the domain and codomain to recognize that they are inverse to each other:

$\exp : {\Bbb R}\rightarrow{\Bbb R}_{>0}:x\mapsto e^x$ and $\ln: {\Bbb R}_{>0}\rightarrow{\Bbb R}:x\mapsto \ln(x)$.