Why is the change-to-polar-coordinate method valid in computing limits?

Note: this answer stems from the (now removed) comments left under my answer to your other question about polar coordinates. As requested, I have edited to add the proofs.

Before proceeding, let us define (possibly non-standard notation) $$\lim_{(r,\phi)\to\{0\}\times\mathbb R}g(r,\phi)=L\tag{$\diamond$}$$ to mean that for each $\epsilon>0$, there is an open neighborhood $U$ of $\{0\}\times\mathbb R$ in $[0,\infty)\times\mathbb R$ such that $(r,\phi)\in U$ and $r\neq0$ implies $|g(r,\phi)-L|<\epsilon.$

In the case $g(r,\phi)=f(r\cos\phi,r\sin\phi)$, which is $2\pi$-periodic in the second argument, this can be simplified, by requiring instead that for each $\epsilon>0$, there is a $\delta>0$ such that $0<r<\delta$ implies $|g(r,\phi)-L|<\epsilon$. (Intuitively, in this case we do not have to worry about $\delta$ getting smaller and smaller as $\phi\to\pm\infty$.) This can be done e.g. by using the tube lemma from elementary topology.

We may now state:

Proposition. The following are equivalent for a function $f:\mathbb R^2\setminus\{(0,0)\}\to\mathbb R$:

  1. We have $$\lim_{(x,y)\to(0,0)}f(x,y)=L.$$
  2. The function $\tilde f:\mathbb R^2\to\mathbb R$, defined by $$\tilde f(x,y)=\begin{cases}f(x,y);&(x,y)\neq(0,0),\\L;&(x,y)=(0,0),\end{cases}$$ is continuous at $(0,0)$ and $f(0,0)=L$.
  3. For all $\phi_0\in\mathbb R$ we have $$\lim_{(r,\phi)\to(0,\phi_0)}f(r\cos\phi,r\sin\phi)=L.\tag{$\star$}$$
  4. We have $$\lim_{(r,\phi)\to\{0\}\times\mathbb R}f(r\cos\phi,r\sin\phi)=L.$$
  5. The function $g:[0,\infty)\times\mathbb R\to\mathbb R$, defined by $$g(r,\phi)=\begin{cases}f(r\cos\phi,r\sin\phi);&r>0,\\L;&r=0,\end{cases}$$ is continuous at $(0,\phi_0)$ for each $\phi_0\in\mathbb R$.

Proof. The equivalence of 1 and 2 is standard and follows directly from the definitions.

To see that 2 implies 5, let $p:[0,\infty)\times\mathbb R\to\mathbb R^2$ be defined by $p(r,\phi)=(r\cos\phi,r\sin\phi).$ This is obviously continuous, so if $\tilde f$ is continous, $g=\tilde f\circ p$ must be continuous as well.

Next, we show that 5 implies 3. To see this, let $\epsilon>0$ and $\phi_0\in\mathbb R$. By $5$, there is a $\delta>0$ such that $\|(r,\phi)-(0,\phi_0)\|<\delta$ implies $|g(r,\phi)-g(0,\phi_0)|<\epsilon$. In particular, if $r>0$, this means that $\|(r,\phi)-(0,\phi_0)\|<\delta$ implies $|f(r\cos\phi,r\sin\phi)-L|<\epsilon$, which is precisely 3.

Next, we show that 3 implies 4. Let $\epsilon>0$. For each $\phi_0\in\mathbb R$ we have a $\delta(\phi_0)>0$ such that $\|(r,\phi)-(0,\phi_0)\|<\delta(\phi_0)$ implies $|f(r\cos\phi,r\sin\phi)-L|<\epsilon$. Let $$U(\phi_0)=\{(r,\phi)\in[0,\infty)\times\mathbb R\mid \|(r,\phi)-(0,\phi_0)\|<\delta(\phi_0)\}.$$ Then $U=\bigcup_{\phi_0\in\mathbb R}U(\phi_0)$ is an open neighborhood of $\{0\}\times\mathbb R$ in $[0,\infty)\times\mathbb R$ and for each $r\neq 0$ the condition $(r,\phi)\in U$ implies $\|f(r\cos\phi,r\sin\phi)- L\|<\epsilon$, because every such $(r,\phi)$ is an element of some $U(\phi_0)$ and therefore satisfies $\|(r,\phi)-(0,\phi_0)\|<\delta(\phi_0)$.

Finally, we show that 4 implies 1, thus concluding the circle of implications and establishing the desired equivalence. To do this, let $\epsilon>0$. By 4 and the fact that $f(r\cos\phi,r\sin\phi)$ is a $2\pi$-periodic function of $\phi$, there is a $\delta>0$ such that $0<r<\delta$ implies $|f(r\cos\phi,r\sin\phi)-L|<\epsilon$. Therefore, if $0<\|(x,y)\|<\delta$, we have $|f(x,y)-L|<\epsilon$, because $(x,y)=(r\cos\phi,r\sin\phi)$ for some $r\in(0,\delta)$ and $\phi\in\mathbb R$. This concludes the proof. $\qquad\square$

Remark. Note that the limit $(\star)$ is the limit of a two-variable function. Requiring that we have the single variable limit $$\lim_{r\to0}f(r\cos\phi_0,r\sin\phi_0)=L\tag{$\ast$}$$ for each $\phi_0\in\mathbb R$ is not (!) equivalent. In fact, this latter condition is not sufficient for the limit to exist as shown by the standard counterexample $f(x,y)=\frac{x^2y}{x^4+y^2}.$) The point is that the limit $(\star)$ takes into account points $(r,\phi)$ with different values of $\phi$ whereas in $(\ast)$ the value $\phi=\phi_0$ is fixed.

Added: here are the details regarding the part with the tube lemma, but I will do it without actually using the tube lemma, which would allow us to skip a few steps. (In fact, what follows basically includes a proof of the tube lemma in this special case.)

The definition of $(\diamond)$ mentions a neighborhood $U$ of $\{0\}\times\mathbb R$ in $[0,\infty)\times\mathbb R$. We wish to show that, if $g(r,\phi)$ is $2\pi$-periodic in $\phi$, we can always take a neighborhood of the form $V=[0,\delta)\times\mathbb R$ for some $\delta>0$, instead of this possibly more complicated neighborhood $U$.

Assume we have a neighborhood $U$ of $\{0\}\times\mathbb R$, as in the definition of $(\diamond)$, not necessarily of the form $[0,\delta)\times\mathbb R$. Using $U$, we are going to construct a neighborhood $V=[0,\delta)\times\mathbb R$ satisfying the same properties. Note that since $U$ is a neighborhood of every point $(0,\phi_0)\in\{0\}\times[0,2\pi]$, it contains a ball $$B((0,\phi_0),\eta(\phi_0))=\{(r,\phi)\in[0,\infty)\times\mathbb R\mid \|(r,\phi)-(0,\phi_0)\|<\eta(\phi_0)\}$$ of radius $\eta(\phi_0)>0$ around every such point. This ball contains a set of the form $$V(\phi_0)=[0,\delta(\phi_0))\times(\phi_0-\delta(\phi_0),\phi_0+\delta(\phi_0))$$ which still contains $(0,\phi_0)$. The sets $V(\phi_0)$ form a cover of the interval $\{0\}\times[0,2\pi]$, which is compact, so there is a finite subcover $V(\phi_1),V(\phi_2),\ldots, V(\phi_n)$, where $\phi_1,\phi_2,\ldots,\phi_n\in[0,2\pi]$. Now take $\delta=\min\{\delta(\phi_1),\delta(\phi_2),\ldots,\delta(\phi_n)\}$. Observe that $$[0,\delta)\times[0,2\pi]\subseteq V(\phi_1)\cup V(\phi_2)\cup\ldots\cup V(\phi_n)\subseteq U,$$ so $(r,\phi)\in[0,\delta)\times [0,2\pi]$ implies $|g(r,\phi)-L|<\epsilon$. But since $g$ is $2\pi$-periodic in $\phi$, this means that $(r,\phi)\in[0,\delta)\times\mathbb R$ also implies $|g(r,\phi)-L|<\epsilon$. Therefore $V=[0,\delta)\times\mathbb R$ has the desired properties.


$\newcommand{\Reals}{\mathbf{R}}\newcommand{\eps}{\varepsilon}$Let $f$ be a function defined in some deleted neighborhood $U$ of the origin in $\Reals^{2}$, and let $L$ be a real number. The following are equivalent:

  1. $\displaystyle \lim_{(x, y) \to (0, 0)} f(x, y) = L$.

  2. For every $\eps > 0$, there exists a $\delta > 0$ such that if $(x, y) \in U$ and $0 < \sqrt{x^{2} + y^{2}} < \delta$, then $|f(x, y) - L| < \eps$.

  3. For every $\eps > 0$, there exists a $\delta > 0$ such that if $0 < r < \delta$ and $(r\cos\theta, r\sin\theta) \in U$ for all real $\theta$, then $|f(r\cos\theta, r\sin\theta) - L| < \eps$.

  4. $\displaystyle \lim_{r \to 0} f(r\cos\theta, r\sin\theta) = L$.

Statements 2. and 3. are equivalent for the natural geometric reason that if $(x, y) = (r\cos\theta, r\sin\theta)$, then $\sqrt{x^{2} + y^{2}} = \sqrt{r^{2}} = |r|$, and without loss of generality (e.g., replacing $\theta$ by $\theta + \pi$ if necessary) we may assume $0 < r$.

Edit: Condition 4 is (in my experience) defined by Condition 3. It would be more honest to write something like "$\displaystyle\lim_{r \to 0} f(r\cos\theta, r\sin\theta) = L$ uniformly in $\theta$"; otherwise, this use of limit notation is technically anomalous, since the limit isn't taken at one point, but in a tube neighborhood of the $\theta$-axis. (Of course, the $\theta$-axis maps to one point, the origin, under polar coordinates.)