Suppose $f:M \to N$ is a smooth map between complete Riemannian manifolds of the same dimension. Suppose $Df(m_0)$ is invertible, and $n$ is a point close to $f(m_0)$. Can we perform Newton iteration to find a point $m^*$ mapping to $n$ as we do in the proof of the Inverse Function Theorem, by taking $m_{i+1} = \exp_{m_i} \circ (Df_{m_i})^{-1} \circ \log_{f(m_i)}(n)$? Hopefully we would expect this to converge to a point mapping to $n$ if $n$ is chosen close enough to $f(m_0)$.

My apologies if this is nonsense; I'm still learning about Riemannian manifolds, $\log$, $\exp$, etc.


Solution 1:

Not a full answer, but an extended comment following up on my comment above:

Pick charts $\phi:M \to \mathbb{R}^n$ and $\psi:N\to\mathbb{R}^n$; then you can "pull back" the function $f$ to a function $$g: \mathbb{R}^n\to\mathbb{R}^n,\quad x\mapsto (\psi \circ f \circ \phi^{-1})(x).$$

Now you can run Newton's method on the function $g(x) - \psi(n)$, with update rule $$x_{i+1} = x_i - (Dg)^{-1}[g(x_i)-\psi(n)].$$

Now clearly this form of Newton's method requires only differential structure on $M$ and $N$, and not a metric: no matter what charts $\phi$ and $\psi$ you pick, provided that the initial guess is sufficiently close to the root, $f$ is sufficiently regular, etc. and that the iterates $x_i$ do not leave the charts, you will converge to a solution $x_\infty$. Of course, the intermediate iterates will vary depending on your choice of charts, but "being a root" is clearly a coordinate-independent property of points of $M$.

What you've done is chosen $\phi$ and $\psi$ to be natural coordinates on $M$ and $N$. An extra wrinkle is that you are changing charts each iteration; how this affects convergence of Newton's method is not obvious to me offhand. Your formulation is nice in that it is coordinate-free: I'm not convinced it's necessarily more efficient from a computational perspective, though, since the best choice of charts (in the sense of maximizing the radius of convergence) has as much to do with the form of $f$ as $M$ and $N$. Even for $f:\mathbb{R}^2\to\mathbb{R}^2$, for instance, switching from Cartesian to polar coordinates can dramatically affect if and how quickly Newton's method converges for a given function and initial guess.