Why is this a safe neighborhood for uniform convergence of the sequence implicitly defining a function?

Look at page 169, on the sixth line, for the sentence that begins "First note that". A key question I have for you is: do you agree with the estimate, on page 169, that says that, for all $y$ in the interval $[b - \epsilon, b + \epsilon]$, we have$$|\phi'(y)| \le {1\over2}?$$If that estimate seems okay to you, then I think I can explain why the sequence$$f_1(x_*), \text{ } f_2(x_*), \text{ } f_3(x_*),\ldots$$converges, and, in particular, does not get into an infinite loop.

Would that answer your question?

I should also be clear about the "orders of quantification" in Theorem 1.4, which I could explain by turning the theorem into a game.

You give me a function $G$ and a point $(a, b)$ satisfying the conditions in the theorem. I then give you an interval $J$ and a function $f: J \to \mathbb{R}$. You then pick a point $x_*$ in $J$. We check whether$$G(x_*, f(x_*)) = 0.$$If yes, I win. If no, you win.

My strategy for winning is given in the proof. The start of this strategy is to choose $\epsilon$, then $\delta$. I then give you, as my move,$$J := [a - \delta, a + \delta]$$and use contraction methods to define $f$, and give that to you as well. In the proof, I want to convince myself that, no matter which $x_*$ in $J$ you choose, the contraction method will yield $f(x_*)$ in such a way that$$G(x_*, f(x_*)) = 0.$$I think you may be concerned that, if $x_*$ is chosen too far away from $a$, then the contraction method yields a sequence that does not converge, and possibly gets caught in an infinite loop. However, remember that, as part of my move in the game, I choose $\delta$, and then use it to define $J$.

You are therefore constrained on where you can choose $x_*$.
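To make the contraction method concrete, here is a minimal numeric sketch. It assumes the standard choice $\phi(y) = y - G(x_*, y)/D_2G(a, b)$ used in proofs of this style; the example $G$, the point $(a, b) = (0, 1)$, and all numbers below are hypothetical illustrations, not taken from the book.

```python
# Minimal sketch of the contraction method, ASSUMING
# phi(y) = y - G(x, y) / D2G(a, b). The example G below is hypothetical.

def G(x, y):
    return x**2 + y**2 - 1  # zero set: the unit circle; take (a, b) = (0, 1)

D2G_ab = 2.0  # D_2 G(0, 1): the partial derivative 2y evaluated at y = 1

def f(x_star, b=1.0, n_iter=50):
    """Approximate f(x_star) by iterating phi starting at b."""
    y = b
    for _ in range(n_iter):
        y = y - G(x_star, y) / D2G_ab  # y_{n+1} = phi(y_n)
    return y

x_star = 0.1  # a point in J = [a - delta, a + delta], close to a = 0
y_star = f(x_star)
print(G(x_star, y_star))  # essentially 0, i.e. G(x_*, f(x_*)) = 0
```

In this toy example, $\phi'(y) = 1 - y$, so $|\phi'(y)| \le \epsilon$ for $y$ in $[1 - \epsilon, 1 + \epsilon]$; choosing $\epsilon \le {1\over2}$ makes $\phi$ a ${1\over2}$-contraction, mirroring the estimate $|\phi'(y)| \le {1\over2}$.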

So, if you can come up with an example (maybe in pictures) where you see an $x_*$ that leads to an infinite loop, it may be that, as part of my strategy in choosing $J$, I do not allow you to use that $x_*$.

So the theorem says that for every $G$ and $(a, b)$ there exist $J$ and $f$ such that, for every $x_*$ in $J$,$$G(x_*, f(x_*)) = 0.$$It does not say that for every $G$ and $(a, b)$ and $x_*$ there exists $f$ such that$$G(x_*, f(x_*)) = 0.$$That is, in the game, you do not get to choose $x_*$ before I give you $J$.

The "for every" and "there exists" are called quantifiers. Order of quantification is something that often causes confusion.

I do understand where the contraction constant $k={1\over2}$ came from. It is a direct result of the choice of $\epsilon$. I also understand that $\delta$ further restricts the interval in which $x$ lives. What I don't understand is why the particular choice of $\delta$ protects me from a loop condition. I'm also dubious of the $\mathscr{C}^1$ hypothesis. It seems the loop condition involves what I consider to be $\mathscr{C}^2$ properties. To wit, inflection.

Do you agree that, for all integers $n > 0$, for all $y$ in $[b - \epsilon, b + \epsilon]$, we have $f_{n+1}(y) = \phi(f_n(y))$?

Step 1: Prove that $b$, $\phi(b)$, $\phi(\phi(b))$, $\phi(\phi(\phi(b)))$, $\ldots$ are all elements of $[b - \epsilon, b + \epsilon]$.

Step 2: Prove that$$f_0(x_*) = b, \text{ }f_1(x_*) = \phi(b), \text{ }f_2(x_*) = \phi(\phi(b)), \text{ }f_3(x_*) = \phi(\phi(\phi(b))), \ldots$$

Step 3: Prove that the sequence$$b, \text{ }\phi(b), \text{ }\phi(\phi(b)), \text{ }\phi(\phi(\phi(b))), \ldots$$is Cauchy, and so is convergent, and in particular, cannot be a repeating nonconstant sequence. That is, it cannot be an "infinite loop".

Do you agree that, if I can get these three steps proved, then we are done?

Let us handle Step 1.

Step 1: Prove that $b$, $\phi(b)$, $\phi(\phi(b))$, $\phi(\phi(\phi(b)))$, $\ldots$ are all elements of $[b - \epsilon, b + \epsilon]$.

If you look at the third line from the bottom of page 168, you see$$|G(x, b)| \le {1\over2}\epsilon |D_2G(a, b)|.$$This implies the following inequality:$$|\phi(b) - b| \le {1\over2}\epsilon.\tag*{$(1)$}$$Then, because $\phi$ is a ${1\over2}$-contraction, this implies the following inequality:$$|\phi(\phi(b)) - \phi(b)| \le {1\over4}\epsilon.\tag*{$(2)$}$$Then, because $\phi$ is a ${1\over2}$-contraction, this implies the following inequality:$$|\phi(\phi(\phi(b))) - \phi(\phi(b))| \le {1\over8}\epsilon.\tag*{$(3)$}$$Then, because $\phi$ is a ${1\over2}$-contraction, this implies the following inequality:$$|\phi(\phi(\phi(\phi(b)))) - \phi(\phi(\phi(b)))| \le {1\over{16}}\epsilon.\tag*{$(4)$}$$ Etc.
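The pattern in $(1)$, $(2)$, $(3)$, $(4)$ can be compressed into a single statement, proved by induction on $n$: the base case is $(1)$, and each induction step is one application of the ${1\over2}$-contraction property. For all integers $n \ge 1$, writing $\phi^n$ for the $n$-fold composite,$$|\phi^{n}(b) - \phi^{n-1}(b)| \le \left({1\over2}\right)^{n}\epsilon.$$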

By $(1)$,$$|\phi(b) - b | \le {1\over2}\epsilon < \epsilon,$$ so $\phi(b)$ is in $[ b - \epsilon , b + \epsilon ]$.

By $(1)$, $(2)$, and the triangle inequality,$$| \phi(\phi(b)) - b | \le {1\over2}\epsilon + {1\over4}\epsilon < \epsilon,$$ so $\phi(\phi(b))$ is in $[ b - \epsilon , b + \epsilon ]$.

By $(1)$, $(2)$, $(3)$, and the triangle inequality, $$|\phi(\phi(\phi(b))) - b | \le {1\over2}\epsilon + {1\over4}\epsilon + {1\over8}\epsilon < \epsilon,$$ so $\phi(\phi(\phi(b)))$ is in $[ b - \epsilon , b + \epsilon ]$. Etc.
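The three estimates above all follow one pattern: for any integer $n \ge 1$, the triangle inequality gives$$|\phi^{n}(b) - b| \le \sum_{k=1}^{n} |\phi^{k}(b) - \phi^{k-1}(b)| \le \sum_{k=1}^{n}\left({1\over2}\right)^{k}\epsilon = \left(1 - \left({1\over2}\right)^{n}\right)\epsilon < \epsilon,$$so every point of the forward orbit lies in $[b - \epsilon, b + \epsilon]$.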

This finishes Step 1. (Let me know if you understand. If not, try to point out the first statement I made for which you cannot see that it follows from earlier statements.)

Could you look at Step 2, and see if you can figure it out? Just look carefully at the definitions of $f_{n+1}$ and $\phi$. If it does not make sense, let me know, and I will write something about it.

Now let us work on Step 3.

Step 3: Prove that the sequence$$b, \text{ }\phi(b), \text{ }\phi(\phi(b)), \text{ }\phi(\phi(\phi(b))), \ldots$$is Cauchy, and so is convergent, and in particular, cannot be a repeating nonconstant sequence. That is, it cannot be an "infinite loop".

The sequence$$b,\text{ }\phi(b),\text{ }\phi(\phi(b)),\text{ }\phi(\phi(\phi(b))), \text{ }\phi(\phi(\phi(\phi(b)))),\ldots$$is sometimes called the "forward orbit" of $b$ under $\phi$.

A key observation is that, in any forward orbit, each consecutive distance is at most ${1\over2}$ times the preceding one. So every forward orbit has geometrically decaying consecutive distances. Such a sequence is always Cauchy, hence convergent. Here is an explanation of why.

First, note that a sequence may fail to converge even if its consecutive distances go to zero. For example, in the sequence of real numbers$$1,\text{ } 1 + {1\over2}, 1 + {1\over2} + {1\over3}, 1 + {1\over2} + {1\over3}+ {1\over4}, \ldots$$the consecutive distances are$${1\over2},\text{ }{1\over3},\text{ }{1\over4}, \ldots,$$which tend to zero. However, the sequence does not converge to any real number, because it is the sequence of partial sums of the harmonic series, which diverges; see the following.

https://en.wikipedia.org/wiki/Harmonic_series_(mathematics)
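The divergence is easy to see numerically; here is a quick sketch (the thresholds are chosen just for illustration):

```python
# Partial sums of the harmonic series: the consecutive distances 1/n
# shrink to zero, yet the partial sums eventually pass any fixed threshold.

def harmonic_partial_sums(n):
    sums, total = [], 0.0
    for k in range(1, n + 1):
        total += 1.0 / k
        sums.append(total)
    return sums

s = harmonic_partial_sums(100)
print(s[-1] - s[-2])  # the last consecutive distance, 1/100
print(s[-1] > 5)      # True: the partial sum has already crossed 5
```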

On the other hand, if you have a sequence with geometrically decaying consecutive distances, like$$1, \text{ }1 + {1\over2},\text{ }1 + {1\over2} + {1\over4},\text{ }1 + {1\over2} + {1\over4} + {1\over8}, \ldots$$then it is always Cauchy, hence convergent.

For example, note that if $i<j$, then the distance between the $i$th and $j$th terms of$$1, \text{ }1 + {1\over2},\text{ }1 + {1\over2} + {1\over4},\text{ }1 + {1\over2} + {1\over4} + {1\over8}, \ldots$$ is less than or equal to$$\left({1\over2}\right)^i + \ldots + \left({1\over2}\right)^{j-1},$$which is less than or equal to $({1\over2})^{i-1}$. When $i$ is large, $({1\over2})^{i-1}$ can be made as small as you like.
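The same estimate is easy to check numerically; a small sketch (the indices $i$ and $j$ are arbitrary):

```python
# Terms of the geometric example: the m-th term (m = 1, 2, 3, ...) is
# 1 + 1/2 + ... + (1/2)^(m-1). Check |t_j - t_i| <= (1/2)^(i-1) for i < j.

def term(m):
    """m-th term of the sequence: sum of (1/2)^k for k = 0, ..., m-1."""
    return sum(0.5**k for k in range(m))

i, j = 5, 50
gap = term(j) - term(i)
print(gap <= 0.5**(i - 1))  # True: the tail is controlled geometrically
```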

Let me know if all this makes sense. If not, then please let me know the first statement I make that you cannot figure out and I will try to add detail. I am leaving a lot here for you to figure out, so I really do understand if you need more explanation here and there.