Why are removable discontinuities even discontinuities at all?

It comes down to what we mean by a function. In specifying the function, we need to specify not just the "rule", but also the domain, i.e., what are you allowed to put into the function?

In particular, to answer your question directly: if we want $f(x)=x+3$ to be defined at $x=2$, then we aren't allowed to say things like $$``f(x)=f(x)\cdot\dfrac{x-2}{x-2},\,''$$ because the expression on the right is not defined at $x=2$.

I think it might be the norm to brush over this kind of thing in introductory calculus courses, because in such a course we tend to assume that the function is defined "everywhere it makes sense" (a frame of mind which is convenient, but carries many problems with it besides just this).

To be a bit more explanatory, if you just say the function $f$ is specified by $$f(x) = \frac{x^{2}+x-6}{x-2},$$ then you haven't specified the function enough to be satisfactory for a really rigorous treatment. In particular, is your function defined at $x=2$, or is it not? Given the way that you have written it down, I would assume that it is not, since, as you point out, we cannot divide by $0$.

The way we deal with this in a rigorous manner is to require the definition of the function to specify the domain. For example, you could say that $f$ has domain $\mathbb{R}$ and is given by $$f(x) = \begin{cases}\dfrac{x^{2}+x-6}{x-2}, & x\neq2, \\ 5, & x=2.\end{cases}$$ Note that the only reason this makes sense is that your function is specified for all $x$ in the domain (in particular, it is clear that there is, for this value of $f(2)$, no discontinuity at all). If we had said "$f$ has domain $\mathbb{R}$ and is given by $$f(x) = \dfrac{x^{2}+x-6}{x-2}$$ for all $x$" then we would run into problems: we claim that $f$ has domain $\mathbb{R}$, but $2\in\mathbb{R}$ and $f(2)$ is not defined.

By contrast, you could specify your function as the function with, for example, domain $[0,1]$ given by $$f(x)=\dfrac{x^{2}+x-6}{x-2},$$ when we see that there is, in a sense, no discontinuity at $x=2$ since $f(2)$ has not been defined. In this case, it makes sense to say $$f(x)=f(x)\cdot\dfrac{x-2}{x-2},$$ since this expression now makes sense for all of the values in the domain of the function. It also now makes sense to make the cancellation and say $f(x)=x+3$, because our knowledge of the domain of the function ensures that we are not dividing by $0$.


What's stopping you from multiplying [a continuous function] by $1$ in the form $\frac{x−2}{x−2}$? Can multiplying by $1$ really introduce a discontinuity to the function?

Ah, but $\frac{x−2}{x−2}$ is not equal to $1$. Or, to be more, precise, it is only equal to $1$ for values of $x$ that are different from $2$. If $x=2$, then $\frac{x−2}{x−2}$ is undefined.

And that's more or less the answer to your question: If you take a given function $f(x)$ that is continuous on all of $\mathbb{R}$, and you multiply it by a function $g(x)$ that is equal to $1$ everywhere except at a single isolated point, at which point $g(x)$ is undefined, then the product $f(x)g(x)$ will be exactly the same as $f(x)$ everywhere except at that point, where it, too, will be undefined.


Neither the function $$f(x)=\frac{x^2+x-6}{x-2}$$ nor the function $g(x)=x+3$ has a discontinuity at $x=2.$ In the first case, it is because $f$ is not defined at $x=2,$ so is not (continuous or discontinuous) there. In the latter case, it is continuous there.

Now, consider the family of functions $h_\alpha(x)$ given by $$h_\alpha(x)=\begin{cases}x+3 & x\ne 2\\\alpha & x=2,\end{cases}$$ where $\alpha$ is a real constant. The function $h_5(x)$ is simply $g(x),$ expressed in a more complicated form. For any real $\alpha\ne5,$ $h_\alpha(x)$ has a removable discontinuity at $x=2.$ What this means is that we can make $h_\alpha(x)$ into a (different) function that is continuous at $x=2$ by choosing a different value for $h_\alpha(2).$

Consider the function $i(x)=\frac{x-2}{x-2},$ now. Note that $i(2)$ is undefined, and so $i(x)$ doesn't make sense unless $x\ne2.$ And, of course, for $x\ne2,$ we have $i(x)=1.$ Now, in order to multiply $g(x)$ by $i(x),$ both of these have to be defined, and so we have to require that $x\ne 2$ once again. So, while $g(x)$ is defined everywhere, $f(x)=g(x)\cdot i(x)$ is not.


Added: It's worth noting that the sources to which you linked in the comments are not using the standard definition. (However, this abuse of terminology is far from uncommon, even among textbooks!) Moreover, their definitions don't even agree with each other!

The first source (or, at least, what I can read of it) seems to adhere to an informal definition that suggests the following formal definition:

A function $f$ has a removable discontinuity at a point $x_0$ if there is a number $L$ such that $\lim_{x\searrow x_0}f(x)=L=\lim_{x\nearrow x_0}$ and either (i) $f(x_0)$ is undefined or (ii) $f(x_0)$ is defined and is not equal to $L.$ In the former case, we say that $f$ has a hole at $x_0.$ In the latter case, we say that $f$ has a created discontinuity at $x_0$.

The terminology "hole" (and this definition of it) is fairly standard. However, the definition of "created discontinuity" (a term I've never seen before) is exactly the standard definition of "removable discontinuity"!

The sketched graph provided there gives an example of a hole, and the source also seems to refer to a graph giving an example of a "created discontinuity," but it isn't visible to me.

The second source, on the other hand, conflates the terms "hole" and "removable discontinuity," as opposed to your first source (which treats holes as special cases of removable discontinuities) and the standard usages (in which holes and removable discontinuities are mutually exclusive). Moreover, its attempt at a formal definition is at best informal, and at worst self-contradictory!

Formally, a removable discontinuity is one at which the limit of the function exists but does not equal the value of the function at that point; this may be because the function does not exist at that point.

The part preceding the semicolon is the usual definition, as, by speaking of "the value of the function at that point," it is implied that the function takes a unique value at that point! However, this is directly contradicted by the part after the semicolon. Thus, if we attempt to make it formal, we fail. If we leave it informal, we can ignore the implication and thereby avoid the contradiction (but not potential confusion).

The graph provided there gives an example of a removable discontinuity (as usually defined), but not of a hole (as usually defined).