Integration with respect to a measure
I am trying to get an explanation in words, or math, of what the $d\mu$ means in an integration statement. Such as: $$\int f \ d\mu$$ How does the measure change our old "calculus" notion of integration? What is going on here that is different?
Solution 1:
I actually struggled with this concept in grad school since I was studying applied math and was sort of thrust into higher level theory without building it up rigorously like I assume would be done in a pure math program.
If we are integrating over a space $X$, I sometimes prefer the notation $\int_X f(x)\mu(dx)$. I like to think of it as splitting the space we are integrating over into infinitesimal pieces, but we have to take the measure of those infinitesimal pieces as they may not all be identical under $\mu$. I'm used to working with probability measures, and at least in that case, you can often think of it similar to the way Lebesgue and Riemann integration are developed.
Create a disjoint partition $X=\cup_{k=1}^N A_k$, and define the sum which will approximate the integral using appropriately chosen sample points $x_k\in A_k$.
$$\int_X f(x)\mu(dx) \approx \sum_{k=1}^N f(x_k) \mu(A_k).$$
Ideally, taking the limit as $N\rightarrow\infty$ (carefully refining the partition and choosing appropriate sample points as $N$ increases) will make the sum converge to the integral. What we are doing here is effectively approximating the function $f$ with a simple function which is constant on a finite collection of measurable sets.
Solution 2:
Think of it physically: each measure assigns different weights to given sets: consider for example the particular case $d\mu=df(x)=f'(x)dx$ for a well behaved $f(x)$. Here you can really see the difference between the "ordinary" measure $dx$, which does not care about the location of the set, and $f'(x)dx$, which indeed does! In formulas: \begin{equation} \int_{[0,1]}dx =1= \int_{[1,2]}dx \end{equation} But in general \begin{equation} \int_{[0,1]}df(x) = f(1)-f(0) \neq f(2)-f(1) = \int_{[1,2]}df(x) \end{equation} This is just an example, but the idea applies with general measures $\mu$; it is in this sense that this perspective conveys the concept - in my view - of a weighted measure. It allows far more general and powerful structures than the old Riemann integration idea with shrinking rectangles.
One can then go further and (try to) relate different measures, for example via the Radon-Nikodym theory. There's a new world out there. :-)