Radon-Nikodym derivative as a measurable function in a product space
Solution 1:
Let me start by mentioning that the kind of things you are trying to understand are folklore in the worst sense. I can't resist quoting R.W. Thomason (in a completely different context, but equally applicable here): "Most of these facts are well-known in outline, although many people exhibit some confusion and fuzziness on the details when pressed."
Some good references:
Appendix B and Chapter 2 of R.J. Zimmer, Ergodic theory and semisimple groups, Birkhäuser 1984. MR776417
David Fisher, Dave Witte Morris, and Kevin Whyte, Nonergodic actions, cocycles and superrigidity, New York Journal of Mathematics, Volume 10 (2004) 249–269, MR2114789 and the references therein.
For an excellent introduction to the bare minimum on descriptive set theory and selection theorems, I recommend Chapter III of Arveson, An invitation to $C^{\ast}$-algebras, Springer GTM 39, 1979, MR512360.
By far the best introduction to descriptive set theory and the circle of ideas you're considering is A.S. Kechris, Classical descriptive set theory, Springer GTM 156, 1995. MR1321597.
Chapter V of Varadarajan's book The geometry of quantum theory, second edition, Springer 1969, MR805158. In particular the Mackey-Weil theorem is proved in section 6 of that chapter.
If you can manage to read Mackey's original texts, you will certainly only profit, but I found them to be on the extremely indigestible side if you pardon my blunt assessment.
Edit: (in view of Mark's comment below) If your ultimate goal is to learn about the "Mackey machine" and induced representations, you can follow Mark's recommendation and look at Folland's Abstract Harmonic Analysis, Studies in Advanced Mathematics. CRC Press, Boca Raton, FL, 1995. MR1397028.
Another nice text on that topic is Barut–Rączka, Theory of group representations and applications. Second edition. World Scientific Publishing Co., Singapore, 1986. MR0889252.
So, let's look at the math, finally:
I'd like to prove a stronger statement than the one you ask about:
Let $G$ be a standard Borel group and let $X$ be a standard Borel space equipped with a Borel $G$-action and a quasi-invariant probability measure $\mu$. There is a Borel measurable map $\rho: G \times X \to [0,\infty)$ such that $$\int_{X} \rho(g,x)f(x)\,d\mu(x) = \int_{X} f(gx)\,d\mu(x)$$ for all $g \in G$ and all $f \in L^1(X,\mu)$.
You'll find this statement and some applications as Lemma 1.1.1 of Appendix D on page 84 of my thesis (where it is formulated for $G$ Polish but that isn't used here). It is extracted from Fisher–Witte Morris–Whyte's Proposition 2.22.
First of all recall that the space $\mathscr{F}(X)$ of $\mu$-equivalence classes of Borel-measurable functions $X \to [0,\infty)$ is Polish with respect to the topology of convergence in measure. In fact the metric $$d_{\mathscr{F}}(f,g) = \min{\{\varepsilon \geq 0\,:\,\mu(\{|f(s) - g(s)| \gt \varepsilon\})\leq \varepsilon\}}$$ is particularly convenient.
Translating Fisher–Witte Morris–Whyte's Lemma 2.7 into our situation we get:
Lemma. Given a Borel measurable function $f: G \to \mathscr{F}(X)$ there exists a Borel measurable function $\varphi: G \times X \to [0,\infty)$ such that for all $g \in G$ we have $$\varphi(g,x) = f(g)(x) \quad \text{ for almost every } x \in X.$$
Note that his implies the exponential law $\mathscr{F}(G \times X) = \mathscr{F}(G,\mathscr{F}(X))$.
The proof of the lemma is relatively simple: partition $\mathscr{F}(X)$ into countably many disjoint Borel sets $\{D_{i}^{(n)}\}_{i=1}^{\infty}$ of diameter $2^{-n}$, for each $i$ pick $\varphi^{(n)}_i \in D_{i}^{(n)}$ and put $\varphi^{(n)}(g,x) = \varphi^{(n)}_i(x)$ if $f(g) \in D_{i}^{(n)}$. Then it is easy to see that for fixed $g$ the function $\varphi^{(n)}(g, \cdot)$ is Borel and converges a.e. to a limit $\varphi(g,\cdot)$. For details see loc. cit.
Given this, it remains to show that the map $r: G \to \mathscr{F}(X)$ given by $g \mapsto \frac{d(g\mu)}{d\mu}$ is Borel measurable. To see this, choose a countable separating and generating set $\{A_n\}_{n=1}^{\infty}$ of $\Sigma$ and notice that $$ \mathscr{R} = \bigcap_{n=1}^{\infty} \left\{(g,f) \in G \times \mathscr{F}(X)\,:\,\mu(gA_n) = \int_{A_n} f(x)\,d\mu(x)\right\} $$ is the graph of $r$ because the fiber of $\mathscr{R}$ over $g$ is the Radon-Nikodym derivative $\frac{d(g\mu)}{d\mu}$ of the Borel automorphism $x \mapsto gx$ of $(X,\Sigma)$. Hence, it suffices to show that $\mathscr{R}$ is Borel by the Borel graph theorem (e.g. Kechris, Theorem 14.12 on page 88). We're done as soon as we show that $g\mapsto\mu(gA_n)$ and $f \mapsto \int_{A_n} f\,d\mu$ are Borel on $G$ and $\mathscr{F}$, respectively. For the latter map this is an application of the monotone convergence theorem, see Fisher–Witte Morris–Whyte, Lemma 2.17, and for the former we write $gA_n \subset G \times X$ as $\psi(\{g\} \times A_n) \cap (\operatorname{pr}_G)^{-1}(\{g\})$ where $\psi: G \times X \to G \times X$ is the Borel automorphism $\psi(g,x) = (g,gx)$, hence it maps Borel sets to Borel sets and measurability of $g \mapsto \mu(gA_n)$ now follows e.g. from Kechris's theorem 17.25 on page 113.
Solution 2:
Just to add to Theo's excellent answer, I will quote in full Exercise 6.10.72 from page 63 of V. Bogachev's Measure Theory (volume 2). A long hint is also given there.
Let $(X,{\cal A},\mu)$ be a probability space, ${\cal A}$ a countably generated $\sigma$-algebra, $(T,{\cal B})$ a measurable space, and let $\mu_t$, where $t\in T$, be a family of bounded measures on $\cal A$ absolutely continuous with respect to $\mu$ such that for every $A\in {\cal A}$, the function $t\mapsto \mu_t(A)$ is measurable with respect to $\cal B$. Prove that one can find an ${\cal A}\otimes{\cal B}$-measurable function $f$ on $X\times T$ such that for every $t\in T$, the function $x\mapsto f(x,t)$ is the Radon-Nikodym density of the measure $\mu_t$ with respect to $\mu$.