Notation for probability density

On one of my other questions here, I was criticized (and rightly so, as it was the source of my mistake) for using this notation for a continuous random variable $X$ with pdf $f(x)$: $$\mathbb{P}\{X\in dx\} = f(x) \mathop{dx} .$$ I was taught this notation by my teacher. The purpose of this notation is to stress the fact that technically, $\mathbb{P}\{X=x\}=0$ if $X$ is continuous. I believe the notation is shorthand for $\mathbb{P}\{X\in [x, x+dx)\} = f(x) \mathop{dx}$.

As I am only familiar with this notation and didn't know that other people did not use it, I would really appreciate it if someone could give me a better sense of what notation is more accepted/used. Is the $f$ and $F$ notation of pdf and cdf respectively widely used?

Given a pdf $f(x)$ for a continuous random variable $X$, its cdf is $F(x)=\int_{-\infty}^x f(t)dt=\mathbb{P}\{X<x\}$ (thanks for catching the typo, André!), which corresponds to the discrete case where $F(x) = \sum_{k=0}^x f(x) = \mathbb{P}\{X\le x\}$. Is there just no continuous analog for the discrete expression of $\mathbb{P}\{X=x\}=f(x)$? One piece of advice that I was given by another user was to deal primarily with cdfs and not with pdfs.

Thank you in advance, and apologies if this was not the right place to ask this question.


Solution 1:

The notation $\mathbb P(X\in\mathrm dx)=f_X(x)\mathrm dx$ is odd, even as a shorthand, since $\mathrm dx$ can only mean an infinitesimal interval near $0$ and one wants to indicate an interval at $x$ or around $x$. The notation $\mathbb P(X=x)=f_X(x)$ is of course absurd since $\mathbb P(X=x)=0$ when $f_X$ exists. The notation $\mathbb P(X\in(x,x+\mathrm dx))=f_X(x)\mathrm dx$ is fine (and widely used). Personally I would not be too picky about using $(x,x+\mathrm dx)$ or $[x,x+\mathrm dx)$ as the infinitesimal interval here, since, anyhow, these are just semi-rigorous shorthands.

Solution 2:

The notation $F_X$ is very widely used to represent the cumulative distribution function (cdf) of a random variable $X$, and is defined as $F_X(x)=P[X\le x]$. This is valid whether or not $X$ is absolutely continuous. The probability density function, $f_X$, is only well-defined when $X$ is absolutely continuous, in which case $F_X$ is differentiable and $f_X=F'_X$. In this case only, your shorthand is legitimate, because $$ f(x)=F'(x)=\lim_{\Delta x\rightarrow 0}\frac{F(x+\Delta x)-F(x)}{\Delta x}=\lim_{\Delta x\rightarrow 0}\frac{1}{\Delta x}P[X > x \wedge X\le x+\Delta x]. $$ In terms of the cdf, the probability that $X$ takes on a particular value $x$ is $$ P[X=x]=P[X\le x]-\lim_{y\rightarrow x-}P[X\le y]=F_X(x)-F_X(x-), $$ i.e., the difference between $F_X$'s value at $x$ and its limiting value approaching $x$ from below. Of course, this can only be nonzero if $F_X$ has a discrete component.