A PDF is a probability distribution that describes the relative likelihood of a continuous random variable taking on values in different ranges.
It’s a model of a random process:
→ You can do hypothesis testing with data
→ You can estimate parameters (mean, std, …) with a small sample
The probability of a random variable falling within a range is the integral of the PDF over that range:
→ Probabilities are only meaningful over intervals/ranges, not individual points.
→ The value of can be greater than , as it represents density not probability.
→ for any single point , as it represents an interval of width zero.Example: For a standard normal distribution, , but , however, , representing the probability of falling within that interval.
So while taking sums of probabilities over individual points works in a discrete context (probability mass function):
It doesn’t work in a continuous context: