The exponential distribution is a continuous probability distribution often used to model the time elapsed between events. A key characteristic is its memoryless property.
Applications
The exponential distribution is useful for modeling various real-world phenomena, such as:
- Lifetimes of light bulbs or electronic components.
- Time until radioactive decay of an element.
- Service times, for example a call center.
Exponentially distributed random variable
A continuous random variable follows an exponential distribution with a rate parameter , denoted as .
Its Probability Density Function (PDF) is given by:
Here, is the rate of the process , where is the expected value of .
So the mean of the distribution is , the variance is , and the standard deviation is too.
Memoryless Property
The memoryless property means that the probability of an event occurring in the future is independent of how much time has already passed.
Mathematically, for :“If amount of time has already passed, the probability that it last another amount of time after that is the same as the distribution starting from /now again.” (continually resetting probability)
This makes it suitable for modeling components that do not “age” in the sense of becoming more or less likely to fail over time, given they haven’t failed yet.
Hazard Rate
The hazard rate (or failure rate) represents the instantaneous rate of failure at time , given survival up to that time. For the exponential distribution, it’s defined as:
The constant hazard rate is a direct consequence of the memoryless property - the failure rate doesn’t change over time. This means an exponentially distributed component is just as likely to fail in the next instant whether it’s brand new or has been running for years.
The relationship (given I’ve lasted time, what’s the probability I’ll last another ) tells us that for a small time interval , the probability of failure is approximately times the length of that interval, regardless of how long the component has already survived.
\begin{align*}
P(T<t + dt | T > t) &= 1 - P(T> t+dt|T>t) \
&= 1 -P(T>t) \quad \text{memoryless property} \
&= 1-e^{-\lambda dt}
\end{align*}When we do [[taylor expansion]] for the exponential and make a small $dt$ approximation, we get the hazard rate:= 1- [1 - \lambda dt + \frac{1}{2}\lambda^{2} dt^{2} - \dots]
\approx \lambda dt$\implies P(t \lt T \le t + dt) = \lambda P(T \gt t)dt$
Exercise 1: Derive the Cumulative Distribution Function (CDF)
The Cumulative Distribution Function is defined as . We want to show that for , .
- Define the PDF :
- Consider the case for :
For , over the entire integration range .
- Consider the case for :
The integral is split based on the definition of :Since for :
- Evaluate the integral:
- Substitute the limits:
Since :
Exercise 2: Verify that is a valid Probability Density Function (PDF)
To verify that (as defined below) is a valid PDF, we need to show two conditions:
- for all .
- .
Condition 1: for all
The PDF is defined as:
- For , , which is .
- For , . Since (rate parameter) and is always positive for any real , their product is also .
Thus, for all .
Condition 2:
We need to integrate over its entire domain:Since for :
Now, evaluate the definite integral:
This involves a limit:
Since , as , , so . And .
Both conditions are satisfied, so is a valid PDF.