Denoted by f. Does not give the probability at an exact point (which is always 0 for continuous variables). Instead, gives the relative likelihood of the random variable taking on a value in a small interval around a point.
The probability of an event is the integral of the PDF over the region corresponding to the event:
P(a≤X≤b)=∫abf(x)dx
Probability of all the values combined, is 1:
∫−∞∞f(x)dx=1
Mean
The integral is taken over the entire domain of the random variable
E(X)=μ=∫−∞∞x⋅f(x)dx
Here f(x) is the probability density function.
Variance
Var(X)=σ2=∫−∞∞(x−μ)2⋅f(x)dx
An equivalent computational formula is:
Var(X)=∫−∞∞x2⋅f(x)dx−μ2
Covariance
Cov(X,Y)=E(XY)−E(X)E(Y)
Cumulative Distribution Function
F(x)=∫−∞xf(t)dt
Example
Consider a continuous random variable X that follows a uniform distribution over the interval [0, 4], with probability density function:
f(x)={0.250for 0≤x≤4otherwise
The mean would be: E(X)=∫04x⋅41dx=41⋅2x204=41⋅216=2.
The variance would be: Var(X)=∫04(x−2)2⋅41dx=41⋅1216=34≈1.33.
Types
Uniform Distribution
A type of continuous probability distribution where all outcomes are equally likely within a specified range. Has the PDF:
It’s often used to model real-world phenomena like heights, test scores, or measurement errors, where most values are near the mean, and fewer occur as you move away from it.
Standard normal distribution
A special case of the normal distribution where μ=0 and σ=1. Denoted by N(0,1). Has PDF:
f(x)=2π1exp(−2x2)
Chi-square Distribution
Distribution of the sum of the squares of k independent standard normal random variables, where k is the degrees of freedom. Has the PDF:
Used in tests like the Chi-square goodness-of-fit test and tests for independence in contingency tables.
Student’s t-distribution
A more conservative form of the standard normal distribution. Has heavier tails than the standard normal distribution.
Denoted by X∼t(v). Here the parameter v is the degrees of freedom, and is often v=n−1. Gets close to standard normal distribution as v→∞. Centered at zero.
For a Student’s t-distribution, with v>2:
Var(X)=v−2v
Used when:
Sample size is small: n<30
Population standard deviation is unknown
Noncentral t-distribution
A generalization of the Student’s t-distribution. Denoted as X∼t(v,δ). Centered at δ.