PGD01C01
Module 5 · Probability Theory

Discrete and Continuous Random Variables

Core Titles
Key headlines and terms for quick recall
  • Random variable (RV) X:ΩRX : \Omega \to \mathbb{R}
  • Discrete vs Continuous RV
  • Probability mass function (PMF) p(x)=P(X=x)p(x) = P(X = x)
  • Probability density function (PDF) f(x)f(x)
  • Cumulative distribution function (CDF) F(x)=P(Xx)F(x) = P(X \le x)
  • Standard distributions: Bernoulli, Binomial, Poisson, Uniform, Exponential, Normal
Basic Idea
What it is, why it matters, how it works

Random variable

An RV is a function X:ΩRX: \Omega \to \mathbb{R} that assigns a real number to each outcome.

  • Discrete — takes countably many values (coin flips, counts).
  • Continuous — takes values in an interval (time, measurements).

Discrete RV — PMF

p(x)=P(X=x),xp(x)=1.p(x) = P(X = x), \quad \sum_x p(x) = 1.

Examples.

  • Bernoulli(pp): X{0,1}X \in \{0, 1\}; P(X=1)=pP(X=1) = p.
  • Binomial(n,pn, p): number of successes in nn Bernoulli trials. P(X=k)=(nk)pk(1p)nkP(X=k) = \binom{n}{k} p^k (1-p)^{n-k}.
  • Poisson(λ\lambda): P(X=k)=eλλkk!P(X=k) = \dfrac{e^{-\lambda} \lambda^k}{k!} — rare event counts.

Continuous RV — PDF

f(x)0f(x) \ge 0, f(x)dx=1\int_{-\infty}^\infty f(x) \, dx = 1. Probabilities come from integrating: P(aXb)=abf(x)dx.P(a \le X \le b) = \int_a^b f(x) \, dx. Important: P(X=c)=0P(X = c) = 0 for any single point.

Examples.

  • Uniform(a,b)(a,b): f(x)=1baf(x) = \dfrac{1}{b-a} on [a,b][a, b].
  • Exponential(λ\lambda): f(x)=λeλx,x0f(x) = \lambda e^{-\lambda x}, x \ge 0 — waiting times.
  • Normal(μ,σ2\mu, \sigma^2): f(x)=12πσexp ⁣((xμ)22σ2)f(x) = \dfrac{1}{\sqrt{2\pi}\sigma} \exp\!\left(-\dfrac{(x-\mu)^2}{2\sigma^2}\right).

CDF

F(x)=P(Xx).F(x) = P(X \le x). Defined for both discrete and continuous. Non-decreasing, limxF=0\lim_{x \to -\infty} F = 0, limxF=1\lim_{x \to \infty} F = 1.

For continuous XX: F(x)=f(x)F'(x) = f(x).

Why this matters in Data Science

Every dataset is modeled by RVs and their distributions. Choosing the right distribution drives generative models, MLE, hypothesis testing.

Mind Map
Visual structure of the concept
RANDOM VARIABLES
├── X : Ω → ℝ
├── DISCRETE
│   ├── PMF p(x) = P(X=x)
│   ├── Σ p(x) = 1
│   └── Bernoulli, Binomial, Poisson, Geometric
├── CONTINUOUS
│   ├── PDF f(x) ≥ 0, ∫f = 1
│   ├── P(a≤X≤b) = ∫ₐᵇ f dx
│   └── Uniform, Exponential, Normal
└── CDF F(x) = P(X ≤ x)
    ├── Non-decreasing
    ├── Limits 0 and 1
    └── Continuous: F'(x) = f(x)
Exam Q&A
Part A (2 marks) and Part B (20 marks) style questions

Part A (2 marks each)

Q1. Define random variable. A measurable function X:ΩRX : \Omega \to \mathbb{R} that assigns a real number to each outcome.

Q2. State the PMF of Binomial(n,p)\text{Binomial}(n, p). P(X=k)=(nk)pk(1p)nk,  k=0,1,,nP(X = k) = \binom{n}{k} p^k (1-p)^{n-k}, \; k = 0, 1, \dots, n.

Q3. State the PDF of Exponential(λ)\text{Exponential}(\lambda). f(x)=λeλx,  x0f(x) = \lambda e^{-\lambda x}, \; x \ge 0.

Q4. Define CDF and state two properties. F(x)=P(Xx)F(x) = P(X \le x). It is non-decreasing and right-continuous, with limF=0\lim_{-\infty} F = 0 and limF=1\lim_{\infty} F = 1.


Part B (20 marks)

Q. Differentiate discrete and continuous random variables. Discuss Bernoulli, Binomial, Poisson, Uniform, Exponential and Normal distributions with their PMF/PDFs, means and variances. Verify that the binomial mean is npnp.

Discrete vs continuous.

DiscreteContinuous
Countable valuesUncountable, interval
PMF p(x)=P(X=x)p(x) = P(X=x)PDF f(x)0f(x) \ge 0, f=1\int f = 1
P(X=c)P(X=c) can be positiveP(X=c)=0P(X=c) = 0
Sum: xp(x)=1\sum_x p(x) = 1Integral: f=1\int f = 1
CDF is step functionCDF is continuous

Standard distributions.

NamePMF / PDFMeanVarianceUse
Bernoulli(pp)P(1)=p,P(0)=1pP(1)=p, P(0)=1-pppp(1p)p(1-p)single trial
Binomial(n,pn, p)(nk)pk(1p)nk\binom{n}{k}p^k(1-p)^{n-k}npnpnp(1p)np(1-p)# successes in nn trials
Poisson(λ\lambda)eλλk/k!e^{-\lambda}\lambda^k / k!λ\lambdaλ\lambdarare events
Uniform(a,b)(a, b)1ba\dfrac{1}{b-a} on [a,b][a,b]a+b2\dfrac{a+b}{2}(ba)212\dfrac{(b-a)^2}{12}equal-likelihood
Exponential(λ\lambda)λeλx\lambda e^{-\lambda x}1/λ1/\lambda1/λ21/\lambda^2waiting time
Normal(μ,σ2\mu, \sigma^2)12πσe(xμ)2/(2σ2)\dfrac{1}{\sqrt{2\pi}\sigma}e^{-(x-\mu)^2/(2\sigma^2)}μ\muσ2\sigma^2natural variation

Derivation of binomial mean.

A binomial RV X=i=1nXiX = \sum_{i=1}^n X_i where each XiBernoulli(p)X_i \sim \text{Bernoulli}(p) is independent.

E[Xi]=0(1p)+1p=pE[X_i] = 0 \cdot (1-p) + 1 \cdot p = p.

By linearity (no independence needed for the mean): E[X]=E[Xi]=E[Xi]=np.E[X] = E\left[\sum X_i\right] = \sum E[X_i] = np.

Direct calculation also works: E[X]=k=0nk(nk)pk(1p)nk=npk=1n(n1k1)pk1(1p)nk=np1=np.E[X] = \sum_{k=0}^n k \binom{n}{k} p^k (1-p)^{n-k} = np \sum_{k=1}^n \binom{n-1}{k-1} p^{k-1} (1-p)^{n-k} = np \cdot 1 = np.