Probability Spaces, Conditional Probability and Independence
Core Titles
Key headlines and terms for quick recall- Sample space , event
- Probability axioms (Kolmogorov): , , -additivity
- Conditional probability
- Independence
- Law of Total Probability
- Bayes' theorem
Basic Idea
What it is, why it matters, how it worksProbability space
A probability model has three pieces:
- Sample space — all possible outcomes.
- -algebra — the events we measure (subsets of ).
- Probability measure — function satisfying Kolmogorov's axioms.
Axioms.
- for every event .
- .
- -additivity: for disjoint events , .
Useful consequences.
Conditional probability
If has occurred, what's the probability of ?
Independence
and are independent iff Equivalently — knowing doesn't change the probability of .
Law of Total Probability
If partitions :
Bayes' theorem
"Invert conditioning" — update prior into posterior using likelihood .
Why this matters in Data Science
- Naive Bayes classifier is direct Bayes.
- Bayesian inference / probabilistic ML / Markov chains.
- A/B testing rests on probability axioms.
- Conditional independence is the spine of Bayesian networks.
Mind Map
Visual structure of the conceptPROBABILITY SPACES
├── (Ω, ℱ, P)
├── Axioms
│ ├── P(A) ≥ 0
│ ├── P(Ω) = 1
│ └── σ-additivity
├── P(A∪B) = P(A) + P(B) − P(A∩B)
├── Conditional P(A|B) = P(A∩B)/P(B)
├── Independence P(A∩B) = P(A)P(B)
├── Total Probability Σ P(A|Bᵢ)P(Bᵢ)
└── Bayes P(A|B) = P(B|A)P(A)/P(B)
Exam Q&A
Part A (2 marks) and Part B (20 marks) style questionsPart A (2 marks each)
Q1. State Kolmogorov's axioms of probability. ; ; for disjoint events, .
Q2. Define conditional probability. for .
Q3. Define independent events. and are independent iff .
Q4. State Bayes' theorem. .
Part B (20 marks)
Q. State and prove Bayes' theorem from the definition of conditional probability and the law of total probability. A disease affects 1% of a population. A test detects it 99% of the time but has a 5% false-positive rate. If a person tests positive, what is the probability they actually have the disease?
Bayes' theorem. If is a partition of with and is any event with :
Proof. From the definition of conditional probability:
So .
By the law of total probability, .
Substituting gives Bayes' formula. ∎
Disease example. Let = event "has disease", = event "tests positive".
Given.
- Sensitivity:
- False positive:
Total probability. .
Bayes.
Interpretation. Even with a 99%-accurate test, a positive result means only about 1 in 6 people truly have the disease — because the disease is rare, false positives dominate. This is the base-rate fallacy and the classic motivation for Bayesian reasoning.