Full Joint Distribution Table
A full joint probability distribution specifies the probability of every possible combination of variable values.
For n binary variables:
- 10 variables → 1,024 entries
- 20 variables → 1,048,576 entries
- 30 variables → 1,073,741,824 entries
- Impossible to store or learn!
Factorization using BNs:
- k = max parents per node
- 30 variables, k=3 → ~240 parameters
- Reduction: 1 billion → 240!
- Makes learning and inference tractable
Key Insight: Bayesian Networks achieve this massive reduction by exploiting conditional independence — not all variables depend on all others. The graph structure tells us which dependencies matter.