Given two events
A and
B, with their respective probabilities
P(A) and
P(B),
the probability of their union is given by the additive rule:
Note: Only in the case that A and B are disjoint (mutually exclusive) it simplifies to:
Although it's a simple formula, the way to derive it is not so straightforward. There are actually (only)
three axioms of probability, out of which everything was constructed:
1. axiom states that probability cannot be negative. For any event
Ei (from event space
F) its probability is at least 0:
2. axiom basically says that “something must happen” (the formal definition somewhat differs in source texts, but let’s stick to
WolframMathworld definition). The event space
F that contains
N (could be also infinity) elements is defined as:
Then the axiom simply states that:
That you can read as
“probability of the entire event space is 1” or alternatively
“probability that at least one event happens is 1”.
3. axiom is actually the very statement about disjoint events. If any
E1 and
E2 events are mutually exclusive
Then probability of their union is equal to sum of their probabilities:
Or in an extended version for n mutually exclusive events:
So after we know our three axioms, let’s get back and express probability of union
A and
B. With aid of Venn diagram
the union A∪B can be decomposed into three disjoint sets:
So then with aid of our axioms:
What to do with three or more events? Actually the union can be always decomposed into disjoint sets thanks to the mathematical
inclusion-exclusion principle. It is somewhat mind boggling way of decomposing a union of sets by addition and subtraction of subset intersections. For three events
A,
B,
C it leads to: