Probability Spaces
A probability function takes an event, and tells us how likely it is as a number from to . For example, if I roll a (fair) die, the event that it came out four has probability .
What is the probability that it will be either three or four? Let be the event that it is three, and the event that it is four, then the event that it is either is denoted . Note that it is impossible that die is both three and four, we call such events disjoint. Since and are disjoint, we get that the probability either happens is the sum of probabilities the do happen. There's a one in six chance to get three, and a one in six chance to get four, hence there's a two in six chance to get either. Or in math:
When events are not disjoint, we cannot sum them up like this. For example, the probability that the result is odd is three in six, or one half. The probability that the result is at most three is also three in six, or one half. So the probability that the result is either odd or at most three is half + half = one? Of course not, because four is neither. when summing the two we overlook the fact that a result can be both odd and at most three, making us count these events twice. We said that three in six are odd and three in six are at most three, but three and one are both. We overcounted.
Exercise: Given two events and , let be the event that they both happen (if they are disjoint, then is the empty event that satisfies ), convince yourself that
Atomic Events
We say that an event contains another event , and denote , if is guaranteed to happen given that happens. For example, the event is contained in the event that is even. An event is called atomic if it has positive probability, but it does not contain any other event that has positive probability. For example, a fair die has six atomic events, for each we have the atomic event .
It is conventional to call the set of atomic events . For example describes the atomic events of any six-sided die. The die doesn't have to be fair, but it does have to satisfy that each of its six sides has some probability to be the result.
Exercise: prove that any two different atomic events are disjoint
The set is also commonly called the "sample space", as it describes all possible results of an experiment.
Finite and Discrete Probability Spaces
Given a set of atomic events , we can endow them with a probability function a probability function that assigns to each atomic event a number . The pair is called a discrete probability space if the probabilities of all atomic events sum to one. In other words, if all events can be described as a union of atomic events.
Note that since atomic events are disjoint, the definition of naturally extends to all events, simply as the sum of atomic events that comprise them. For example, if we be the event that the result is at most three, then we have
If we want to describe a fair die, we will define and for any , and we will indeed get that the probability that the die is less than three is
With a bit of thought, one can convince herself that if is finite then any probability space over is discrete. However, a space can be discrete also if is infinite. For example, lets say I flip a fair coin util it lands on head, how many flips would it take? Well, for any positive integer there is some probability that it will take attempts. So our sample space is has an event for any positive natural numbe .
The probability that a single flip of a fair coin will land on head is . For it to take exactly two attempts means that the first flip landed on tails and the second landed on heads, which has a probability (as it is one of four equally likely results of flipping two coins). It would take exactly three attempts with probability and so on. The pattern is quite obvious: the probability it would take exactly flips is , which we can more comfortable write as .
We said that the probabilities sum to one, and indeed if we let be the event that it took flips, we get that
I will not prove the last equality, but I can provide a beautiful visual intuition:
We can now ask ourselves, say, what is the probability this would take at most ten tries, and compute that the result is
Exercise: prove a more generalized version of the equality above, namely that the probability it would take at most coin flips is . Hint, recall the finite geometric sum formula: for any we have that
We can even look at infinite unions. For example, we can ask ourselves what is the probability that it took us an even number of attempts? One can compute that this comes out to
Exercise: prove the above inequality. Hint, recall the infinite geometric sum formula: if then
Remark: When is a probability space not discrete? We won't talk about them much, and definitely not introduce them formally, but there could be continuous probability spaces. For example, we can uniformly sample real number between and . One can easily see that the probability to sample any particular number is . Worse yet, for any the event "we sampled a number between and " is not atomic, as it strictly contains the event "we sampled a number between and " whenever . So such events are never atomic, and a similar approach shows that, in fact, there are no atomic events at all in this space. This means that our definitions are too narrow to expend to the continuous case, and while that definitely can be done, we have no reason to do so.
Last updated