Lesson 19 Marginal Distributions
Motivating Example
In Lesson 18, we found the joint distribution of \(X\), the number of bets that Xavier wins, and \(Y\), the number of bets that Yolanda wins. If all we have is the joint distribution of \(X\) and \(Y\), can we recover the distribution of \(X\) alone?
Theory
Recall from Lesson 10 that the p.m.f. of \(X\) is defined to be \(P(X = x)\) as a function of \(x\). To calculate a probaebility from a joint p.m.f., we sum over the relevant outcomes. In this case, we need to sum the joint p.m.f. over all the possible values of \(y\) for the given \(x\): \[\begin{equation} P(X = x) = \sum_y f(x, y). \tag{19.1} \end{equation}\]
If the joint p.m.f. is written out in table form, then (19.2) corresponds to the column sums of the table, as illustrated in Figure 19.1.Notice how natural it was to write the column totals in the margins of the table in Figure 19.1. For this reason, this collection of probabilities has come to be known as the marginal distribution of \(X\).
There is also a marginal distribution of \(Y\). As you might guess, the marginal p.m.f. is symbolized \(f_Y\) and is calculated by summing over all the possible values of \(X\): \[\begin{equation} f_Y(y) \overset{\text{def}}{=} P(Y=y) = \sum_x f(x, y). \tag{19.3} \end{equation}\] On a table, the marginal distribution of \(Y\) corresponds to the row sums of the table, as illustrated in Figure 19.2.
Remember that we know the distribution of \(Y\). It is \(\text{Binomial}(n=5, p=18/38)\). You should verify that the marginal distribution we calculated in 19.2 matches that of a \(\text{Binomial}(n=5, p=18/38)\) distribution.
Let’s calculate another marginal distribution—this time from the formula representation of the joint p.m.f.
Essential Practice
- Let \(X\) be the number of times a certain numerical control
machine will malfunction on a given day. Let \(Y\) be the number of times
a technician is called on an emergency call. Their joint p.m.f. is given by
\[ \begin{array}{rr|cccc}
& 5 & 0 & .20 & .10 \\
y & 3 & .05 & .10 & .35 \\
& 1 & .05 & .05 & .10 \\
\hline
& & 1 & 2 & 3\\
& & & x
\end{array}. \]
- Calculate the marginal distribution of \(X\).
- Calculate the marginal distribution of \(Y\).
- Are \(X\) and \(Y\) independent? How do you know?
Use the joint p.m.f. of the smaller and the larger of two dice rolls that you calculated in Lesson 18 to find the p.m.f. of the larger number. Use this p.m.f. to solve the “last banana” problem from Lesson 7.
- Suppose two random variables \(X\) and \(Y\) both have marginal \(\text{Binomial}(n=3, p=0.5)\)
distributions. In this exercise, you will see that there are many joint distributions that could
have those marginal distributions.
- What is the joint p.m.f. if \(X\) and \(Y\) are independent?
- Can you find at least 2 more joint p.m.f.s with the same marginal distributions? (Hint: What happens if you define \(X\) and \(Y\) based on just 3 tosses of a coin? What about 4 tosses of a coin?)
Additional Exercises
Two tickets are drawn without replacement from a box with \(N_1\) \(\fbox{1}\)s and \(N_0\) \(\fbox{0}\)s. Let \(X\) be the number of \(\fbox{1}\)s on the first draw and \(Y\) be the number of \(\fbox{1}\)s on the second draw. It is clear that the first draw has a \(\frac{N_1}{N}\) probability of being a \(\fbox{1}\). But what about the second draw?
In Lesson 18, you found the joint distribution of \(X\) and \(Y\). Use this joint p.m.f. to show that the probability that the second draw is a \(\fbox{1}\) is \(\frac{N_1}{N}\).