8  Random Variables

In the previous part, we calculated the probability of different events. We described each event as a set of outcomes. It is usually easier to assign a numeric value to each outcome and describe an event in terms of that numeric value. A random variable \(X(\omega)\), often abbreviated \(X\), is just a function that assigns a numeric value to each possible outcome \(\omega\).

8.1 Discrete Random Variables

Definition 8.1 (Random variable) Let \(\Omega\) be a sample space of an experiment. Then, a random variable \(X\) is a function from the sample space \(\Omega\) to the real numbers \(\mathbb{R}\). In other words, for each outcome \(\omega \in \Omega\), \(X(\omega)\) is a real number.

Example 8.1 (Heads and tails) Suppose a fair coin is tossed three times. The sample space \(\Omega\) is shown below. We can define different random variables on this sample space, depending on what quantity we are interested in:

  • \(\X = \text{the number of heads}\)
  • \(\Y = \text{the number of tails}\)
Figure 8.1: Two random variables defined on the sample space for three tosses of a fair coin.

That is, the random variable \(\X\) is defined as: \[ \X(\omega) = \begin{cases} 0 & \omega \in \{ \text{TTT} \} \\ 1 & \omega \in \{ \text{HTT}, \text{THT}, \text{TTH} \} \\ 2 & \omega \in \{ \text{HHT}, \text{THH}, \text{HTH} \} \\ 3 & \omega \in \{ \text{HHH} \} \\ \end{cases}, \]

while the random variable \(\Y\) is defined as: \[ \Y(\omega) = \begin{cases} 0 & \omega \in \{ \text{HHH} \} \\ 1 & \omega \in \{ \text{HHT}, \text{THH}, \text{HTH} \} \\ 2 & \omega \in \{ \text{HTT}, \text{THT}, \text{TTH} \} \\ 3 & \omega \in \{ \text{TTT} \} \\ \end{cases}, \]

It is usually easier to express events of interest in terms of the random variables we define. For example, we can express the event \(\{ \text{more heads than tails} \}\) in a number of ways, such as:

  • \(\{ \X \geq 2 \}\) (which is shorthand for \(\{ \omega: \X(\omega) \geq 2 \}\))
  • \(\{ \Y \leq 1 \}\)
  • \(\{ \X > \Y \}\)

So instead of writing \(P(\text{more heads than tails})\), we could instead write \(P(\X \geq 2)\) or \(P(\X > \Y)\).
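If you want to check these events concretely, here is a short Python sketch (an illustration, not part of the original example) that enumerates the eight outcomes, evaluates \(\X\) and \(\Y\) on each one, and counts outcomes to confirm that \(P(\X \geq 2) = P(\X > \Y) = 1/2\).

```python
# A minimal sketch (not from the text): enumerate the 8 outcomes of three
# tosses and evaluate the random variables X and Y on each outcome.
from itertools import product

omega = ["".join(toss) for toss in product("HT", repeat=3)]  # 8 equally likely outcomes

def X(w):
    return w.count("H")  # number of heads

def Y(w):
    return w.count("T")  # number of tails

# Probabilities are just counts divided by 8.
p_more_heads = sum(1 for w in omega if X(w) >= 2) / len(omega)
p_x_gt_y = sum(1 for w in omega if X(w) > Y(w)) / len(omega)
print(p_more_heads, p_x_gt_y)  # 0.5 0.5 -- the same event, expressed two ways
```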

In general, there are many random variables that we could define on any sample space, and the “right” random variable is the one that helps us solve our problem.

Example 8.2 (Roulette) A classic casino game is roulette. An American roulette wheel consists of 38 pockets, numbered 1-36, 0, and 00, as seen in the picture below.

An American roulette wheel (left) and the betting table (right).

Once all the bets are placed, a ball is spun around the roulette wheel, which eventually settles into one of the 38 pockets.

The sample space for roulette consists of the 38 pockets that the ball can land in: \[\Omega = \left\{ 0, 00, 1, 2, \dots, 36 \right\}. \]

What is the right random variable for this experiment? It depends on what bet we make.

Suppose we place a $1 bet on our favorite number, 23. This is called a straight-up bet. If the ball does land in the 23 pocket, we win $35; otherwise we lose the $1 we wagered. We might be interested in the random variable \(S\), the profit from this straight-up bet. Note that \[ S(\omega) = \begin{cases} 35 & \omega = 23 \\ -1 & \omega \neq 23 \end{cases} \tag{8.1}\]

The probability that you win with a straight-up bet is only \[ P(S > 0) = \frac{1}{38}. \tag{8.2}\]

Another bet with a higher probability of winning is a bet on reds. If we place a $1 bet on red and the ball lands in any one of the 18 red pockets, we win $1; otherwise, we lose the $1 we wagered. If \(R\) is the random variable representing the profit from a $1 bet on reds, then \[ R(\omega) = \begin{cases} 1, & \omega \in \{ 1, 3, 5, 7, 9, 12, 14, 16, 18, 19, 21, 23, 25, 27, 30, 32, 34, 36\} \\ -1, & \omega \in \{ 0, 00, 2, 4, 6, 8, 10, 11, 13, 15, 17, 20, 22, 24, 26, 28, 29, 31, 33, 35 \}\end{cases} \tag{8.3}\] We see that the probability of winning with a bet on reds is \[P(R > 0) = \frac{18}{38}. \tag{8.4}\] There are more chances to win, but the amount we win is smaller.
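To make the two bets concrete, here is a small Python sketch (an illustration, not part of the text) that encodes \(S\) and \(R\) as functions on the 38 pockets and recovers the winning probabilities in Equation 8.2 and Equation 8.4 by counting.

```python
# A sketch (not from the text) encoding the two bets as functions on the 38 pockets.
reds = {1, 3, 5, 7, 9, 12, 14, 16, 18, 19, 21, 23, 25, 27, 30, 32, 34, 36}
pockets = ["0", "00"] + [str(i) for i in range(1, 37)]  # 38 equally likely pockets

def S(w):
    # profit from a $1 straight-up bet on 23
    return 35 if w == "23" else -1

def R(w):
    # profit from a $1 bet on reds
    return 1 if w not in ("0", "00") and int(w) in reds else -1

print(sum(1 for w in pockets if S(w) > 0) / 38)  # 1/38  (Equation 8.2)
print(sum(1 for w in pockets if R(w) > 0) / 38)  # 18/38 (Equation 8.4)
```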

All of the random variables we have encountered so far have had a finite set of possible values. For example, \(\X\) and \(\Y\) could only assume the values \(\{ 0, 1, 2, 3 \}\), while \(S\) could only assume the values \(\{ -1, 35 \}\). These are all examples of discrete random variables. Later, we will see random variables that can assume any value in an interval.

Definition 8.2 (Discrete random variable) A random variable \(X\) is said to be discrete if there is a finite list of values \(x_1, \dots, x_n\) or a countable list of values \(x_1, x_2, \dots\) such that \(\displaystyle \sum_i P(X = x_i) = 1\).

8.2 Probability Mass Function

Definition 8.3 (PMF of a discrete random variable) The probability mass function (or PMF) of a discrete random variable \(X\) is the function \(f_X\) defined as \(f_X(x) = P(X = x)\). Note that \(f_X\) is a nonnegative function.

Example 8.3 (PMF of the number of heads and the number of tails) In Example 8.1, we defined two random variables:

  • \(\X\), the number of heads in three tosses of a fair coin, and
  • \(\Y\), the number of tails in three tosses of a fair coin.

Because each of the \(8\) outcomes in the sample space is equally likely, we can calculate the PMF by simply counting the number of outcomes corresponding to each value and dividing by \(8\).

For example, we can calculate the PMF of \(\X\) as follows:

  • \(f_{\X}(0) = P(\X = 0) = \frac{1}{8}\)
  • \(f_{\X}(1) = P(\X = 1) = \frac{3}{8}\)
  • \(f_{\X}(2) = P(\X = 2) = \frac{3}{8}\)
  • \(f_{\X}(3) = P(\X = 3) = \frac{1}{8}\)

It is common to lay these probabilities out in a table.

\(x\) \(0\) \(1\) \(2\) \(3\)
\(f_{\X}(x)\) \(1/8\) \(3/8\) \(3/8\) \(1/8\)

The PMF of \(\X\) is graphed in Figure 8.2.

Figure 8.2: Visualization of the PMF of \(X\)

What about the PMF of \(\Y\)? Verify for yourself that it is the same!

\(x\) \(0\) \(1\) \(2\) \(3\)
\(f_{\Y}(x)\) \(1/8\) \(3/8\) \(3/8\) \(1/8\)

The fact that \(\X\) and \(\Y\) have the same PMF does not mean that \(\X = \Y\). That would mean that the number of heads is always equal to the number of tails. But in fact, this is impossible when a coin is tossed 3 times! There is no outcome \(\omega\) for which \(\X(\omega) = \Y(\omega)\), so \[ P(\X = \Y) = 0. \]
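As a quick check, the following Python sketch (an illustration under the same equally-likely-outcomes assumption as Example 8.1) tabulates the PMFs of \(\X\) and \(\Y\) by counting outcomes and confirms that the two PMFs coincide even though no outcome satisfies \(\X(\omega) = \Y(\omega)\).

```python
# A sketch under the same assumptions as Example 8.1: tabulate the PMFs of
# X and Y by counting outcomes, then check that X never equals Y.
from collections import Counter
from fractions import Fraction
from itertools import product

omega = ["".join(t) for t in product("HT", repeat=3)]
n = len(omega)  # 8 equally likely outcomes

pmf_X = {x: Fraction(c, n) for x, c in Counter(w.count("H") for w in omega).items()}
pmf_Y = {y: Fraction(c, n) for y, c in Counter(w.count("T") for w in omega).items()}

print(pmf_X == pmf_Y)                                    # True: identical PMFs
print(sum(w.count("H") == w.count("T") for w in omega))  # 0: no outcome with X = Y
```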

Example 8.4 (PMF of the profit from a roulette bet) In Example 8.2, we introduced two random variables, \(S\) and \(R\), which represented one’s profits from $1 bets on 23 and reds, respectively.

The PMF of \(S\) is

\(x\) \(-1\) \(35\)
\(f_{S}(x)\) \(37/38\) \(1/38\)

and the PMF of \(R\) is

\(x\) \(-1\) \(1\)
\(f_{R}(x)\) \(20/38\) \(18/38\)

Notice that the probabilities in any PMF always add up to \(1\). This is because the events of the form \(\{ X = x_i \}\) form a partition of the sample space.

In order for a function \(f(x)\) to be a valid PMF, it must satisfy two properties:

  1. \(f(x) \geq 0\) for any \(x\), and
  2. there exist values \(x_1, x_2, \dots\) such that \(\displaystyle \sum_i f(x_i) = 1\).

In fact, any function \(f\) satisfying these two properties is the PMF of some random variable.
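As an illustration, here is a small hypothetical helper (not from the text) that checks these two properties for a PMF represented as a dictionary mapping values to probabilities.

```python
# A hypothetical helper (not from the text) that checks the two properties
# of a valid PMF, for a PMF given as a dict mapping values to probabilities.
def is_valid_pmf(f, tol=1e-12):
    nonnegative = all(p >= 0 for p in f.values())
    sums_to_one = abs(sum(f.values()) - 1) < tol
    return nonnegative and sums_to_one

print(is_valid_pmf({-1: 37/38, 35: 1/38}))  # True (the PMF of S from Example 8.4)
print(is_valid_pmf({0: 0.5, 1: 0.6}))       # False (probabilities sum to 1.1)
```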

8.3 Cumulative Distribution Function

The PMF of a discrete random variable specifies the probability that the random variable is equal to a given value.

There is a related function, the cumulative distribution function (or CDF), which specifies the probability that the random variable is less than or equal to a given value. We can calculate the CDF by summing the PMF over the appropriate values.

Definition 8.4 (CDF of a discrete random variable) The cumulative distribution function (or CDF) of a random variable \(X\) with PMF \(f\) is the function \(F\) defined as \[ F(x) = P(X \leq x) = \sum_{t \leq x} f(t). \]

Example 8.5 (CDF of the number of heads) Using the PMF of \(\X\) that we derived in Example 8.3, we can calculate the CDF of \(\X\) to be: \[ \begin{aligned} F_{\X}(x) &= \begin{cases} 0 & x < 0 \\ \frac{1}{8} & 0 \leq x < 1 \\ \frac{1}{8} + \frac{3}{8} & 1 \leq x < 2 \\ \frac{1}{8} + \frac{3}{8} + \frac{3}{8} & 2 \leq x < 3 \\ \frac{1}{8} + \frac{3}{8} + \frac{3}{8} + \frac{1}{8} & x \geq 3 \end{cases} \\ &= \begin{cases} 0 & x < 0 \\ \displaystyle \frac{1}{8} & 0 \leq x < 1 \\ \displaystyle \frac{1}{2} & 1 \leq x < 2 \\ \displaystyle \frac{7}{8} & 2 \leq x < 3 \\ 1 & x \geq 3 \end{cases}. \end{aligned} \tag{8.5}\]

This CDF is graphed in Figure 8.3.

Figure 8.3: Visualization of the CDF of \(X\)

We can use the CDF to quickly evaluate probabilities. For example, to determine the probability that we get more heads than tails, \(P(\X \geq 2)\), we can use Equation 8.5 to obtain the answer with almost no calculation: \[ P(\X \geq 2) = 1 - P(\X \leq 1) = 1 - F_{\X}(1) = 1 - \frac{1}{2} = \frac{1}{2}. \]
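The same bookkeeping is easy to do in code. The sketch below (an illustration using the PMF from Example 8.3) builds \(F_{\X}\) by summing the PMF over values \(t \leq x\) and then computes \(P(\X \geq 2) = 1 - F_{\X}(1)\).

```python
# A sketch using the PMF from Example 8.3: build the CDF as a sum over the
# support and use it to evaluate P(X >= 2).
pmf = {0: 1/8, 1: 3/8, 2: 3/8, 3: 1/8}

def cdf(x):
    # F(x) = P(X <= x): sum the PMF over all values t <= x
    return sum(p for t, p in pmf.items() if t <= x)

print(cdf(1))      # 0.5
print(1 - cdf(1))  # 0.5 = P(X >= 2), the probability of more heads than tails
```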

Example 8.5 suggests several properties of a CDF:

  1. It is non-decreasing.
  2. As \(x\) approaches \(-\infty\), \(F(x)\) approaches \(0\).
  3. As \(x\) approaches \(\infty\), \(F(x)\) approaches \(1\).

There is also a simple way to obtain the PMF of a discrete random variable if we already know its CDF.

Example 8.6 (Recovering the PMF from the CDF) Suppose \(\X\) is a random variable with the CDF \(F(x)\) given in Equation 8.5.

Notice that the CDF jumps at the values \(x=0, 1, 2, 3\). For example, the size of the jump at \(x=2\) is the difference between \[ F_{\X}(2.1) = P(\X \leq 2.1) = f(0) + f(1) + f(2) \] and \[ F_{\X}(1.9) = P(\X \leq 1.9) = f(0) + f(1), \] which is \(f(2)\), the PMF evaluated at \(x = 2\).

In other words, the size of each jump at \(x\) is precisely the PMF evaluated at \(x\), \(f(x)\). So we can determine the value of \(f(x)\) by calculating the size of each jump at \(x\).

  • At \(x = 0\), the CDF jumps from \(0\) to \(1/8\), so \(f(0) = 1/8\).
  • At \(x = 1\), the CDF jumps from \(1/8\) to \(1/2\), so \(f(1) = 1/2 - 1/8 = 3/8\).
  • At \(x = 2\), the CDF jumps from \(1/2\) to \(7/8\), so \(f(2) = 7/8 - 1/2 = 3/8\).
  • At \(x = 3\), the CDF jumps from \(7/8\) to \(1\), so \(f(3) = 1/8\).

This matches the PMF of \(\X\) that we derived in Example 8.3.
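The jump-measuring procedure translates directly into code. The sketch below (an illustration using the CDF values from Equation 8.5) recovers the PMF by differencing the CDF at its jump points.

```python
# A sketch using the CDF values from Equation 8.5: recover the PMF by
# measuring the jump in F at each support point.
cdf_at = {0: 1/8, 1: 1/2, 2: 7/8, 3: 1.0}  # F(x) at x = 0, 1, 2, 3

pmf = {}
previous = 0.0  # F(x) is 0 to the left of the smallest support point
for x in sorted(cdf_at):
    pmf[x] = cdf_at[x] - previous  # size of the jump at x
    previous = cdf_at[x]

print(pmf)  # {0: 0.125, 1: 0.375, 2: 0.375, 3: 0.125}
```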

8.4 Bernoulli and Binomial Random Variables

Suppose we have a coin whose probability of coming up heads is \(p\) for some \(0 \leq p \leq 1\). Then, we can toss the coin once and define a random variable \(X\) as the number of heads in the toss. In other words, if \(X = 1\), the coin came up heads, and if \(X = 0\), the coin came up tails.

Then, \(P(X = 1) = p\) and \(P(X = 0) = 1 - p\).

Definition 8.5 (Bernoulli random variable) If \(X\) is a random variable with the PMF \[ f_X(x) = \begin{cases} p, & x = 1 \\ 1-p, & x = 0 \end{cases} \] for some \(0 \leq p \leq 1\), \(X\) is said to be a Bernoulli random variable with parameter \(p\). We use the notation \(X \sim \text{Bernoulli}(p)\).
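A Bernoulli random variable is easy to simulate: draw a uniform random number and report \(1\) if it falls below \(p\). The sketch below (an illustration; the value \(p = 0.25\) is an arbitrary choice) checks that the empirical frequency of \(1\)s is close to \(p\).

```python
# A sketch simulating Bernoulli(p) draws; p = 0.25 is an arbitrary choice.
import random

def bernoulli(p):
    # returns 1 with probability p and 0 with probability 1 - p
    return 1 if random.random() < p else 0

p = 0.25
draws = [bernoulli(p) for _ in range(100_000)]
print(sum(draws) / len(draws))  # approximately 0.25
```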

Suppose we take the same coin and toss it \(n\) times, with the outcome of each toss being independent of the other tosses. Let us define a random variable \(X\) that counts the number of heads in the \(n\) tosses. The possible values of \(X\) are \(0, 1, \dots, n\).

Now, for some integer \(0 \leq k \leq n\), the event \(\left\{ X = k \right\}\) means we get \(k\) heads (and thus \(n-k\) tails) in the \(n\) tosses. There are \[ \binom{n}{k} \] configurations of \(k\) heads and \(n-k\) tails in \(n\) tosses – we can think of it as choosing which \(k\) of the \(n\) tosses will be assigned heads (thereby assigning tails to the \(n-k\) tosses not chosen). Each such configuration of \(k\) heads and \(n-k\) tails has the same probability \[ p^k (1-p)^{n-k} \] of occurring.

Thus, we see that \[ P(X = k) = \binom{n}{k} p^k (1-p)^{n-k}. \]
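For small \(n\), we can verify this counting argument by brute force: sum \(p^k (1-p)^{n-k}\) over every sequence of heads and tails with exactly \(k\) heads and compare against the formula. The Python sketch below (with arbitrary illustrative values of \(n\), \(p\), and \(k\)) does exactly that.

```python
# A brute-force check of the counting argument (small n only): sum the
# probability p^k (1-p)^(n-k) over every sequence with exactly k heads.
from itertools import product
from math import comb

n, p, k = 5, 0.3, 2  # arbitrary illustrative values

by_enumeration = sum(
    p ** seq.count("H") * (1 - p) ** seq.count("T")
    for seq in ("".join(s) for s in product("HT", repeat=n))
    if seq.count("H") == k
)
by_formula = comb(n, k) * p**k * (1 - p) ** (n - k)
print(abs(by_enumeration - by_formula) < 1e-12)  # True
```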

Definition 8.6 (Binomial distribution) If \(X\) is a random variable with PMF \[ f_X(k) = \binom{n}{k} p^k (1-p)^{n-k}, \qquad k = 0, 1, \dots, n, \tag{8.6}\] for some positive integer \(n\) and \(0 < p < 1\), \(X\) is said to be a binomial random variable with parameters \(n\) and \(p\). We use the notation \(X \sim \text{Binomial}(n,p)\).

Example 8.7 (Coin tosses as binomial) Recall Example 8.3, where we tossed a fair coin three times and \(\X\) was the number of heads.

Since each toss is independent, with probability \(p = 1/2\) of landing heads, we see that \(\X\) matches the template for a \(\text{Binomial}(n=3, p=1/2)\) distribution.

Therefore, its PMF can be written as the formula: \[ \begin{aligned} f(k) &= {3 \choose k} (1/2)^k (1 - 1/2)^{3-k}, & 0 \leq k \leq 3. \end{aligned} \tag{8.7}\]

By plugging in \(k=0, 1, 2, 3\) into Equation 8.7, we can verify that this formula gives the same probabilities that we calculated in Example 8.3.

  • \(\displaystyle f(0) = {3 \choose 0} (1/2)^0 (1 - 1/2)^{3-0} = \frac{1}{8}\)
  • \(\displaystyle f(1) = {3 \choose 1} (1/2)^1 (1 - 1/2)^{3-1} = \frac{3}{8}\)
  • \(\displaystyle f(2) = {3 \choose 2} (1/2)^2 (1 - 1/2)^{3-2} = \frac{3}{8}\)
  • \(\displaystyle f(3) = {3 \choose 3} (1/2)^3 (1 - 1/2)^{3-3} = \frac{1}{8}\)
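If you prefer to let software do the arithmetic, the same four probabilities can be obtained from a library binomial PMF; the sketch below assumes SciPy is available (`math.comb` would work just as well).

```python
# Assumes SciPy is installed; binom.pmf evaluates Equation 8.7 directly.
from scipy.stats import binom

print(binom.pmf([0, 1, 2, 3], n=3, p=0.5))  # [0.125 0.375 0.375 0.125]
```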

Even though we motivated the Bernoulli and binomial distributions using coin tosses, these distributions are applicable to far more settings. In order to recognize these applications, we have to map the particular situation onto coin tosses. The following example illustrates how this is done.

Example 8.8 (Phenylketonuria) Phenylketonuria (PKU) is an inherited genetic disorder in humans that results in a decreased ability to process the amino acid phenylalanine. As a result, people with PKU avoid foods that are high in phenylalanine, including egg whites, chicken breast, and the artificial sweetener aspartame. (Many diet sodas have explicit warnings for people with PKU.)

Consider a couple who are both carriers for PKU. That is, they carry the gene for PKU, but they do not have PKU. Mendelian genetics predicts that each of their children has a 25% chance of being born with PKU.

If this couple has 5 children (with no twins), then \(X\), the number of them who have PKU, is a \(\textrm{Binomial}(n=5, p=.25)\) random variable. To see why, make the analogy with coin tossing:

  • A coin is tossed \(n=5\) times, once for each of the children.
  • A “heads” means that the child has PKU. This probability is \(p=.25\).
  • Because there are no twins, the children’s genetics are independent. So this situation really is like tossing a coin.

Now that we have established that \(X\) is binomial, we can immediately write down its PMF:

\[ f(x) = \binom{5}{x} (.25)^x (1 - .25)^{5-x}; \qquad x=0,1,2,3,4,5. \]

We can plug values into this PMF to obtain the probabilities:

\(x\) \(0\) \(1\) \(2\) \(3\) \(4\) \(5\)
\(f(x)\) \(243/1024\) \(405/1024\) \(270/1024\) \(90/1024\) \(15/1024\) \(1/1024\)
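These probabilities can be reproduced exactly with a few lines of Python; the sketch below (an illustration of the \(\text{Binomial}(5, .25)\) model above) uses exact fractions, which print in lowest terms.

```python
# A sketch of the Binomial(n=5, p=1/4) model above, using exact fractions.
from fractions import Fraction
from math import comb

n, p = 5, Fraction(1, 4)
pmf = {x: comb(n, x) * p**x * (1 - p) ** (n - x) for x in range(n + 1)}
print({x: str(q) for x, q in pmf.items()})
# {0: '243/1024', 1: '405/1024', 2: '135/512', 3: '45/512', 4: '15/1024', 5: '1/1024'}
# (Fractions print in lowest terms: 270/1024 = 135/512 and 90/1024 = 45/512.)
```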

We conclude with a probability puzzler that engaged one of the greatest mathematicians of all time.

Example 8.9 (The Newton-Pepys problem) In 1693, Isaac Newton and Samuel Pepys, an English diarist, corresponded about the following problem. Pepys wrote Newton a letter asking which of the following three events has the greatest chance of occurring:

  A. Six fair dice are tossed independently and at least one six appears.
  B. Twelve fair dice are tossed independently and at least two sixes appear.
  C. Eighteen fair dice are tossed independently and at least three sixes appear.

Pepys thought that option C was the most likely and wanted Newton to confirm his intuition. We can compute the three probabilities ourselves.

In scenario A, let \(X\) denote the number of sixes in six tosses. Then, \(X \sim \text{Binomial}(6,1/6)\). The desired probability is \[ P(X \geq 1) = 1 - P(X = 0) = 1 - \binom{6}{0} \left( \frac{1}{6} \right)^0 \left( \frac{5}{6} \right)^6 \approx 0.6651. \] In scenario B, let \(Y\) denote the number of sixes in twelve tosses. Then, \(Y \sim \text{Binomial}(12,1/6)\). The desired probability is \[\begin{align*} P(Y \geq 2) &= 1 - P(Y = 0) - P(Y = 1) \\ &= 1 - \binom{12}{0} \left( \frac{1}{6} \right)^0 \left( \frac{5}{6} \right)^{12} - \binom{12}{1} \left( \frac{1}{6} \right)^1 \left( \frac{5}{6} \right)^{11} \\ &\approx 0.6187. \end{align*}\] In scenario C, let \(Z\) denote the number of sixes in eighteen tosses. Then, \(Z \sim \text{Binomial}(18,1/6)\). The desired probability is \[\begin{align*} P(Z \geq 3) &= 1 - P(Z = 0) - P(Z = 1) - P(Z = 2) \\ &= 1 - \binom{18}{0} \left( \frac{1}{6} \right)^0 \left( \frac{5}{6} \right)^{18} - \binom{18}{1} \left( \frac{1}{6} \right)^1 \left( \frac{5}{6} \right)^{17} - \binom{18}{2} \left( \frac{1}{6} \right)^2 \left( \frac{5}{6} \right)^{16} \\ &\approx 0.5973. \end{align*}\]
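The three probabilities are also easy to reproduce with a library binomial distribution; the sketch below assumes SciPy, whose survival function `sf(k)` returns \(P(X > k)\).

```python
# Assumes SciPy; binom.sf(k, n, p) returns P(X > k) for X ~ Binomial(n, p).
from scipy.stats import binom

p_A = binom.sf(0, n=6, p=1/6)   # P(X >= 1), X ~ Binomial(6, 1/6)
p_B = binom.sf(1, n=12, p=1/6)  # P(Y >= 2), Y ~ Binomial(12, 1/6)
p_C = binom.sf(2, n=18, p=1/6)  # P(Z >= 3), Z ~ Binomial(18, 1/6)
print(round(p_A, 4), round(p_B, 4), round(p_C, 4))  # 0.6651 0.6187 0.5973
```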

Newton calculated these probabilities and correctly concluded that scenario A is the most likely. He also offered the intuition that, if scenarios B and C are imagined as two and three groups of six tosses, respectively, scenario A requires a six in only one group of six tosses, whereas scenario B requires a six in each of its two groups. However, this reasoning is flawed: it is possible to get two sixes in twelve rolls by getting both sixes in the first six tosses and none in the second six tosses!