In Example 37.1, we saw that the MLE for the rate parameter \(\lambda\) from i.i.d. observations \(X_1, \dots, X_n \sim \text{Exponential}(\lambda)\), \[ \hat\lambda = \frac{1}{\bar X}, \] was positively biased for \(\lambda\). We also determined that \(\hat\lambda\) is consistent for \(\lambda\), so this bias decreases to \(0\) as \(n\to\infty\). But what is the exact value of the bias? In order to answer this question, we need to determine the exact distribution of \(\bar X\), the sample mean of i.i.d. exponential random variables.
Since the sample mean \(\bar X\) is related to the sample sum \(S_n = \sum_{i=1}^n X_i\) by a scaling, we can equivalently determine the distribution of a sum of i.i.d. exponential random variables.
Of course, \(S_1 = X_1\), which is \(\text{Exponential}(\lambda)\) and has PDF \(f(x) = \lambda e^{-\lambda x}\) for \(x > 0\). We know further from Example 33.4 that:
\(S_2 = X_1 + X_2\) has PDF \(f(x) = \lambda^2 x e^{-\lambda x}\) for \(x > 0\).
\(S_3 = X_1 + X_2 + X_3\) has PDF \(f(x) = \frac{\lambda^3}{2} x^2 e^{-\lambda x}\) for \(x > 0\).
Based on these three examples, we might conjecture that \(S_n\) has PDF \[
f(x) = \frac{\lambda^n}{(n-1)!} x^{n-1} e^{-\lambda x}; \qquad x > 0.
\] This is in fact the case as the next proposition shows.
Proposition 38.1 (Distribution of a sum of exponentials) Let \(X_1, \dots, X_n\) be i.i.d. \(\text{Exponential}(\lambda)\). The PDF of \(S_n \overset{\text{def}}{=}\sum_{i=1}^n X_i\) is \[
f(x) = \frac{\lambda^n}{(n-1)!} x^{n-1} e^{-\lambda x}; x > 0.
\tag{38.1}\]
Proof
Equation 38.1 can be proved in a number of ways. For example, Exercise 33.6 asks you to prove it by induction. Here is an alternative approach using moment generating functions.
First, we determine the MGF corresponding to Equation 38.1. Assuming that \(f(x)\) is indeed a valid PDF (i.e., it integrates to \(1\)), the MGF is \[
\begin{align}
\int_0^\infty e^{tx} \cdot \frac{\lambda^n}{(n-1)!} x^{n-1} e^{-\lambda x}\,dx &= \lambda^n \int_0^\infty \frac{1}{(n-1)!} x^{n-1} e^{-(\lambda - t) x}\,dx \\
&= \frac{\lambda^n}{(\lambda - t)^n} \underbrace{\int_0^\infty \frac{(\lambda - t)^n}{(n-1)!} x^{n-1} e^{-(\lambda - t) x}\,dx}_1 \\
&= \left(\frac{\lambda}{\lambda - t}\right)^n,
\end{align}
\] for \(t < \lambda\) (since otherwise the integral diverges).
On the other hand, the MGF of \(S_n\), as the sum of i.i.d. exponential random variables, must be \[
\begin{align*}
M_{S_n}(t) = \text{E}\!\left[ e^{tS_n} \right] = \text{E}\!\left[ e^{t(X_1 + \dots + X_n)} \right] = \text{E}\!\left[ e^{tX_1} \right] \cdots \text{E}\!\left[ e^{tX_n} \right] &= M_{X_1}(t)^n \\
&= \left( \frac{\lambda}{\lambda - t} \right)^n
\end{align*}
\] for \(t < \lambda\), where we use the MGF of the exponential distribution that we derived in Example 34.3. Since the two MGFs agree, the PDF of \(S_n\) must be given by Equation 38.1, by the uniqueness of MGFs (Theorem 34.1).
The distribution of \(S_n\) is called the \(\text{Gamma}(n, \lambda)\) distribution, with PDF given by Equation 38.1. Because we know that it arose from the sum of \(n\) i.i.d. exponentials, we can immediately conclude the following about the gamma distribution:
In the special case \(n = 1\), it reduces to the \(\text{Exponential}(\lambda)\) distribution.
Its expectation is \(\text{E}\!\left[ S_n \right] = n\text{E}\!\left[ X_1 \right] = \frac{n}{\lambda}\).
Its variance is \(\text{Var}\!\left[ S_n \right] = n\text{Var}\!\left[ X_1 \right] = \frac{n}{\lambda^2}\).
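As a quick sanity check (not part of the original derivation), we can simulate sums of exponentials in R and compare them against the claimed \(\text{Gamma}(n, \lambda)\) density; the values of \(n\), \(\lambda\), and the number of replications below are arbitrary choices for illustration.

```r
# Simulation check of Proposition 38.1 (illustrative parameter values)
set.seed(1)
n <- 5; lambda <- 2
sums <- replicate(10000, sum(rexp(n, rate = lambda)))

mean(sums)  # should be close to n / lambda = 2.5
var(sums)   # should be close to n / lambda^2 = 1.25

# Overlay the Gamma(n, lambda) density on the empirical distribution
hist(sums, breaks = 50, freq = FALSE)
curve(dgamma(x, shape = n, rate = lambda), add = TRUE, lwd = 2)
```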
The gamma distribution is commonly used to model random variables that take on non-negative values, such as house prices. Suppose we wish to fit a gamma model to a data set of 100 home sales (in thousands of dollars) in Gainesville, Florida (Agresti 2015). Two candidates are \(\text{Gamma}(n=3, \lambda=.021)\) and \(\text{Gamma}(n=4, \lambda=.021)\), shown in blue and orange, respectively.
Unfortunately, \(n=3\) is too far to the left, and \(n=4\) is too far to the right. Is it possible to pick a value of \(n\) in between?
At first glance, the answer appears to be no. The gamma PDF (Equation 38.1) contains \((n - 1)!\), and factorials are only defined for integers. But what if we could generalize the factorial to non-integer values? The next definition is the key.
Definition 38.1 (Gamma function) The gamma function \(\Gamma(\alpha)\) is defined by \[
\Gamma(\alpha) = \int_0^\infty t^{\alpha - 1} e^{-t} \, dt
\tag{38.2}\] for any \(\alpha > 0\).
We can evaluate \(\Gamma(\alpha)\) for particular values of \(\alpha\):
\(\Gamma(1) = \int_0^\infty e^{-t}\,dt = 1\), since this is the total probability under a standard exponential PDF.
\(\Gamma(2) = \int_0^\infty t e^{-t}\,dt = 1\), since this is \(\text{E}\!\left[ Z \right]\) for a standard exponential random variable \(Z\).
\(\Gamma(3) = \int_0^\infty t^2 e^{-t}\,dt = 2\), since this is \(\text{E}\!\left[ Z^2 \right]\) for a standard exponential random variable \(Z\).
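In R, the gamma function is available as gamma(). As an illustration (not from the text), we can confirm the values above and check Definition 38.1 by numerical integration for one non-integer argument.

```r
# The gamma function is built into R
gamma(c(1, 2, 3))   # 1 1 2, matching the values computed above
gamma(4.5)          # a "non-integer factorial", about 11.63

# Check Equation 38.2 by numerical integration for alpha = 4.5
alpha <- 4.5
integrate(function(t) t^(alpha - 1) * exp(-t), lower = 0, upper = Inf)$value
gamma(alpha)        # should agree up to numerical error
```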
The next result shows that the gamma function generalizes the factorial to non-integer values.
Theorem 38.1 (Gamma function generalizes the factorial) The gamma function (Equation 38.2) satisfies the recursion \[
\Gamma(\alpha + 1) = \alpha\Gamma(\alpha),
\tag{38.3}\]
which implies that for positive integer values of \(n\), \[\Gamma(n) = (n - 1)! \tag{38.4}\]
Proof
To establish the recursion Equation 38.3, we use integration by parts with \(u = t^\alpha\) and \(dv = e^{-t}\,dt\): \[
\Gamma(\alpha + 1) = \int_0^\infty t^{\alpha} e^{-t}\,dt = \Big[ -t^{\alpha} e^{-t} \Big]_0^\infty + \alpha \int_0^\infty t^{\alpha - 1} e^{-t}\,dt = \alpha\Gamma(\alpha).
\]
Since \(\Gamma(1) = 1\), applying Equation 38.3 repeatedly gives \[\Gamma(n) = (n-1)\Gamma(n-1) = (n-1)(n-2)\Gamma(n-2) = \dots = (n-1)! \, \Gamma(1) = (n-1)!\] for positive integers \(n\), which is Equation 38.4.
Figure 38.1 shows the gamma function on top of \((n - 1)!\) (for integer values of \(n\)). Notice how the gamma function connects the dots, allowing us to make sense of non-integer factorials, like \(3.5!\).
Figure 38.1: Gamma function vs. \((n - 1)!\)
Therefore, we can replace \((n-1)!\) in Equation 38.1 with \(\Gamma(n)\) and generalize the gamma distribution to non-integer parameters. As a reminder that \(n\) need not be an integer, we will rename this parameter to \(\alpha\).
Definition 38.2 (Gamma distribution) A random variable \(X\) is said to follow a \(\text{Gamma}(\alpha, \lambda)\) distribution if its PDF is \[
f_X(x) = \begin{cases} \displaystyle\frac{\lambda^\alpha}{\Gamma(\alpha)} x^{\alpha - 1} e^{-\lambda x} & x > 0 \\ 0 & \text{otherwise} \end{cases},
\] where \(\alpha > 0\) parametrizes the shape and \(\lambda > 0\) parametrizes the rate.
When the shape parameter \(\alpha\) is not an integer, the gamma distribution does not have a natural interpretation (unlike for integer values of \(\alpha\), where it can be interpreted as the sum of i.i.d. exponentials). Nevertheless, non-integer shape parameters afford us flexibility in modeling.
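For instance (a sketch, not the text's original figure code), we can use R's dgamma() to plot the two candidate densities from the housing example together with an intermediate, non-integer shape; the choice \(\alpha = 3.5\) here is purely illustrative.

```r
# Gamma densities with integer and non-integer shapes
# (rate 0.021 as in the housing example; alpha = 3.5 is an illustrative intermediate choice)
curve(dgamma(x, shape = 3, rate = 0.021), from = 0, to = 500, lwd = 2,
      xlab = "price (thousands of dollars)", ylab = "density")
curve(dgamma(x, shape = 3.5, rate = 0.021), add = TRUE, lwd = 2, lty = 2)
curve(dgamma(x, shape = 4, rate = 0.021), add = TRUE, lwd = 2, lty = 3)
legend("topright", legend = c("shape 3", "shape 3.5", "shape 4"), lty = 1:3, lwd = 2)
```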
Next, we calculate formulas for the expected value and variance. Previously, we obtained these formulas for integer shape parameters, by representing the gamma random variable as a sum of i.i.d. exponentials and appealing to linearity of expectation. However, to prove these formulas for general gamma distributions, we have to evaluate an integral.
Proposition 38.2 (Expectation and variance of gamma) Let \(X\) be a \(\text{Gamma}(\alpha, \lambda)\) random variable.
\[
\begin{aligned}
\text{E}\!\left[ X \right] &= \frac{\alpha}{\lambda} & \text{Var}\!\left[ X \right] &= \frac{\alpha}{\lambda^2}.
\end{aligned}
\]
Proof
We compute the first moment:
\[\begin{align*}
\text{E}\!\left[ X \right] &= \int_0^\infty x \cdot \frac{\lambda^\alpha}{\Gamma(\alpha)} x^{\alpha - 1} e^{-\lambda x} \, dx \\
&= \frac{1}{\Gamma(\alpha)} \int_0^\infty (\lambda x)^\alpha e^{-\lambda x} \, dx \qquad \qquad (y = \lambda x) \\
&= \frac{1}{\Gamma(\alpha)} \int_0^\infty \frac{1}{\lambda} y^\alpha e^{-y} \, dy \\
&= \frac{1}{\lambda \Gamma(\alpha)} \int_0^\infty y^{(\alpha+1)-1} e^{-y} \, dy \\
&= \frac{1}{\lambda \Gamma(\alpha)} \cdot \Gamma(\alpha+1) \\
&= \frac{1}{\lambda \Gamma(\alpha)} \cdot \alpha \Gamma(\alpha) \\
&= \frac{\alpha}{\lambda}.
\end{align*}\] In the second-to-last step, we used the recursion property (Equation 38.3).
We can compute the second moment similarly: \[\begin{align*}
\text{E}\!\left[ X^2 \right] &= \int_0^\infty x^2 \cdot \frac{\lambda^\alpha}{\Gamma(\alpha)} x^{\alpha - 1} e^{-\lambda x} \, dx \\
&= \frac{1}{\lambda \Gamma(\alpha)} \int_0^\infty (\lambda x)^{\alpha + 1} e^{-\lambda x} \, dx \qquad \qquad (y = \lambda x) \\
&= \frac{1}{\lambda \Gamma(\alpha)} \int_0^\infty \frac{1}{\lambda} y^{\alpha + 1} e^{-y} \, dy \\
&= \frac{1}{\lambda^2 \Gamma(\alpha)} \int_0^\infty y^{(\alpha+2) - 1} e^{-y} \, dy \\
&= \frac{1}{\lambda^2 \Gamma(\alpha)} \cdot \Gamma(\alpha+2) \\
&= \frac{1}{\lambda^2 \Gamma(\alpha)} \cdot (\alpha+1) \alpha \Gamma(\alpha) \\
&= \frac{\alpha(\alpha+1)}{\lambda^2}.
\end{align*}\] In the second-to-last step, we used the recursion property (Equation 38.3) twice.
Thus, the variance is \[
\text{Var}\!\left[ X \right] = \frac{\alpha(\alpha+1)}{\lambda^2} - \frac{\alpha^2}{\lambda^2} = \frac{\alpha}{\lambda^2}.
\]
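These formulas are easy to spot-check by simulation; the non-integer shape and the rate below are arbitrary illustrative values.

```r
# Check E[X] = alpha / lambda and Var[X] = alpha / lambda^2 by simulation
set.seed(1)
alpha <- 2.7; lambda <- 1.3
x <- rgamma(100000, shape = alpha, rate = lambda)
c(mean(x), alpha / lambda)     # empirical vs. theoretical mean
c(var(x), alpha / lambda^2)    # empirical vs. theoretical variance
```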
Of course, we could have also derived the expectation and variance if we knew its moment generating function. The MGF is derived below; Exercise 38.1 asks you to rederive the expectation and variance using Equation 38.5.
Proposition 38.3 (MGF of gamma) Let \(X\) be a \(\text{Gamma}(\alpha, \lambda)\) random variable. Then, its MGF is
\[
M_X(t) = \left(\frac{\lambda}{\lambda - t}\right)^{\alpha}; t < \lambda.
\tag{38.5}\]
The MGF provides quick proofs of the following properties.
Proposition 38.4 (Properties of the gamma distribution) Let \(X\) and \(Y\) be independent \(\text{Gamma}(\alpha,\lambda)\) and \(\text{Gamma}(\beta,\lambda)\) random variables, respectively.
Then, for any \(a > 0\), \[
aX \sim \text{Gamma}\left(\alpha, \frac{\lambda}{a}\right)
\tag{38.6}\] and \[
X + Y \sim \text{Gamma}(\alpha + \beta, \lambda).
\tag{38.7}\]
Proof
The MGF of \(aX\) is \[\begin{align*}
M_{aX}(t) &= \text{E}\!\left[ e^{t \cdot aX} \right] \\
&= \text{E}\!\left[ e^{(at)X} \right] \\
&= M_X(at) \\
&= \left( \frac{\lambda}{\lambda - at} \right)^\alpha \\
&= \left( \frac{\lambda/a}{\lambda/a - t} \right)^\alpha,
\end{align*}\] which is the MGF of \(\text{Gamma}\left( \alpha, \frac{\lambda}{a} \right)\). Thus, \[
aX \sim \text{Gamma}\left( \alpha, \frac{\lambda}{a} \right)
\] by Theorem 34.1.
Next, the MGF of \(X + Y\) is \[\begin{align*}
M_{X+Y}(t) = M_X(t) M_Y(t) = \left( \frac{\lambda}{\lambda - t} \right)^\alpha \left( \frac{\lambda}{\lambda - t} \right)^\beta = \left( \frac{\lambda}{\lambda - t} \right)^{\alpha+\beta},
\end{align*}\] which is the MGF of \(\text{Gamma}(\alpha+\beta,\lambda)\). Thus, \[
X+Y \sim \text{Gamma}(\alpha+\beta, \lambda)
\] by Theorem 34.1.
Proposition 38.4 is intuitive when both \(\alpha\) and \(\beta\) are integers because then \(X\) is the sum of \(\alpha\) i.i.d. exponentials and \(Y\) is the sum of \(\beta\) i.i.d. exponentials.
If we scale \(X\) by \(a\), that is equivalent to scaling each exponential by \(a\), which we know from Proposition 22.3 results in an exponential with rate \(\lambda / a\). Therefore, we are simply adding \(\alpha\) i.i.d. exponential random variables with rate \(\lambda / a\), which follows a \(\text{Gamma}(\alpha, \frac{\lambda}{a})\) distribution.
If we add \(X\) and \(Y\), then together they represent the sum of \(\alpha + \beta\) i.i.d. exponentials, which follows a \(\text{Gamma}(\alpha + \beta, \lambda)\) distribution.
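Both properties can also be checked numerically; the following sketch (with arbitrary parameter values) compares simulated quantiles of \(aX\) and \(X + Y\) to the quantiles of the gamma distributions claimed in Equation 38.6 and Equation 38.7.

```r
# Numerical check of Proposition 38.4 (illustrative parameter values)
set.seed(1)
alpha <- 2.5; beta <- 1.5; lambda <- 2; a <- 3
x <- rgamma(100000, shape = alpha, rate = lambda)
y <- rgamma(100000, shape = beta, rate = lambda)

# aX should look like Gamma(alpha, lambda / a)
quantile(a * x, c(0.25, 0.5, 0.75))
qgamma(c(0.25, 0.5, 0.75), shape = alpha, rate = lambda / a)

# X + Y should look like Gamma(alpha + beta, lambda)
quantile(x + y, c(0.25, 0.5, 0.75))
qgamma(c(0.25, 0.5, 0.75), shape = alpha + beta, rate = lambda)
```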
Now that we know the gamma distribution and its properties, we can return to the problem posed at the beginning of this chapter, which was a motivating example for this entire unit.
Example 38.1 (Exact bias of the exponential MLE) Since \(S_n \sim \text{Gamma}(n, \lambda)\), \[
\bar X = \frac{1}{n} S_n \sim \text{Gamma}(n, n\lambda)
\] by Proposition 38.4. Therefore, using the gamma PDF and the recursion \(\Gamma(n) = (n-1)\Gamma(n-1)\) (valid for \(n > 1\)), \[
\text{E}\!\left[ \hat\lambda \right] = \text{E}\!\left[ \frac{1}{\bar X} \right] = \int_0^\infty \frac{1}{x} \cdot \frac{(n\lambda)^n}{\Gamma(n)} x^{n-1} e^{-n\lambda x}\,dx = \frac{(n\lambda)^n}{\Gamma(n)} \cdot \frac{\Gamma(n-1)}{(n\lambda)^{n-1}} = \frac{n}{n-1} \lambda.
\]
Now we can plainly see that \(\hat\lambda\) overestimates \(\lambda\) by a factor of \(\frac{n}{n-1} > 1\), which confirms that it is positively biased. However, as \(n\to\infty\), this factor approaches \(1\), which also makes sense, since \(\hat\lambda\) is consistent for \(\lambda\).
To calculate the exact value of the bias, we subtract the true value of the parameter from this expectation: \[
\text{E}\!\left[ \hat\lambda \right] - \lambda = \left(\frac{n}{n-1} - 1\right)\lambda = \frac{\lambda}{n-1}.
\]
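A short simulation (with arbitrary choices of \(n\) and \(\lambda\)) confirms this formula:

```r
# Simulated bias of the MLE lambda-hat = 1 / X-bar (illustrative values)
set.seed(1)
n <- 10; lambda <- 2
lambda_hat <- replicate(100000, 1 / mean(rexp(n, rate = lambda)))
mean(lambda_hat) - lambda   # should be close to lambda / (n - 1) = 0.222
```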
One of the advantages of calculating the exact bias (which was only possible because we knew the exact distribution of \(\bar X\)) is that it suggests an unbiased estimator of \(\lambda\). Exercise 38.3 asks you to find such an estimator and compare its MSE with the MLE \(\hat\lambda\).
38.2 Maximum Likelihood Estimation
Now that we can model data using the gamma distribution with non-integer values of \(\alpha\), how do we find the best value of \(\alpha\) for a given data set? The answer is maximum likelihood, of course.
The likelihood for a sample of data \(X_1, \dots, X_n\) is \[ L(\alpha, \lambda) = \prod_{i=1}^n \frac{\lambda^\alpha}{\Gamma(\alpha)} X_i^{\alpha - 1} e^{-\lambda X_i}, \] so the log-likelihood is \[ \ell(\alpha, \lambda) = n\alpha \log\lambda - n\log\Gamma(\alpha) + (\alpha - 1) \sum_{i=1}^n \log X_i - \lambda \sum_{i=1}^n X_i.\]
If we know \(\alpha\), then the optimal \(\lambda\) satisfies \[
\frac{\partial\ell}{\partial\lambda} = \frac{n\alpha}{\lambda} - \sum_{i=1}^n X_i = 0,
\] so the optimal value of \(\lambda\) (for a given value of \(\alpha\)) is \[ \hat\lambda(\alpha) = \frac{\alpha}{\bar X}. \tag{38.8}\]
Substituting this back into the log-likelihood, we see that the optimal \(\alpha\) must maximize \[
\ell(\alpha, \hat\lambda(\alpha)) = n\alpha \log \frac{\alpha}{\bar X} - n\log\Gamma(\alpha) + (\alpha - 1) \sum_{i=1}^n \log X_i - n\alpha
\] or equivalently, \[
J(\alpha) = \frac{1}{n} \ell(\alpha, \hat\lambda(\alpha)) = \alpha \log \frac{\alpha}{\bar X} - \log\Gamma(\alpha) + (\alpha - 1) \frac{1}{n} \sum_{i=1}^n \log X_i - \alpha.
\tag{38.9}\]
Solving for the optimal \(\alpha\) by hand is hopeless, not least because there is an \(\alpha\) nested inside a gamma function. However, for any given data set, we can find the optimal \(\alpha\) numerically using Newton’s method.
Example 38.2 (Newton’s method for maximum likelihood)
Newton’s method, which Isaac Newton described in 1669, is a general method for maximizing (or minimizing) a function \(J(\alpha)\).
We start with a guess for \(\alpha\) and iteratively update the guess as follows: \[
\alpha \leftarrow \alpha - \frac{J'(\alpha)}{J''(\alpha)}.
\tag{38.10}\] After several iterations, \(\alpha\) will converge to a (local) maximum or minimum.
To maximize Equation 38.9 using Newton’s method, we will first need to calculate its derivatives.
The first derivative is \[ J'(\alpha) = \log\alpha - \log \bar X - \underbrace{\frac{\Gamma'(\alpha)}{\Gamma(\alpha)}}_{\psi(\alpha)} + \frac{1}{n}\sum_{i=1}^n \log X_i \] and involves the digamma function \(\psi(\alpha) \overset{\text{def}}{=}\frac{\Gamma'(\alpha)}{\Gamma(\alpha)}\), which is built into many scientific programming libraries and languages, including R.
The second derivative is \[ J''(\alpha) = \frac{1}{\alpha} - \psi'(\alpha), \] which involves the derivative of the digamma function, called the trigamma function.
Now, we implement Newton’s method on the prices data from above in R.
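The following is a minimal sketch of that computation. Because the data set itself is not reproduced here, the code assumes the 100 sale prices are stored in a numeric vector named prices; that name, the starting guess, and the number of iterations are all assumptions made for illustration.

```r
# Newton's method for the gamma shape parameter (Equation 38.10),
# assuming the data are in a numeric vector `prices` (name assumed)
log_xbar <- log(mean(prices))
mean_log <- mean(log(prices))

alpha <- 1  # starting guess (assumed); most positive values work
for (i in 1:10) {
  J1 <- log(alpha) - log_xbar - digamma(alpha) + mean_log  # J'(alpha)
  J2 <- 1 / alpha - trigamma(alpha)                        # J''(alpha)
  alpha <- alpha - J1 / J2                                  # Newton update
  cat(sprintf("Iteration %2d: alpha = %.6f\n", i, alpha))
}
```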
Notice that Newton’s method converges after only about 5 or 6 iterations. Try different starting guesses for \(\alpha\). No matter where you start, Newton’s method should converge to the same estimate.
Now that we have the optimal value of \(\alpha\), we can plug it into Equation 38.8 to obtain the corresponding value of \(\lambda\). Finally, we plot the fitted gamma distribution on top of the data.
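Continuing the sketch above (same assumed prices vector and fitted alpha), the last two steps might look like this:

```r
# Plug the fitted shape into Equation 38.8, then overlay the fitted density on the data
lambda_hat <- alpha / mean(prices)
hist(prices, breaks = 20, freq = FALSE,
     xlab = "price (thousands of dollars)", main = "Fitted gamma model")
curve(dgamma(x, shape = alpha, rate = lambda_hat), add = TRUE, lwd = 2)
```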
If the fit of this curve to the data is disappointing, the gamma distribution is not to blame! This is merely the member of the gamma family that fits this data the best. The fact that it does not fit the data particularly well suggests that we may want to look beyond the gamma distribution for a model.
38.3 Chi-Square Distribution
The gamma distribution with non-integer shape parameter \(\alpha\) also arises in the guise of the chi-square, or \(\chi^2\), distribution, an important distribution in statistics, which will figure prominently in Chapter 45.
Let \(Z_1, \dots, Z_k\) be i.i.d. standard normal. The random variable \[S_k = Z_1^2 + \dots + Z_k^2\] is said to follow a \(\chi^2_k\) distribution. But the \(\chi^2_k\) distribution is not new; it is merely a special case of the gamma distribution!
Proposition 38.5 (Chi-square is gamma) The \(\chi^2_k\) distribution is the \(\text{Gamma}(\alpha=\frac{k}{2}, \lambda=\frac{1}{2})\) distribution.
Proof
We can prove this using MGFs. First, we calculate the MGF of \(Z_1^2\): \[\begin{align*}
M_{Z_1^2}(t) &= \text{E}\!\left[ e^{tZ_1^2} \right] \\
&= \int_{-\infty}^\infty e^{tz^2} \frac{1}{\sqrt{2\pi}} e^{-\frac{1}{2} z^2}\,dz & \text{(LotUS)} \\
&= \int_{-\infty}^\infty \frac{1}{\sqrt{2\pi}} e^{-\frac{1}{2} (1 - 2t) z^2}\,dz \\
&= \begin{cases} \frac{1}{\sqrt{1 - 2t}} \int_{-\infty}^\infty \underbrace{\frac{1}{\frac{1}{\sqrt{1 - 2t}} \sqrt{2\pi}} e^{-\frac{1}{2} (1 - 2t) z^2}}_{\textrm{Normal}(\mu= 0, \sigma^2= \frac{1}{1-2t}) \text{ PDF}} \,dz & t < \frac{1}{2} \\ \infty & \text{otherwise} \end{cases} \\
&= \begin{cases} \left(\frac{1}{1 - 2t}\right)^{1/2} & t < \frac{1}{2} \\
\infty & \text{otherwise}
\end{cases}.
\end{align*}\]
Another way to write this is \[ M_{Z_1^2}(t) = \left(\frac{\frac{1}{2}}{\frac{1}{2} - t}\right)^{1/2}; \qquad t < \frac{1}{2}, \] which we recognize from Proposition 38.3 as the MGF of a \(\text{Gamma}(\alpha=\frac{1}{2}, \lambda=\frac{1}{2})\) distribution.
Now, \(S_k = Z_1^2 + \dots + Z_k^2\) is just the sum of \(k\) independent such gamma random variables. By Equation 38.7, we know that its distribution must therefore be \(\text{Gamma}(\alpha=\frac{k}{2}, \lambda=\frac{1}{2})\).
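Since R parametrizes the chi-square distribution by its degrees of freedom, Proposition 38.5 can be checked directly by comparing densities; the value of \(k\) below is an arbitrary illustrative choice.

```r
# The chi-square density with k degrees of freedom matches the Gamma(k/2, 1/2) density
k <- 5
x <- seq(0.1, 20, by = 0.1)
max(abs(dchisq(x, df = k) - dgamma(x, shape = k / 2, rate = 1 / 2)))  # essentially 0
```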
38.4 Exercises
Exercise 38.1 (Deriving the gamma expectation and variance using MGFs) Rederive the expectation and variance of the gamma distribution (Proposition 38.2) using the MGF of the gamma distribution (Equation 38.5).
Exercise 38.2 (Moments of the gamma distribution) Let \(X \sim \text{Gamma}(\alpha, \lambda)\) and \(k\) be a positive integer. Use the moment generating function to show that \[
\text{E}\!\left[ X^k \right] = C_k \frac{\Gamma(\alpha+k)}{\Gamma(\alpha)}
\] for some constant \(C_k\) (that depends on \(k\)). What is \(C_k\)?
Exercise 38.3 (Comparing estimators of the exponential rate parameter) In Example 38.1, we determined the bias of the MLE \(\hat\lambda\) of the rate parameter \(\lambda\) from i.i.d. observations \(X_1, \dots, X_n \sim \text{Exponential}(\lambda)\). We will assume \(n > 2\).
Use the results to propose an unbiased estimator \(\hat\lambda_{\text{unbiased}}\) of \(\lambda\).
Calculate the MSE of \(\hat\lambda\).
Calculate the MSE of \(\hat\lambda_{\text{unbiased}}\).
Which estimator is better?
(Hint: You should be able to calculate \(\text{E}\!\left[ \frac{1}{\bar X^2} \right]\) in a similar way to Example 38.1.)
Exercise 38.4 (Moments of the exponential distribution) Let \(X \sim \text{Exponential}(\lambda)\). Show that \[
\text{E}\!\left[ X^k \right] = \frac{k!}{\lambda^k}
\] for any positive integer \(k\).
Exercise 38.5 (Modeling rainfall using the gamma distribution) The total rain water collected in the Crystal Springs Reservoir in a year is gamma distributed with mean \(1000\) gallons and standard deviation \(300\) gallons. Assuming rain is independent year to year, what is the probability that
the total rainwater collected by the reservoir in two years is less than \(2500\) gallons?
the reservoir collects more rainwater than average in at least three of the next five years?
Exercise 38.6 (Relationship between the gamma and Poisson distributions) Let \(\alpha\) be a positive integer.
Show that \[
\int_x^\infty \frac{1}{\Gamma(\alpha)} z^{\alpha - 1} e^{-z} \, dz = \sum_{y=0}^{\alpha-1} \frac{x^y e^{-x}}{y!}.
\]
If \(X \sim \text{Gamma}(\alpha,\lambda)\), show that \[
P(X \leq x) = P(Y \geq \alpha),
\] where \(Y \sim \text{Poisson}(\lambda x)\).
Exercise 38.7 (Why the Rayleigh distribution is useful for modeling wind speeds) In Exercise 30.3, you modeled wind speeds using the Rayleigh distribution. Here, we will give a physical motivation for why the Rayleigh distribution is reasonable for modeling wind speeds.
Let \(X\) be the east-west wind velocity, where a positive velocity indicates that the wind is blowing east. Let \(Y\) be the north-south wind velocity. Then, the wind speed is the magnitude of the velocity vector: \[
W = \sqrt{X^2 + Y^2}.
\] Show that if \(X\) and \(Y\) are independent \(\text{Normal}(0, \sigma^2)\), then \(W\) follows a Rayleigh distribution (Equation 30.12).
Hint: First, determine the distribution of \(W^2\), which is related to the distributions in this chapter.
Agresti, Alan. 2015. Foundations of Linear and Generalized Linear Models. John Wiley & Sons.