Chapter 5 Discrete Probability Distributions

Learning Outcome


Compute measures of expectation and variation for a discrete probability distribution.


In this chapter, we will extend the concept of relative frequencies to understand and calculate the probability of occurrence of a random event. We will also learn about normal distribution, its properties, and methods to calculate probabilities of random events that are described by this distribution.

5.1 Definitions

An event is any collection of results or outcomes of a procedure. For example, tossing a coin is an event with possible outcomes heads and tails.

A simple event is an event that has one outcome. For example, births of \(2\) girls followed by a boy is a simple event because the only possible outcome is \(\{ggb\}\).

However, births of \(2\) girls and a boy is an event that has three possible outcomes \(\{ggb, gbg, bgg \}\).

A sample space for a procedure consists of all possible simple events. For example, with births of three children, the sample space consists of eight different simple events \(\{ bbb, bbg, bgb, bgg, gbb, gbg, ggb, ggg\}\)

An event is random if individual outcomes of it are unpredictable, meaning they have no apparent pattern of occurrence, but there is nonetheless a predictable distribution (i.e. the frequencies) of those different outcomes over a large number of repetitions of the event.

The probability of any outcome of a random event can be defined as the proportion of times the outcome would occur in a very long series of repetitions.

The probability is defined as a proportion, and it always takes values between \(0\) and \(1\) (inclusively). It may also be displayed as a percentage between \(0\%\) and \(100\%\).

  • \(0\%: \text{event is impossible}\)
  • \(100\%: \text{event is certain}\)


THEORETICAL VERSUS EXPERIMENTAL PROBABILITY

The Theoretical probability is the likelihood of occurring of an event. It is simply the ratio of the number of desired outcomes and the number of all possible outcomes.

The experimental probability is an estimate of the likelihood of occurring of an event based on repeated trials.

The subjective probability is an estimate of the likelihood of occurring of an event based on someone’s belief.


LAW OF LARGE NUMBERS


Consider: Rolling a 1 of a die

If the sample space of a random experiment consists of \(n\) equally likely outcomes and an event \(E\) consists of \(m\) of those outcomes, then

\[\text {Theorerical Probability} : P(E) = \frac{\text{number of outcomes in the event}(m)}{\text{total number of outcomes}(n)}\]

Let \(\hat{p_n}\) be the proportion of outcomes that are \(1\) after the \(n\) rolls. As the number of rolls \((n)\) increases, \(\hat{p_n}\) (the relative frequency of rolls, or the experimental probability) will converge to the theoretical probability of rolling a \(1,\space p = 1/6.\) The figure shows the convergence for \(100,000\) die rolls.

The tendency of \(\hat{p_n}\) to stabilize around \(p\), i.e. the tendency of the relative frequency to stabilize around the true probability, is described by the Law of Large Numbers.

As more observations are collected, the observed proportion \(\hat{p_n}\) of occurrences with a particular outcome after \(n\) trials converges to the true probability \(p\) of that outcome.

Die Rolls Simulation

The figure shows the fraction of die rolls that are \(1\) at each stage in a simulation. The relative frequency tends to get closer to the probability \(1/6 \approx 0.167\) as the number of rolls increases.


Example: Calculating Classical Probabilities

Assuming that births of boys and girls are equally likely, find the probability of getting children of all of the same gender when three children are born.

Recall the sample space of three children: \(\{ bbb, bbg, bgb, bgg, gbb, gbg, ggb, ggg \}\), includes eight equally likely outcomes, and there are exactly two outcomes in which three children are of the same gender: \(\{ bbb, ggg \}\) .

\[ P(\text{ three children of the same gender}) = \dfrac{2}{8} = \dfrac{1}{4} = 0.25 \]

5.2 Finding Probabilities

This section presents the addition and multiplication rules of calculating probabilities.

5.2.1 Disjoint or mutually exclusive outcomes

Two events or outcomes are called disjoint or mutually exclusive if they cannot both happen in the same trial.

When rolling a die, the outcomes \(1\) and \(2\) are disjoint, and we compute the probability that one of these outcomes will occur by adding their separate probabilities: \[P(1 \text{ or } 2)=P(1)+P(2)=1/6+1/6=1/3\]

What about the probability of rolling a \(1, 2, 3, 4, 5, \ or \ 6\) ?

\[ \begin{array}{ll} P(1 \text{ or } 2 \text{ or } 3 \text{ or } 4 \text{ or } 5 \text{ or }6) = P(1)+P(2)+P(3)+P(4)+P(5)+P(6) \\ =1/6+1/6+1/6+1/6+1/6+1/6 =1 \end{array} \]

It is no surprise that the probability is \(1\), since it is certain that one of the six outcomes must occur.


ADDITION RULE OF DISJOINT OUTCOMES


If \(A_1,...,A_k\) represent \(k\) disjoint outcomes, then the probability that one of them occurs is given by: \[P(A_1\text{ or }A_2 \text{ or ... or }A_k)=P(A_1)+P(A_2)+...+P(A_k)\]


Example: Consider a standard deck of cards.

\[ \text {4 suits} \left\{ \begin{array}{ll} \text{hearts: } \color{red}{\heartsuit} \\ \text{diamonds: } \color{red}{\diamondsuit} \\ \text{clubs: } \spadesuit \\ \text{spades: } \clubsuit \end{array} \right. \] \[\text{13 cards in each suit: } Ace, 2, 3, 4, 5, 6, 7, 8, 9, 10, Jack, Queen, King\] One card is dealt from a well shuffled deck.

\[ \begin{align} P(\text{the card is an ace or a king}) &= P(\text{it's an ace})+P(\text {it's a king}) \\ & = 4/52+4/52 \\ & = 8/52 \\ & = 2/13 \end{align} \]



Venn Diagram - a diagram style to illustrate simple set relationships in probability.


Venn Diagram | When events are disjoint

\[ \begin{align} P(\text{the card is an ace or a king}) &= P(\text{it's an ace})+P(\text {it's a king}) \\ & = 4/52+4/52 \\ & = 2/13 \end{align} \]

## (polygon[GRID.polygon.471], polygon[GRID.polygon.472], polygon[GRID.polygon.473], polygon[GRID.polygon.474], text[GRID.text.475], text[GRID.text.476], text[GRID.text.477], text[GRID.text.478])

5.2.2 Probabilities when events are NOT disjoint or mutually exclusive

\[ \begin{align} & P(\text{the card is an ace or a heart}) \\ &= P(\text{it's an ace})+P(\text {it's a heart})-P(\text{it's an ace & heart}) \\ & = 4/52+13/52 - \underbrace{1/52}_{\text {adjustment made to avoid double-counting of the ace of hearts}} \\ & = 16/52 \\ & = 4/13 \end{align} \]

Venn Diagram | When events are not disjoint

\[ \begin{align} & P(\text{the card is an ace or a heart}) \\ & = P(\text{it's an ace})+P(\text {it's a heart})-P(\text{it's an ace AND heart}) \\ & = 4/52+13/52 - 1/52 = 16/52 \end{align} \]

FALSE (polygon[GRID.polygon.479], polygon[GRID.polygon.480], polygon[GRID.polygon.481], polygon[GRID.polygon.482], text[GRID.text.483], text[GRID.text.484], text[GRID.text.485], text[GRID.text.486], text[GRID.text.487])


GENERAL ADDITION RULE OF PROBABILITIES


\[ \bbox[yellow,5px] {\color{black}{P(A \space or \space B) = P(A) + P(B) - P(A \space and \space B)}} \] where \(P(A \text{ and } B)\) is the probability that both events occur.

If \(A\) and \(B\) are mutually exclusive, \(P(A \space and \space B) = 0\)

Therefore,

\[ P(A \space or \space B) = P(A) + P(B)\]


5.2.3 Complement of an event

The complement of event \(A\) is denoted \(A^c\), and \(A^c\) represents all outcomes not in \(A\). \(A\) and \(A^c\) are mathematically related:

\[ \begin{align} & P(A) + P(A^c) = 1 \\ or, \space & P(A^c) = 1 - P(A) \end{align} \]

Example: if an event has chance \(40\%\), then the chance that it doesn’t happen is \(60\%\).

Venn Diagram | Exercise

\[ \begin{align} P(email) &=0.73 \\ P(text) &= 0.62 \\ P(\text {email & text}) &= 0.49 \\ P(\text {only email}) &= 0.73 - 0.49 = 0.24 \\ P(\text{only text}) &= 0.62 - 0.49 = 0.13 \\ P(\text{neither email nor text}) &= 1 - (0.24 + 0.49 + 0.13) = 0.14 \end{align} \]

(polygon[GRID.polygon.488], polygon[GRID.polygon.489], polygon[GRID.polygon.490], polygon[GRID.polygon.491], text[GRID.text.492], text[GRID.text.493], text[GRID.text.494], text[GRID.text.495], text[GRID.text.496]) 

5.2.4 Multiplication Rule | for independent processes

If \(A\) and \(B\) represent events from two different and independent processes, then the probability that both \(A\) and \(B\) occur can be calculated as the product of their seprarate probabilities:

\[P(A \text{ and } B) = P(A) \times P(B)\]

Similarly, if there are \(k\) events \(A_1,...,A_k\) from \(k\) independent processes, then the probability they all occur is

\[ \bbox[yellow,5px] { \color{black} {P(A_1\text{ and }A_2 \text{ and ... and }A_k)=P(A_1)\times P(A_2)\times...\times P(A_k)} } \]

Example 1:

About \(9\%\) of people are left-handed. Suppose \(5\) people are selected at random from the US population.
(a) What is the probability that all are right-handed?
(b) What is the probability that all are left-handed?
(c) What is the probability that not all of them are right-handed?

\[ \begin{align} &(a) \space P\text{(All are RH)} = (1-0.09)^5 = 0.624 \\ &(b) \space P\text{(All are LH)} = (0.09)^5 = 0.0000059 \\ &(c) \space P\text{(not all RH)} = 1- P(\text {all RH}) = 1-0.624 = 0.376 \end{align} \]


GENERAL MULTIPLICATION RULE


If \(A\) and \(B\) represent two outcomes or events, then

\[ \bbox[yellow,5px] {\color{black}{P(A \space and \space B) = P(A|B) \times P(B)}} \]


Example 2: If a card is randomly drawn from a well-shuffled deck, what is the probability that it is the ace of hearts? [Note: Ace and Hearts are two independent events.]

\[ \begin{align} P(Ace \text{ and } Hearts) &= P(Hearts) \times P(Ace \text { | } Hearts) \\ &= (13/52) \times (1/13) = 1/52 \end{align} \]

Example 3:

During the smallpox outbreak, \(96.08\%\) of Boston residents were not inoculated, and \(85.88\%\) of the residents who were not inoculated ended up surviving. What is the probability that a resident was not inoculated and lived?

To answer the question, we want to determine \(\text {P(lived and not inoculated)}\),

and we are given that,

\[ \begin{align} P(\text{lived | not inoculated}) &= 0.8588 \\ P(\text {not inoculated}) &= 0.9608 \end{align} \]

Among the \(96.08\%\) of people who were not inoculated, \(85.88\%\) survived:

\[ P(\text {lived and not inoculated}) = 0.8588 \times 0.9608 = 0.8251 \]

This is equivalent to the General Multiplication Rule.

5.2.5 Marginal and joint probabilities

If a probability is based on a single variable, it is a marginal probability. The probability of outcomes for two or more variables or processes is called a joint probability.

Exercise: Calculating Probabilities with a Contingency Table:

Table: College enrollment and parents’ educational attainment

\[ \begin{array} {l|cc|r} & \text{parents: degree} & \text{parents: no degree} & \text{total} \\ \hline \text {teen: college} & 231 & 214 & 445 \\ \text {teen: no college} & 49 & 298 & 347 \\ \hline \text {total} & 280 & 512 & 792 \end{array} \]

a) Finding Marginal and Joint Probabilities:

\[ \begin{array} {l|cc|c} & \text{parents: degree} & \text{parents: no degree} & \text{marginal} \\ \hline \text {teen: college} & \color{red}{0.29} & \color{red}{0.27} & \color{blue}{0.56} \\ \text {teen: no college} & \color{red}{0.06} & \color{red}{0.38} & \color{blue}{0.44} \\ \hline \text {marginal} & \color{blue}{0.35} & \color{blue}{0.65} & 1.00 \end{array} \]

\[ \begin{align} &\color{blue}{\text{Marginal Probability: }} P(\text{teen: college})=\frac{445}{792}=0.56 \\ &\color{red}{\text{Joint Probability: }} P(\text {teen: college and parents: no degree})=\frac{214}{792}=0.27 \end{align} \]

b) Finding Conditional Probability:


Conditional Probability

The conditional probability of the outcome of interest \(A\) given condition \(B\) is computed as the following:

\[P(A|B) = \frac{P(A \text{ and } B)}{P(B)}\]


\[ \begin{array} {l|cc|r} & \text{parents: degree} & \text{parents: no degree} & \text{total} \\ \hline \text {teen: college} & 231 & 214 & 445 \\ \text {teen: no college} & 49 & 298 & 347 \\ \hline \text {total} & 280 & 512 & 792 \end{array} \]

\[ \begin{align} P(\text {teen college | parents degree}) &= \frac{231/792}{280/792} = 0.825 \\ P(\text {teen college | parents no degree}) &= \frac{214/792}{512/792} = 0.418 \\ P(\text {teen no college | parents degree}) &= \frac{49/792}{280/792} = 0.175 \\ P(\text {teen no college | parents no degree}) &= \frac{298/792}{512/792} = 0.582 \end{align} \]

c) Condition of Independence

Verify whether one of the following equations holds:


\[ \begin{align} P(A|B) &= P(A) \tag 1 \\ P(A \space and \space B) &=P (A) \times P(B) \tag 2 \end{align} \]


Check if the equality holds in the following equation:

\[ \begin{align} P(\text{teen college | parent degree})&\stackrel{?}{=} P(\text {teen college}) \\ 0.825 &\ne 0.560 \end{align} \] Because both sides are not equal, teenager college attendance and parent degree are not independent.

Two events are mutually exclusive

If \(A\) and \(B\) are mutually exclusive events, then they cannot occur at the same time. If asked to determine if events \(A\) and \(B\) are mutually exclusive, verify one of the following equations holds:


\[ \begin{align} P(\text{A and B})&= 0 \tag 1 \\ P(\text{A or B}) &= P(A)+P(B) \tag 2 \end{align} \]


If the equation that is checked holds true, \(A\) and \(B\) are mutually exclusive. If the equation does not hold, then \(A\) and \(B\) are not mutually exclusive.

At Least One

A poker hand (5 cards) is dealt from a well shuffled deck. What is the chance that there is at least one ace in the hand?

\[ \begin{align} &P(\text{at least one ace}) \\ &=1-P(\text{no aces}) \\ &=1-(48/52) \times (47/51) \times (46/50) \times (45/49) \times (44/48) \\ &=34.11\% \end{align} \]


5.3 Random Variables

A random variable is a numerical measure of an outcome from a random experiment. It has a single numerical value, determined by chance, for each outcome of an event. We often use a capital letter such as \(X\) to stand for a random variable.

Let \(X\) be a random variable representing all possible outcomes of rolling a six-sided die once. Find the given probability:

\(1. P(X=4)\)

\(2. P(X\le 4)\)

\(3. P(X>4)\)

\(4. P(3 \le X \le 6)\)

  • A discrete random variable has a collection of values that is finite or countable, such as number of tosses of a coin before getting heads.

  • A continuous random variable has infinitely many values, and the collection of values is not countable, such as body temperature.

5.4 Probability Distribution for a Discrete Random Variable

A probability distribution for a discrete random variable is a description that gives the probability for each value of the random variable. It can be expressed as a table or graph, or formula.

Properties
  1. There is a numerical (not categorical) random variable \(x\), and its number values are associated with corresponding probabilities.

  2. \(\sum P(x)=1\), where \(x\) assumes all possible values.

  3. \(0 \le P(x) \le 1\) for every individual value of the random variable \(x\).

\[ \begin{array}{c|lcr} x: \text{ number of heads} \\ \text {when two coins are tossed} & P(x) \\ \hline 0 & 0.25 \\ 1 & 0.50 \\ 2 & 0.25 \end{array} \]

  1. Find the probability of getting two heads out of two tosses?
  2. Find the probability of getting at least one head out of two tosses?
  3. Find the probability of getting at most one head out of two tosses?

5.5 Parameters of a Probability Distribution

Mean \(\mu\) of a probability distribution

\(\mu = \sum [x_i \cdot P(x_i)]\)

Variance \(\sigma^2\) for a probability distribution

\(\sigma^2 = \sum[(x_i-\mu)^2 \cdot P(x_i)] = \sum[x_i^2 \cdot P(x_i)]-\mu^2\)

Standard deviation \(\sigma\) for a probability distribution

\(\sigma = \sqrt{\sum[x_i^2 \cdot P(x_i)]-\mu^2}\)


Expected Value

In probability theory, the expected value of a random variable, intuitively, is the long-run average value of repetitions of the experiment it represents. For example, the expected value in rolling a six-sided die is \(3.5\), because the average of all the numbers that come up in an extremely large number of rolls is close to \(3.5\). The law of large numbers states that the arithmetic mean of the values almost surely converges to the expected value as the number of repetitions approaches infinity.

The expected value of a discrete random variable is the probability-weighted average of all possible values. In other words, each possible value the random variable can assume is multiplied by its probability of occurring, and the resulting products are summed to produce the expected value. The same principle applies to an absolutely continuous random variable, except that an integral of the variable with respect to its probability density replaces the sum.

The expected value is a key aspect of how one characterizes a probability distribution; it is one type of location parameter. By contrast, the variance is a measure of dispersion of the possible values of the random variable around the expected value. The variance itself is defined in terms of two expectations: it is the expected value of the squared deviation of the variable’s value from the variable’s expected value.

Expected value of a discrete random variable

If \(X\) takes outcomes \(x_1, x_2,\cdots,x_m\) with probabilities \(p_1, p_2,\cdots, p_m\) the expected value of \(X\) is the sum of each outcome multiplied by its corresponding probability:

\[ \begin{align} E(X) &= \mu_X = x_1 \times p_1 + x_2 \times p_2 +\cdots+x_m \times p_m \\ &= \sum^{m}_{i=1}(x_i \times p_i) \end{align} \]

Example: \(\text{ Random Variable } X: \text{the number of spots on one roll of a die}\)

Probability distribution table for \(X\)

\[ \begin{array}{c|c|c|c|c|c|c} x & 1 & 2 & 3 & 4 & 5 & 6\\ \hline P(x) & 1/6 & 1/6 & 1/6 & 1/6 & 1/6 & 1/6 \end{array} \]

\[ E(X) = 1 \cdot (1/6) + 2 \cdot (1/6) + 3 \cdot (1/6) + 4 \cdot (1/6) + 5 \cdot (1/6) + 6 \cdot (1/6) = 3.5 \]


Variability in Discrete Random Variables

Variance and standard deviation of a discrete random variable

If \(X\) takes outcomes \(x_1, x_2, \cdots ,x_m\) with probabilities \(p_1, p_2, \cdots ,p_m\) and expected value \(\mu_x = E(X),\) then to find the standard deviation of \(X\), we first find the variance and then take its square root.

\[ \begin{align} Var(X)=\sigma^2_x &= (x_1 - \mu_x)^2 \times p_1 + (x_2 - \mu_x)^2 \times p_2 + \cdots + (x_m - \mu_x)^2 \times p_m \\ &= \sum^m_{i=1}(x_i-\mu_x)^2 \times p_i \\ \\ SD(X) = \sigma_x &= \sqrt{\sum^m_{i=1}(x_i-\mu_x)^2 \times p_i} \\ \\ \text {From the above example, } \\ \\ Var(X)=\sigma^2_x &= [(1 - 3.5)^2 + (2 - 3.5)^2 + \cdots + (6 - 3.5)^2]\times (1/6) \\ &= 2.92 \\ \\ SD(X) = \sigma_x &= \sqrt{2.92} = 1.71 \end{align} \]

Exercise 1: Coin Toss

\[ \begin{array}{c|lcr} x : \text{ number of heads} \\ \text {when two coins are tossed} & P(x) \\ \hline 0 & 0.25 \\ 1 & 0.50 \\ 2 & 0.25 \end{array} \] Calculate expected value/average number of heads and SD from three tosses.

Solution:

\[\begin{align} E(X) &= 0 \times .25 + 1 \times .50 + 2 \times .25 = 1 \\ VAR(X) &= (0 - 1)^2 \times .25 + (1 - 1)^2 \times .50 + (2 - 1)^2 \times .25 \\ &= .5 \\ SD(X) &= \sqrt {.5} \\ & = .7071 \end{align}\]

Exercise 2: Be A Better Bettor

\[ \begin{array}{c|c|lcr} \text{Roulette} \\ \text{Event} & x & P(x) \\ \hline \text{Lose} & - \$ 5 & \frac{37}{38} \\ \text{Win} & \$175 & \frac{1}{38} \end{array} \]

\[ \begin{array}{c|c|lcr} \text{Craps Game} \\ \text{Event} & x & P(x) \\ \hline \text{Lose} & - \$ 5 & \frac{251}{495} \\ \text{Win} & \$5 & \frac{244}{495} \end{array} \]

Which of the bets is better in the sense of producing higher expected value?

Solution:

Roulette

\[ E(X) = (- \$ 5) \cdot (\frac{37}{38}) + (\$175) \cdot (\frac{1}{38}) = -\$.26 \]

Craps Game

\[ E(X) = (- \$ 5) \cdot (\frac{251}{495}) + (\$5) \cdot (\frac{244}{495}) = -\$.07 \]

\(\therefore\) Craps Game seems to be a better bet since it has the higher expected return. However, both games will generate profit for the casino owner.

5.6 Binomial Probability Distribution

A binomial probability distribution results from a procedure that meets the following requirements:

  • The procedure has a fixed number of trials (A trial is a single observation).

  • The trials must be independent, meaning that the outcome of any individual trial doesn’t affect the probabilities in the other trials.

  • Each trial must have all outcomes classified into exactly two categories, commonly referred to as success and failure.

  • The probability of a success remains the same in all trials.

Notation for binomial probability distribution

\[ \begin{align} S, F &: \text {success and failure denote the two possible outcomes from each trial.} \\ P(S) &= p \text{ probability of success in one of the } n \text { trials} \\ P(F) &= q = 1 - p \text{ probability of failure in one of the } n \text { trials} \\ n &: \text { number of trials} \\ x &: \text {number of successes in } n \text { trials}; 0 \le x \le n \\ \end{align} \]

Probability Distribution of the number of observing \(1\) in \(3\) independent Rolls of a dice

Probability distribution of the number of successes \((x)\) in \(n = 3\) independent success/failure trials, each of which is a success with chance \(p = \dfrac{1}{6}.\)

\[ \begin{array}{rclr} x=k & \text{pattern} & \text{chance of pattern} & P(x) \\ \hline 0 & FFF & (5/6) \cdot (5/6) \cdot (5/6) & 1 \cdot (5/6)^3=0.5787 \\ \hline 1 & SFF & (1/6) \cdot (5/6) \cdot (5/6) & \\ & FSF & (5/6) \cdot (1/6) \cdot (5/6) & \\ & FFS & (5/6) \cdot (5/6) \cdot (1/6) & 3 \cdot (1/6) \cdot (5/6)^2 = 0.3472 \\ \hline 2 & SSF & (1/6) \cdot (1/6) \cdot (5/6) & \\ & SFS & (1/6) \cdot (5/6) \cdot (1/6) & \\ & FSS & (5/6) \cdot (1/6) \cdot (1/6) & 3 \cdot (1/6)^2 \cdot (5/6) = 0.0694 \\ \hline 3 & SSS & (1/6) \cdot (1/6) \cdot (1/6) & 1 \cdot (1/6)^3 = 0.0046 \\ \hline & & & \sum = 1.0000 \end{array} \]

Probability Mass Function for a Binomial Random Variable

Let’s reconsider the table above.

\[ \begin{array}{rclr} x=k & \text{pattern} & \text{chance of pattern} & P(x) \\ \hline 0 & FFF & (5/6) \cdot (5/6) \cdot (5/6) & \underbrace{1}_{\displaystyle \binom{3}{0}} \cdot \underbrace{(5/6)^3}_{\displaystyle (1/6)^0 \cdot (5/6)^3} = 0.5787 \\ \hline 1 & SFF & (1/6) \cdot (5/6) \cdot (5/6) & \\ & FSF & (5/6) \cdot (1/6) \cdot (5/6) & \\ & FFS & (5/6) \cdot (5/6) \cdot (1/6) & \underbrace{3}_{\displaystyle \binom{3}{1}} \cdot \underbrace{(1/6) \cdot (5/6)^2}_{\displaystyle (1/6)^1 \cdot (5/6)^2} = 0.3472 \\ \hline 2 & SSF & (1/6) \cdot (1/6) \cdot (5/6) & \\ & SFS & (1/6) \cdot (5/6) \cdot (1/6) & \\ & FSS & (5/6) \cdot (1/6) \cdot (1/6) & \underbrace{3}_{\displaystyle \binom{3}{2}} \cdot \underbrace{(1/6)^2 \cdot (5/6)^1}_{\displaystyle (1/6)^2 \cdot (5/6)^1} = 0.0694 \\ \hline 3 & SSS & (1/6) \cdot (1/6) \cdot (1/6) & \underbrace{1}_{\displaystyle \binom{3}{3}} \cdot \underbrace{(1/6)^3}_{\displaystyle (1/6)^3 \cdot (5/6)^0} = 0.0046 \\ \hline & & & \sum = 1.0000 \end{array} \]


The probability of \(k\) successes in \(n\) independent trials:

\[ \begin{align} \displaystyle P(x) &= \binom{n}{x}(p)^x(1-p)^{n-x} \\ &= \underbrace{\dfrac{n!}{x!(n-x)!}}_{\text{The number of outcomes with} \\ \text{exactly x successes among n trials }} \times \underbrace{(p)^x(1-p)^{n-x}}_{\text{ The probability of x successes among} \\ \text{ n trials for any particular order}} \end{align} \] where,

\[ \begin{cases} x = 0,1,2,......,n \\ n = \text{ number of trials} \\ k = \text{ number of successes among } n \text{ trials} \\ p = \text{ probability of success in any one trial} \end{cases} \]


When \(k = 0,\) the chance of no successes (in other words, the chance of \(n\) failures in a row) is

\[ \frac{n!}{0!(n)!}p^0(1-p)^{n} = (1-p)^n \]


Question: What is the probability that the first success will be observed after \(n\) trials?

This means the first success is observed on \((n+1)^{th}\) trial.

\[ \underbrace{F}_{ 1^{st} } FF.......FF \underbrace{F}_{ n^{th} } \underbrace{S}_{ n+1 } \\ \underbrace{FFF.......FFF}_{ \displaystyle \binom{n}{0} (p)^0(1-p)^n } \underbrace{S}_{\times p } \\ = (1-p)^n \cdot p \\ \text{ (This is called geometric distribution.)} \]

Sampling with Replacement

A random number generator draws at random with replacement from the ten digits \(0,1,2,3,4,5,6,7,8,9. \space\) Run the generator 20 times.

Find the chance that \(0\) appears once.

Solution:

\[binomial: n=20, p = 0.1, k = 1 \implies \binom{20}{1}(0.1)^1(0.9)^{19} = 0.2702\]

Find the chance that \(0\) appears at most once.

Solution:

\[binomial: n=20, p = 0.1, k = (0,1) \\ \implies \binom{20}{0}(0.1)^0(0.9)^{20} + \binom{20}{1}(0.1)^1(0.9)^{19} = 0.3917\]

Find the chance that \(0\) appears more than once.

Solution:

\[binomial: n=20, p = 0.1, k = (2,3,...,20) \\ \implies 1-P(k=0,1)= (1- 0.3917) = 0.6083.\]


Exercise: Cytomegalovirus (CMV) is a virus that infects one half of young adults. If a random sample of \(10\) young adults is taken, find the probability that between \(30\%\) and \(40\%\) (inclusive ) of those sampled will have CMV.

Solution:

\[\begin{align} P((.3)(10) \le X \le (.4)(10)) &= P(X=3) + P(X=4) \\ &= \binom{10}{3}(.5)^3(.5)^7 + \binom{10}{4}(.5)^4(.5)^6 \\ &= 0.1172 + 0.2051 \\ &= 0.3223 \end{align}\]

5.6.1 Expected Value of the Binomial Distribution

\(X:\) number of successes with probability \(p\) of success on each trial.

Probability distribution table for \(X:\)

\[ \begin{array}{c|c|c|c|c|c|c} x & 1 & 0 \\ \hline P(x) & p & 1-p \end{array} \]

\[ \begin{align} \text{Average successes per trial: } E(X) &= 1 \times p + 0 \times (1-p) = p \\ \text{Expected total successes from } n \text { trials: } E(X) &= \sum_{i=0}^n x_i \cdot p(x_i) \\ &= np \end{align} \]


Example: From the probability distribution of the number of observing \(1\) in \(3\) independent Rolls of a dice:

\[ \begin{align} &E(X) \\ &= (0) \times (0.5787) + (1) \times (0.3472) + (2) \times (0.0694) + (3) \times (0.0046) \\ \text{alternatively,} \\ &= 3 \times \dfrac{1}{6} \\ &= 0.5 \end{align} \]


5.6.2 Standard Deviation of the Binomial Distribution

\(X:\) number of successes with probability \(p\) of success on each trial.

Probability distribution table for \(X:\)

\[ \begin{array}{c|c|c|c|c|c|c} x & 1 & 0 \\ \hline P(x) & p & 1-p \end{array} \]

\[ \begin{align} \text {for one trial} \\ VAR(X) &= (1-p)^2 \times p + (0-p)^2 \times (1-p) \\ &= (1-p)^2 \times p + p^2 \times (1-p) \\ &= p(1-p)(1-p+p) \\ &= p(1-p) \\ \\ SD(X) &= \sqrt{p(1-p)} \\ \\ \text {for } n \text{ trials} \\ VAR(X) &= \sum_{i=0}^n (x_i - E(X))^2 \cdot p(x_i) \\ &= np(1-p) \\ \\ SD(X) &= \sqrt{np(1-p)} \end{align} \]


Example: From the probability distribution of the number of observing \(1\) in \(3\) independent Rolls of a dice:

\[ \begin{align} &VAR(X) \\ &= (0 - 0.5)^2 \times (0.5787) + (1-0.5)^2 \times (0.3472) + (2-0.5)^2 \times (0.0694) + (3-0.5)^2 \times (0.0046) \\ \text{alternatively,} \\ &= 3 \times \dfrac{1}{6} \times \dfrac{5}{6} \\ &= 0.4167 \\ \\ SD(X) &= \sqrt{0.4167} = 0.6455 \end{align} \]


Example: Calculate expected total number of heads from \(100\) tosses

\(X\) is a binomial distributed random variable with parameters \(n=100\) and \(p=0.5\)

\[ \begin{align} P(x=k) &= \binom{100}{k}(0.5)^k(1-0.5)^{100-k}, k = 0,1,2,...,100 \\ \\ E(X) &= 100 \times 0.5 \\ &= 50 \\ \\ SE(X) &= \sqrt{100 \times 0.5 \times 0.5} \\ &= 5 \end{align} \]

Exercise Calculate expected total number of sixes from \(100\) rolls

\(X\) has the binomial distribution with parameters \(n=100\) and \(p=1/6\)

Solution: \[ \begin{align} P(X=k) &= \binom{100}{k}(1/6)^k(1-1/6)^{100-k}, k = 0,1,2,...,100 \\ \\ E(X) &= 100 \times 1/6 \\ &= 16.7 \\ \\ SE(X) &= \sqrt{100 \times 1/6 \times 5/6} \\ &= 3.73 \end{align} \]

5.7 Example Problems

1. In a state’s pick 3 lottery game, a player pays \(\$1.46\) to select a sequence of three digits (from \(0\) to \(9\)), such as \(822\). If someone selects the same sequence of three digits that are drawn, the player collects \(\$499.38\).

(a) How many different selections are possible?

\[ 10 \times 10 \times 10 = 1000 \]

(b) What is the probability of winning?

\[ \dfrac{1}{1000} \]

(c) If a player wins, what is their net profit?

\(\$499.38 - \$1.46 = \$497.92\)

(d) Find the expected value.

\(E(X) = \$499.38 \times \dfrac{1}{1000} + (-\$1.46) \left( 1 - \dfrac{1}{1000}\right) = -\$0.96\)


2. Based on data from Bloodjournal.org, \(10\%\) of women 65 years of age or older suffer from anemia. In tests of anemia, blood samples from 8 women in that age group are combined. What is the probability that the combined sample tests positive for anemia?

For the combined sample to be tested positive, at least one of the eight samples has to be tested positive.

\[ \begin{align} P(X \ge 1) &= 1 - P(X = 0) \\ &= 1- \displaystyle \binom {8}{0}(.1)^0(1-.9)^{8-0} \\ &= 0.570 \\ \end{align} \] Therefore, there is a \(57\%\) probability that the combined sample will be tested positive for anemia.


3. The MedAssist Pharmaceutical Company receives large shipments of aspirin tablets and uses this acceptance sampling plan: Randomly select and test \(40\) tablets, then accept the whole batch if there is only one or none that doesn’t meet the required specifications. If one shipment of \(5000\) aspirin tablets actually has a \(3\%\) rate of deficits, what is the probability that this whole shipment will be accepted?

Solution:

Given,

\[ \begin{align} n &= 40 \\ p &= 0.03 \end{align} \]

The whole shipment will be accepted if the sample has at most one does not meet the standard.

\[ \begin{align} P(X \le 1) &= P(X = 0) + P(X = 1) \\ &= \binom{40}{0}(0.03)^0(1-0.03)^{40} + \binom{40}{1}(0.03)^1(1-0.03)^{39} \\ &= 0.6615 \end{align} \]

Therefore, there is a \(66.15\%\) chance that the whole shipment will be accepted.


4. In a poll conducted by the General Social Survey, \(75\%\) of respondents said that their jobs were sometimes or always stressful. Eleven workers are chosen at random. Round the answers to at least four decimal places.

(a) What is the probability that exactly \(10\) of them find their job stressful\(?\)

Solution: \[ \begin{align} p &= 0.75 \\ n &= 11 \\ \displaystyle P(x = 10) &= \binom{11}{10}(.75)^{10}(1-.75)^{11-10} = 0.1549 \end{align} \]

(b) What is the probability that more than \(8\) find their jobs stressful\(?\)

Solution:

\[ \begin{align} \displaystyle P(x \gt 8) &= P(x = 9) + P(x = 10) \\ &= \binom{11}{9}(.75)^9(1-.75)^{11-9} + \binom{11}{10}(.75)^{10}(1-.75)^{11-10} \\ &= 0.4552 \\ \\ &\text{Alternatively,} \\ \displaystyle P(x \gt 8) &= 1 - P(x \le 8) \\ &= 1 - 0.5448 \\ &= 0.4552 \end{align} \]

(c) What is the probability that fewer than \(4\) find their jobs stressful\(?\)

Solution:

\[ \begin{align} \displaystyle P(x \lt 4) &= P(x = 0) + P(x = 1) + P(x = 2) + P(x = 3) \\ &= \binom{11}{0}(.75)^0(1-.75)^{11-0} + \binom{11}{1}(.75)^{1}(1-.75)^{11-1} + \\ & \binom{11}{2}(.75)^2(1-.75)^{11-2} + \binom{11}{3}(.75)^3(1-.75)^{11-3} \\ &= 0.0012 \\ \end{align} \]


5. In a litter of seven kittens, three are female. You pick two kittens at random without replacement.

(a) Create a probability model for the number of male kittens you get.

\[ \begin{array} {l|c|c|c} \text{number of males }(X) & 0 & 1 & 2 \\ \hline P(X) & \dfrac{3}{7} \cdot \dfrac{2}{6} = \dfrac{6}{42} & \dfrac{4}{7} \cdot \dfrac{3}{6} + \dfrac{3}{7} \cdot \dfrac{4}{6} = \dfrac{24}{42} & \dfrac{4}{7} \cdot \dfrac{3}{6} = \dfrac{12}{42} \\ \hline \end{array} \\ \]

(b) Find the expected number of male kittens.

\[ E(X) = 0 \cdot \dfrac{6}{42} + 1 \cdot \dfrac{24}{42} + 2 \cdot \dfrac{12}{42} = 1.14 \text { males} \\ \]

(c) Find the standard deviation of the number of male of kittens.

\[ \begin{align} \sigma^2 &= (0-1.14)^2 \cdot \dfrac{6}{42} + (1-1.14)^2 \cdot \dfrac{24}{42} + (2-1.14)^2 \cdot \dfrac{12}{42} = 0.4082 \\ \\ \sigma &= \sqrt {0.4082} = 0.64 \text { males} \end{align} \]


6. Since the stock market began in 1872, stock prices have risen in about \(73\%\) of the years. Assuming that market performance is independent from year to year, what is the probability that the market will fall during at least one of the next \(5\) years?

Solution:

In any given year, the probability that the market will fall \(= 1 - 0.73 = 0.27\)

\[P\text{(market will fall at least once in 5 yr.)} \\ = 1 - P\text{(market will not fall at all in 5 yr.)}\]

\[ \begin{align} P(X \ge 1 ) &= 1 - P(X = 0) \\ &= 1 - \binom{5}{0}(0.27)^0(1- 0.27)^5 \\ &= 0.7927 \end{align} \] Hence, there is \(79.27\%\) probability that the market will fall during at least one of the next \(5\) years


7. The Center for Disease Control and Prevention say that about \(30\%\) of high school students smoke tobacco. Suppose you randomly select high school students to survey them on their attitudes toward scenes of smoking in the movies. What is the probability that there are no more than \(2\) smokers among \(10\) people you randomly choose?

Solution:

Given,

\[ \begin{align} n &= 10 \\ p &= 0.3 \end{align} \]

\[ \begin{align} P(X \le 2) &= P(X = 0) + P(X = 1) + P(X = 2) \\ &= \binom{10}{0}(.3)^0(0.7)^{10} + \binom{10}{1}(.3)^1(0.7)^{9} + \binom{10}{2}(.3)^2(0.7)^{8} \\ &= 0.3828 \end{align} \] Therefore, there is \(38.38\%\) probability that there are no more than \(2\) smokers among \(10\) people chosen.


8. Your statistics test has \(10\) multiple choice questions each with \(5\) answer choices.

(a) If you randomly select an answer from each question, what is the probability that you will pass the exam?

Solution:

\[ \begin{align} p &= \dfrac{1}{5} = 0.2 \\ n &= 10 \\ P(X \ge 6) &= 1 - P(X \le 5) = 0.0064 = 0.64\% \end{align} \] \(\therefore\) there is less than \(1\%\) chance of passing the test by random guessing.

(b) What are the expected value and standard deviation of your test score?

Solution:

\[ E(X) = np = 10 \cdot 0.2 = 2 \\ SD(X) = \sqrt{np(1-p)} = \sqrt{10 \cdot 0.2 \cdot 0.8 } = 1.265 \]