2000 Flex-Time and Flex-Mode Prob-Stat I Topic #8A Page

LEGACY CONTENT. If you are looking for Voteview.com, PLEASE CLICK HERE

This site is an archived version of Voteview.com archived from University of Georgia on May 23, 2017. This point-in-time capture includes all files publicly linked on Voteview.com at that time. We provide access to this content as a service to ensure that past users of Voteview.com have access to historical files. This content will remain online until at least January 1st, 2018. UCLA provides no warranty or guarantee of access to these files.

45-733 PROBABILITY AND STATISTICS I Topic #8A

29 February 2000

Hypothesis Testing

The Classical Theory of Two Simple Hypotheses

We have a large shipment of devices delivered to our manufacturing 
plant. Suppose we know with certainty that either the proportion of 
defective devices is either .01 or .001.  We take a random sample and 
        _Ù         _Ù
compute p.  Given p, how do we decide between .01 and .001?

In the classical theory of two simple hypotheses we denote these two possibilities as:

H₀: p = p₀
H₁: p = p₁

Where H₀: is known as the Null Hypothesis and H₁: is known as the Alternative Hypothesis.

Given a decision, there are four possibilities:

Accept H₀: and H₀: is True.
Accept H₀: and H₁: is True.
Reject H₀: and H₀: is True.
Reject H₀: and H₁: is True.

We can represent these possibilites by a two by two table:


                       True State of World 

                           H₀     H₁
                         --------------
              Accept H₀  |1 - a |  b  |  
                         |      |     |
          Decision       |------|-----|  
                         |      |     |
              Reject H₀  |  a   |1 - b|  
                         |      |     |
                         --------------

Where:
a = TYPE I Error = P[Reject H₀: | H₀: is True] and
b = TYPE II Error = P[Accept H₀: | H₁: is True]

Hypothesis Test Between Two Means for the Normal Distribution (s² is known)

H₀: m = m₀
H₁: m = m₁ > m₀

A reasonable decision rule for this problem is:
```
   _
If X_n > m₀ + c then Reject H₀:
   _
If X_n < m₀ + c then Do Not Reject H₀:
```
Where c is some constant. In most circumstances m₀ < m₀ + c < m₁.
The a and b errors are:
```
      _
a = P[X_n > m₀ + c | m = m₀]
      _
b = P[X_n < m₀ + c | m = m₁]
```

Example: Suppose n = 25, s² = 400, and a = .05, and we have the hypothesis test:

H₀: m = 100
H₁: m = 110

      _
a = P[X_n > 100 + c ] = .05 = 
   _
P[(X_n - 100)/20/5 > (100 + c - 100)/4 ] = P[Z > c/4]
Now, since P[Z > 1.645] = .05, c/4 = 1.645, and c = 6.58
      _
b = P[X_n < 106.58 | m = 110] = 
   _
P[(X_n - 110)/4 < (106.58 - 110)/4 ] = 

P[Z < -.855] = F(-.855) = 1 - F(.855) = .1967

The only way to simultaneously reduce a and b is to increase the sample size. With fixed sample size, n, reducing a causes b to increase and vice versa. There are many situations in which it is desirable to make a or b as small as possible even at the cost of greatly increasing the other error. A good example of this is disease testing:
```
                       True State of World 

                           Has   Not
                         Disease Have 
                                 Disease
                         --------------
     Patient Has Disease |1 - a |  b  |  
                         |      |     |
      Doctor's Decision  |------|-----|  
                         |      |     |
 Patient Does Not Have   |  a   |1 - b|  
                Disease  |      |     |
                         --------------  
```
Clearly, telling a patient that he/she does not have a disease when they in fact have the disease -- the Type I error -- is much more costly than telling a patient that he/she has a disease when they in fact do not have the disease -- the Type II error. In the first instance, a sick person can go infect other people and cause great harm. In the second, the harm is to scare a healthy person. Clearly, in disease testing, minimizing the a probability makes sense.

Recall that the Hypothesis Test Between Two Means for the Normal Distribution where s² is known is:

             H₀: m = m_o
             H₁: m = m₁ > m_o

     The decision rule for this problem is:

   _
If X_n > m_o + c then Reject H₀:
   _
If X_n < m_o + c then Do Not Reject H₀:

Which is equivalent to:

    _
If (X_n - m_o)/s/n^1/2 > z_a then Reject H₀:
    _
If (X_n - m_o)/s/n^1/2 < z_a then do not Reject H₀: