AP Statistics

Section 11.2

“Comparing Two Means”

---

 

Last section, we looked at inference based on one sample or on the differences in a matched pairs design (still one set of data).  We will now look at having TWO samples/sets of data.  This is a much more common situation than just having one set!

 

Look at Example 11.9 on p. 648 in your text for examples of two-sample problems.

 

---

 

In order for a comparison of two means to be valid (i.e. in order for us to use two-sample inference reliably), we need to address the following two assumptions:

 

(1)      There should be 2 INDEPENDENT SRS’s!

          **If the SRS’s are not independent, then DO NOT USE the t-test!  The t-test is NOT robust in this sense!

 

(2)     The two populations should be approximately normally distributed.

          **You can check this with graphs (dotplot, stemplot, histogram, etc.).

          If this assumption is violated, you can still proceed with the t-test.  It IS robust to this assumption!

 

In examining 2 different populations or samples, we will use subscripts to distinguish between the two.  (Because it often does not matter which you label as population 1 and population 2, it is IMPERATIVE that you clearly state which is which in the beginning of the problem!)

 

Population    Variable     Mean      St. Dev.       Sample Size   Sample Mean        Sample St. Dev.

 

          1            x1                                                  n1                                                    s1

 

          2            x2                                                 n2                                             s2

 

 

---

 

To compare two means, we can perform inference in two ways …

(1)      Establish a CI for their difference   

 

(2)     Test a hypothesis of no difference

          H0:                     OR               H0:      

          (Notice these two statements are equivalent.)

 

---

Just as we used  as an unbiased estimator of , we will use  as an unbiased estimator of .

 

Using the rules of means & variances for random variables that we examined in Chapter 7, we can examine characteristics of the sampling distribution of .

 

          (1)      is the mean of the distribution.

 

(2)     The variance of  is the SUM of the individual variances (and the standard deviation is the square root of this):

                                     

          *This is coming from each individual standard error being = to  , and therefore, each individual variance being = to .

 

          **Also remember that variances add – standard deviations (standard errors) to not!  To combine two populations/samples, you must find the  variances first, then add, and then finally take the square root for the difference’s standard deviation/error.

(3)     The distribution of  is normal if both populations are normal OR if the sample size is large enough for the Central Limit Theorem to “kick in” (to become evident).

 

---

Because of the normal distribution, if  and  are known, we can standardize using a

z test statistic:

 

One sample z:                 

 

 

Two-sample z:                

 

Since it is not realistic to say that we would know  and , we will need to use s1 and s2 as estimates.  As a result, we will need to find a t test statistic instead:

 

 

The Two-Sample t:         

 

 

NOTE:         When we introduced s in place of , we introduced more uncertainty and ended up with a t distribution with n-1 degrees of freedom instead of a z distribution (Standard Normal).

 

When we introduce two ’s, we introduce even more uncertainty … 

For this reason, the t distribution for two samples is not exactly the same as the t distribution for one sample.  The two-sample distribution is a t distribution with degrees of freedom given by:

 

 

 

instead of df = n – 1.

 

*If you use your calculator to run a two-sample t test, it will produce results based on this accurate number of degrees of freedom.

 

**If you choose to use the one-sample t-table & do it by hand, you will want to choose the smaller degrees of freedom between n1 – 1 and n2 – 1.  This will provide a conservative estimate of the p-value. 

 

(Better to overestimate your error and p-values, than underestimate it!  Remember, lower degrees of freedom imply smaller sample sizes, which imply more variability – more area in the tails.) 

 

---

 

The College Board will accept BOTH methods as correct on the AP Exam.  J

 

---

 

The t procedures for two-samples will remain basically the same as those for the one-sample t:

 

          CI:             

 

 

For hypothesis testing, the two-sample t test statistic:

                                                         

 

          Notice that the is missing from the two-sample t test statistic above.  The reason for this is that the hypothesis H0 suggests that , which means that , which means that you would be subtracting 0 when you are subtracting .

 

(EX)   The heights (measured in inches) of 20 randomly selected women & 30 randomly selected men were independently obtained from a student body of a certain college in order to estimate the difference in their mean heights.

 

Assume the heights are approximately normally distributed for both populations.  Here is the sample data:

 

POPULATION                 n                  x-bar           s

1:  Male                           30               69.8            1.92

2:  Female                       20               63.8            2.18

 

Find a 95% Confidence Interval for the difference between mean heights, .

 

**Do by hand first.

**With calc:  STAT, TESTS, 2SampleTInt

Notice that with the calculator, you are asked whether or not you want the variances “pooled.”

 

What this means

If population variances are equal (if ), then the 2 sample variances can be averaged or “pooled” to estimate the common population variance.  The result is called the “pooled 2-sample t.”

 

Its advantage

If population variances are the same, then the distribution is exactly a t distribution with df = n1 + n2 – 2.

 

On the computer output on p. 666 of your text, the “EQUAL” choice is pooled; the “UNEQUAL” choice is not (it’s a regular 2-sample t).

 

(EX)   Suppose we are interested in comparing the academic success of college students who belong to fraternities with the academic success of those who do not belong.  The reason for the comparison centers on the recent concern that fraternity members, on average, are achieving academically at a lower level than non-fraternity students.  (Cumulative GPA is used to measure academic success.)

Random samples of size 40 are taken from each population.  Sample results …

 

POPULATION                          n                  x-bar           s

1:  Non-fraternity                     40               2.21             0.59

2:  Fraternity                           40               2.03            0.68

 

Complete a hypothesis test using .  Assume GPA’s for both groups are approximately normally distributed.

On calc:       STAT, TESTS, 2SampleTTest

 

 

ASSIGNMENT:

 

p. 649 (11.37, 11.38)

 

p. 657-658 (11.40 – 11.43)

 

p. 665-666 (11.48)