Is the proportion of high school seniors at Tates Creek who skip frequently greater than that of all high school seniors in the 6th district?

 

 

By Jordan Brown and Nick Collar B2

 

 

 

Disclaimer:  This study was done in an AP Statistics course with relatively small sample sizes.  The validity of such studies must always be questioned.  Please keep this in mind if you use or report the results of this study.

 

Abstract: 

          High School Seniors across the country have a habit of skipping school frequently.  The goal for this study was to determine if the proportion of seniors who skip school frequently (5 or more times in a school year) at Tates Creek is greater than the proportion of seniors who skip frequently in the 6th District high schools of Kentucky.  The outcome that was expected is that Tates Creek has a greater proportion of seniors who skip frequently.  This hypothesis was determined by observing a large number of seniors at Tates Creek who skipped frequently. 

          The population from which inferences will be drawn is all seniors at Tates Creek High School.  A sample size of 40 randomly selected seniors was used.  Using a one-proportion z-test, the sample data was compared to the proportion of seniors who skip frequently in the 6th district high schools.    The sample data showed that the proportion of seniors who skip frequently is 0.5.  Te null hypothesis in the z-test is the claim that the proportion of seniors at TC who skip frequently is the same as the proportion of seniors in the 6th district.  It was tested against an alternative hypothesis (Ha), which is that the proportion of seniors at TC who skip frequently is greater than that of seniors in the 6th district.

 

Methodology:

          To answer the question of whether or not the proportion of seniors at TC who skip frequently is greater than the proportion of seniors in the district, the proportions of each must be found.  In order to find the proportion at Tates Creek, each of the 337senior students at TC were assigned a number. (001-337) Using a random number generator, a sample size of n=40 was obtained.  Each of the students names were given to the attendance clerk at Tates Creek High School.  The clerk then gave the numbers of how many skips each student in the sample had for the 2003-2004 school year.  The names of each student were kept confidential for legal purposes and only the numbers of skips were given.  In order to find the proportion of seniors who skip frequently in the 6th district, we contacted pupil personnel at the administration office of the 6th district school system.  This information was used in our one-proportion z-test to determine if the proportion of seniors who skip often at Tates Creek is significantly greater than the district average.

Analysis and Inference:

          The first step we took in drawing inference from our data was to check the assumptions of a one-proportion z-test.  We know that our data was obtained randomly and is an SRS.  We needed to find the proportions for Tates Creek and for the 6th district.  20 of the 40 students in our SRS of Tates Creek seniors had 5 or more skips in the 2003-2004 school year.  So po is equal to 0.5. (observed # of successes divided by the total # of subjects)  The proportion of seniors who skip in the 6th district is 0.46 according to pupil personnel in the 6th district high schools.  Two more assumptions that need to be met are n(po)>10 and n(1-po)>10 in order to draw inference from our significance test.

                             40(.5)>10      and        40(1-.5)>10

Our assumptions are met.  Next, we set up our null and alternative hypothesis.  As we mentioned earlier, the null is the claim that the proportions are the same, and the alternative is the claim that the proportion of seniors who skip frequently is greater than the district proportion. 

                             Ho: p=.5

                             Ha: p<.5

 (p is the proportion of seniors in the district who skip frequently)

Our critical z score was found algebraically and through the use of a graphing calculator. 

                             Z= .5-.46   = .5076

 

By using a z table and using a graphing calculator, we were given a p-value of P=.6941.  This p-value is the probability of a result being different than the one we obtained.  Therefore, the smaller the p-value is, the stronger evidence there is against the null hypothesis.  P=.6941 is very large though!

We cannot call our data statistically significant at an   =.05 level.  This is because 0.6941>0.05.  

          Because our data is not statistically significant at an    =.05 significance level, we fail to reject the null hypothesis.  This means that there is not significant evidence to support the claim that the proportion of seniors at Tates Creek who skip frequently is greater than the proportion of all seniors in the 6th district high schools.  Although the sample proportion we obtained is larger than the district proportion, sampling variability could have accounted for this increase.  We would need to take much larger sample sizes and replicate the test many times in order to get more accurate results to draw inference from.