PEP 6305 Measurement in Health & Physical Education

 

Topic 8: Hypothesis Testing

Section 8.1

 

n   This Topic has 3 Sections.

 

Reading

n   Vincent & Weir, Statistics in Kinesiology, 4th ed. Chapter 10 “The t Test: Comparing Means from Two Sets of Data”

Objective

n   This topic introduces the basic concepts of hypothesis testing.

n   It is more important to understand these concepts than to memorize the formulas and application of t tests.

¨  t tests, the focus of the chapter, are conceptually similar to Z-tests, except the statistic (t) is compared to the t distribution instead of comparing Z to the normal distribution.

¨  The t distribution is similar the normal distribution, except for an adjustment for samples with N < 120.

¨  Any hypothesis that can be tested with a t test can also be tested with analysis of variance (ANOVA), which is our next Topic (ANOVA and t test conclusions will always be exactly the same). ANOVA is more general than t tests because ANOVA can test a variety of other hypotheses.

¨  You can compare two groups (independent t test) or compare two measures in one group (paired t test) in R Commander, under Statistics>Means>Independent samples t-test... See the R Help menus for instructions.

Note

n   This topic is fairly long and detailed; please allow yourself sufficient time to study and complete it.

 

Overview: Research Hypothesis vs. Null Hypothesis

 

n   Recall from Topic 1 that a research hypothesis is a potential answer to a research question about a well-defined problem.

¨  The research hypothesis is developed from a thorough review of the literature; it is not “a guess.”

·       Ideally, the research hypothesis states the direction (see the Two-Tailed and One-Tailed Tests in the following section) and the size (see the Effect Size in the following section) of the hypothetical effect.

¨  The research hypothesis is not tested by statistical analysis. Statistical analyses test the null hypothesis.

¨  The null hypothesis is written so that if it is true then the research hypothesis cannot also be true.

·       The null hypothesis directly contradicts the research hypothesis. Both hypotheses cannot be true.

·       Data that show that the null hypothesis is probably false (see the Type I and Type II Errors in the following section), support (do not "prove") the research hypothesis.

¨  For example, you compare a treatment group to a non-treatment (control) group. The research hypothesis is that the treatment will improve outcome in the treatment group. The null hypothesis is that the treatment group will have the same or worse outcome as the control group.

·       In symbolic terms, for this example the research hypothesis is: Treatment (T) – Control (C) > 0, or T – C > 0. The outcomes in T will be higher (better) than the outcomes in C. If we rearrange the terms so that T is on the left of the greater than sign (add C to the right-hand side), then T > C is another way to state the research hypothesis.

·       The null hypothesis is: T – C ≤ 0. The null hypothesis has to account for everything not accounted for in the research hypothesis. Since the research hypothesis is T > C, the null hypothesis has to account for both T = C and T < C. Thus, T ≤ C.

·       The statements T > C and T ≤ C cannot both be true. T must either “greater than” or “less than or equal” to C.

·       If a statistical test shows a low error probability (p ≤ 0.05) assuming that T ≤ C (the null hypothesis) is true, then T > C (the research hypothesis) is more likely to be correct and is thus supported.

·       If a statistical test shows a high error probability (p > 0.05) assuming that T ≤ C (the null hypothesis) is true, then T > C  (the research hypothesis) is less likely to be correct and is thus not supported.

·       Suppose a statistical test of the data show that the error probability of the null hypothesis is p = 0.009; this means that the groups are likely to actually have different outcomes.

·       You conclude that the evidence supports the research hypothesis (treatment improves outcomes).

n   The null hypothesis is tested by comparing an observed value (computed from sample data) to a statistical distribution.

¨  The investigator designs a study to manipulate the conditions in order to produce data (evidence) to support the research hypothesis.

¨  The investigator then collects data and computes a statistical value. This value is the observed value of the statistic.

¨  The investigator identifies the statistical distribution that provides the sampling distribution of the statistic if the null hypothesis is true.

¨  The observed statistical value is compared to the statistical distribution to determine what percent of values are more extreme than the observed value (see the Two-Tailed and One-Tailed Tests in the following section).

¨  If the error probability of the observed value is low (<0.05), then the conclusion is that the investigator changed the conditions “significantly”. The null hypothesis is rejected, providing support for the research hypothesis since it is a more likely explanation for the data than the null.

·       What is the conclusion if the null hypothesis is not rejected?

·       The decision to reject or not reject the null hypothesis can never be made with certainty, although the probability of making an incorrect decision can be estimated (see the Type I and Type II Errors and the Power and Sample Size in the following sections).

n   Maybe another example would be useful? After reviewing the literature, an investigator states this research hypothesis: Daily aerobic exercise changes serum cholesterol level. The null hypothesis is that daily aerobic exercise has no effect on serum cholesterol.

¨  The investigator measures serum cholesterol in a sample of subjects; the mean serum cholesterol is the same in both groups.

¨  The investigator randomly assigns the subjects to two groups.

¨  The first group receives a treatment: aerobic activity for 30 minutes per day for six weeks. The second group receives no treatment.

·       Treatment (exercise or nothing) is the condition manipulated by the investigator (it is the independent variable). No other condition is manipulated (all other conditions are the same).

¨  At the end of six weeks, the investigator again measures serum cholesterol.

¨  The investigator computes a statistical value from the data (such as to compare the change over time between groups).

¨  The observed value is compared to the distribution of that statistic which would occur if the serum cholesterol were equal in both groups.

¨  This comparison shows that the observed statistical value is found to have an error probability of 0.003, which is <0.05.

·       The observed value would only occur 3 times in 1000 such studies if serum cholesterol were really equal in both groups.

¨  The investigator rejects the null hypothesis.

·       It is more likely that the serum cholesterol levels in the groups differed.

¨  The analysis thus supports the research hypothesis that daily aerobic exercise changes serum cholesterol.

·       Aerobic exercise must have affected serum cholesterol because the subjects were randomly selected and randomly assigned to groups, and all other conditions during the study were the same for all subjects in both groups.

 

n   Let's discuss the concepts underlying hypothesis testing in more detail...

 

Click to go to the next section (Section 8.2)