Intro

At this point, we’ve learned quite a few statistical tests. Here’s a recap:

Commonalities

All of the tests have a few things in common.

  • They all involve some well-formulated hypothesis - a null hypothesis \(H_0\) vs an alternative hypothesis \(H_A\).
  • Of course, they all involve data; the general question is, “do the data support the null hypothesis or the alternative”?
  • The precise formulation of the general question involves a \(p\)-value:
    • The probability of observing data at least as favorable to the alternative hypothesis as our current data set, if the null hypothesis is true.
    • The smaller the \(p\)-value, the less viable the null hypothesis.

Differences

Perhaps the most obvious difference centers on the type of data being considered: numerical vs categorical.

There are other differences too, though, and understanding these helps you know which test to apply in a given situation.

\(z\)-tests

The \(z\)-tests are the first and most basic hypothesis tests that we meet.

The \(z\)-test for means

The \(z\)-test for means deals with numerical data. In the simplest case, we have one data sample - just a list of numbers.

The hypothesis test

The question is - do the data support the hypothesis that the mean of the population from which they were drawn is some particular number? If our data has sample mean \(\bar{x}\) and we suspect the population mean is \(\mu_0\), then our two-sided hypothesis can be written

  • \(H_0\): \(\mu=\mu_0\)
  • \(H_A\): \(\mu\neq\mu_0\)

A one-sided hypothesis can be written with a greater than or less than sign, rather than a not equal sign.

Conditions to check

  • Random sample of numeric data
    • Need less than 10% of population for independence
  • A large enough sample
    • Typically, at least 30 observations

The \(z\)-score

The \(z\)-score for our mean is \[Z = \frac{\bar{x} - \mu_0}{\sigma/\sqrt{n}}.\] We then compare this against the standard normal distribution to compute the \(p\)-value.
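
As a quick illustration (not from the notes), here’s a minimal sketch in Python with made-up data and an assumed known population standard deviation; scipy is used only to look up the normal tail probability.

```python
import numpy as np
from scipy import stats

# Hypothetical sample with an assumed known population standard deviation.
rng = np.random.default_rng(1)
data = rng.normal(loc=50.6, scale=1.5, size=40)   # made-up sample, n >= 30
mu_0 = 50.0        # hypothesized population mean
sigma = 1.5        # assumed known population standard deviation

z = (data.mean() - mu_0) / (sigma / np.sqrt(len(data)))
p_value = 2 * stats.norm.sf(abs(z))    # two-sided p-value from the standard normal
print(z, p_value)
```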

There are examples in our notes from 6/19.

The \(z\)-test for proportions

This is very much like the \(z\)-test for means, but we are dealing with proportions of categorical data. We often think of this in terms of a random variable \(X\) that is binomially distributed; thus, we need the mean and standard deviation of the sample proportion \(\hat{p} = X/n\), which come from the binomial distribution after dividing through by \(n\):

\[\begin{align} \mu &= p &\sigma^2 &= p(1-p)/n &\sigma &= \sqrt{p(1-p)/n} \end{align}\]

Our hypothesis can be written

\[\begin{align} H_0 : p=p_0 \\ H_A : p \neq p_0 \end{align}\]
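
Here’s a hypothetical sketch of this test in Python; the counts are made up, and the standard error is computed using the null value \(p_0\).

```python
import numpy as np
from scipy import stats

# Made-up counts: 540 successes out of 1000 trials.
successes, n = 540, 1000
p_0 = 0.5                              # hypothesized population proportion

p_hat = successes / n
se = np.sqrt(p_0 * (1 - p_0) / n)      # standard error under the null hypothesis
z = (p_hat - p_0) / se
p_value = 2 * stats.norm.sf(abs(z))    # two-sided p-value
print(z, p_value)
```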

There’s an example at the end of our notes from 6/19.

The \(t\)-tests

The \(t\)-tests are very much like the \(z\)-tests. The primary difference is that the \(t\)-test is applicable to smaller data sets.

The \(t\)-test for one sample mean

In the simplest case, we again have one data sample, which is just a list of numbers.

Again, this data set can be relatively small - less than 30.

We use a different distribution - the \(t\)-distribution. There is a whole family of these, though, indexed by a degrees of freedom parameter, which is just the sample size minus one.

In addition, it’s more important that the population be normally distributed.
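
Here’s a minimal sketch with a small made-up sample; scipy’s ttest_1samp handles the \(t\)-distribution and the degrees of freedom for us.

```python
import numpy as np
from scipy import stats

# Made-up small sample (n < 30) from a roughly normal population.
data = np.array([9.8, 10.4, 10.1, 9.6, 10.7, 10.2, 9.9, 10.5, 9.7, 10.3])
mu_0 = 10.0    # hypothesized population mean

# ttest_1samp returns the t statistic and the two-sided p-value,
# using n - 1 = 9 degrees of freedom behind the scenes.
t_stat, p_value = stats.ttest_1samp(data, mu_0)
print(t_stat, p_value)
```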

There are examples illustrating the \(t\)-test for one sample mean in our notes from 6/22.

The \(t\)-test for paired data

We use this when we have two data sets that are paired in a natural way; that is, each data point in one set corresponds to a particular data point in the other set.

Such a data set can be translated to a single data set by simply subtracting the data sets pair-wise.

Our hypothesis test looks like

\[ \begin{array}{ll} H_0: & \mu_1 = \mu_2 \\ H_A: & \mu_1 \neq \mu_2 \\ \end{array} \]
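
A hypothetical sketch in Python: the before/after numbers are made up, and the paired test agrees exactly with a one-sample \(t\)-test applied to the pair-wise differences.

```python
import numpy as np
from scipy import stats

# Made-up paired measurements, e.g. before and after some treatment.
before = np.array([12.1, 11.8, 13.0, 12.4, 11.5, 12.9, 12.2, 11.9])
after  = np.array([11.6, 11.9, 12.2, 11.8, 11.1, 12.4, 11.7, 11.5])

# The paired t-test is the same as a one-sample t-test on the differences.
t_stat, p_value = stats.ttest_rel(before, after)
t_alt, p_alt = stats.ttest_1samp(before - after, 0.0)
print(t_stat, p_value)    # identical to (t_alt, p_alt)
```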

There are some examples of this in our notes from 6/27.

The \(t\)-test for two sample means

We use this when we have two data sets that are independent of one another.

If the sets have sizes \(n_1\) and \(n_2\), we analyze the difference of the two means using a \(t\)-test with

  • Mean \(\bar{x}_1 - \bar{x}_2\),
  • Standard error \[\sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}},\]
  • and we use the minimum of \(n_1-1\) and \(n_2-1\) as the degrees of freedom.

Our hypothesis test again looks like

\[ \begin{array}{ll} H_0: & \mu_1 = \mu_2 \\ H_A: & \mu_1 \neq \mu_2 \end{array} \]
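
Here’s a hedged sketch in Python with made-up groups. Note that scipy’s Welch version of the test uses a slightly more generous degrees-of-freedom formula than the conservative minimum of \(n_1-1\) and \(n_2-1\) above, so its \(p\)-value can differ a little from a hand computation.

```python
import numpy as np
from scipy import stats

# Made-up independent samples of different sizes.
group1 = np.array([23.1, 25.4, 24.2, 26.0, 23.8, 24.9, 25.7, 24.4])
group2 = np.array([21.9, 23.0, 22.4, 23.6, 22.1, 23.3])

# equal_var=False requests the unpooled (Welch) standard error shown above.
t_stat, p_value = stats.ttest_ind(group1, group2, equal_var=False)
print(t_stat, p_value)
```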

There are some examples of this in our notes from 6/27.

ANOVA

ANOVA, or Analysis of Variance, is used to compare some statistic (just the mean for us, though proportion is natural too) across several groups.

Our hypothesis test looks like

\[ \begin{array}{ll} H_0: & \mu_1 = \mu_2 = \cdots = \mu_k \\ H_A: & \text{at least two } \mu_i\text{s are different} \end{array} \]

The mathematics lurking in the background is based on a new distribution called the \(F\)-distribution.

This distribution is more complicated than the ones we’ve seen to this point, so we rely on software to carry out the computation.
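
For instance, scipy’s f_oneway function runs the whole computation; the three groups below are made up.

```python
import numpy as np
from scipy import stats

# Made-up measurements from three groups.
group_a = np.array([5.1, 4.8, 5.5, 5.0, 4.9])
group_b = np.array([5.6, 5.9, 5.4, 6.1, 5.7])
group_c = np.array([4.7, 4.5, 5.0, 4.6, 4.8])

# One-way ANOVA: returns the F statistic and its p-value from the F-distribution.
f_stat, p_value = stats.f_oneway(group_a, group_b, group_c)
print(f_stat, p_value)
```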

There are some examples in our notes of 7/5.

The \(\chi^2\)-test

The chi-square test is a method for assessing a model when the data are binned.

The one-way test

In this situation, we have two data sets, call them

  • \(O_1\), \(O_2\), …, \(O_k\), which represents observations in \(k\) categories and
  • \(E_1\), \(E_2\), …, \(E_k\), which represents expected counts in \(k\) categories.

Our hypothesis test looks like

  • \(H_0\): The observations are representative of the expected counts
  • \(H_A\): The observations are not representative of the expected counts

We then compute the \(\chi^2\) statistic \[\chi^2 = \frac{(O_1 - E_1)^2}{E_1} + \frac{(O_2 - E_2)^2}{E_2} + \cdots + \frac{(O_k - E_k)^2}{E_k}\] and use the \(\chi^2\) distribution with \(k-1\) degrees of freedom.
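
Here’s a small made-up example in Python; scipy’s chisquare function computes the statistic above and the \(p\)-value using \(k-1\) degrees of freedom.

```python
import numpy as np
from scipy import stats

# Made-up observed and expected counts in k = 4 categories (same totals).
observed = np.array([18, 22, 29, 31])
expected = np.array([25, 25, 25, 25])

# chisquare computes the statistic above and a p-value with k - 1 = 3 degrees of freedom.
chi2, p_value = stats.chisquare(observed, f_exp=expected)
print(chi2, p_value)
```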

There are some examples in our notes of [7/7](07.07.17.html).

Linear regression

Linear regression is a topic that spans much more than just hypothesis testing. There is an important hypothesis test that arises from linear regression, though.

In linear regression, we have two data samples \(x_1,\ldots,x_k\) and \(y_1,\ldots,y_k\). The question is - are they related?

The hypothesis test can be stated in terms of the slope \(\beta_1\) of the regression line:

\[ \begin{array}{ll} H_0: & \beta_1 = 0 \\ H_A: & \beta_1 \neq 0 \end{array} \]
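
As a sketch with made-up \((x,y)\) pairs, scipy’s linregress reports the slope of the regression line together with the two-sided \(p\)-value for exactly this test.

```python
import numpy as np
from scipy import stats

# Made-up (x, y) data.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0])
y = np.array([2.1, 2.9, 3.8, 5.2, 5.9, 7.1, 7.8, 9.2])

# linregress returns the slope and intercept of the regression line, along with
# the two-sided p-value for testing whether the slope is zero.
result = stats.linregress(x, y)
print(result.slope, result.pvalue)
```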