All Flashcards

What is the formula for calculating expected counts in a Chi-Square test?

Expected Count = (Row Total * Column Total) / Grand Total

What is the formula for the Chi-Square test statistic?

$\chi^2 = \sum \frac{(O_i - E_i)^2}{E_i}$ where O is observed and E is expected.

How do you calculate degrees of freedom for a Chi-Square test?

df = (number of rows - 1) * (number of columns - 1)

What is the Chi-Square Test for Independence?

A test used to determine if there is a significant association between two categorical variables within a single population.

What is the Chi-Square Test for Homogeneity?

A test used to determine if the distribution of a categorical variable is the same across two or more populations or treatments.

Define 'expected counts' in a Chi-Square test.

The counts we would expect in each cell of a contingency table if the null hypothesis were true. Calculated as (Row Total * Column Total) / Grand Total.

What is a null hypothesis (H0) in the context of a Chi-Square test for independence?

There is no association between two categorical variables (they are independent).

What is a null hypothesis (H0) in the context of a Chi-Square test for homogeneity?

There is no difference in the distribution of a categorical variable across populations/treatments.

What is an alternative hypothesis (Ha) in the context of a Chi-Square test for independence?

There is an association between two categorical variables (they are dependent).

What is an alternative hypothesis (Ha) in the context of a Chi-Square test for homogeneity?

There is a difference in the distribution of a categorical variable across populations/treatments.

Explain the concept of the 10% condition in the context of Chi-Square tests.

When sampling without replacement, ensure the sample size (n) is less than 10% of the population size (N) to maintain independence: n < 0.10N.

Explain the concept of 'large counts' condition.

All expected counts must be at least 5 to ensure the Chi-Square test is valid.

Explain the importance of random sampling in Chi-Square tests.

Random sampling ensures that the sample is representative of the population, which is a fundamental assumption for the validity of the test.

Explain the importance of stating hypotheses in context.

Contextual hypotheses clearly define the variables and populations being studied, making the results more meaningful and interpretable.

Explain the role of the p-value in a Chi-Square test.

The p-value indicates the probability of observing a test statistic as extreme as, or more extreme than, the one calculated if the null hypothesis were true. It is used to make a decision about rejecting the null hypothesis.