All Flashcards
What is a Chi-Square Test?
A statistical test used to determine if observed data significantly differs from expected data, especially with categorical variables.
What is observed frequency?
The actual frequencies you see in your data.
What is expected frequency?
The frequencies you'd expect if there was no relationship between variables.
What is a two-way table?
A table that organizes categorical data, making it easier to calculate expected counts and perform the chi-square test.
What is a frequency table distribution?
A table showing the distribution of categorical data, used to calculate expected counts and perform the chi-square test.
What is the formula for the Chi-Square test statistic?
, where O is observed frequency and E is expected frequency.
How do you calculate expected counts in a Chi-Square test?
Expected Count = (Row Total * Column Total) / Grand Total
How do you calculate degrees of freedom for a Chi-Square test for independence?
df = (number of rows - 1) * (number of columns - 1)
What is the formula to calculate the Chi-Square statistic?
What is the formula for calculating expected frequency in a goodness-of-fit test?
Expected Frequency = Total Number of Observations * Hypothesized Proportion
What are the differences between the Chi-Square Test for Independence and Homogeneity?
Independence: Tests relationship between two variables in a single population. | Homogeneity: Compares distributions of a variable across multiple populations.
What are the differences between observed and expected frequencies?
Observed: Actual counts in your sample data. | Expected: Counts predicted under the null hypothesis (no association).
What are the differences between Goodness of Fit and Independence tests?
Goodness of Fit: Tests if one sample matches a hypothesized distribution. | Independence: Tests if two categorical variables are related.
What are the differences between a Chi-Square test and a z-test for proportions?
Chi-Square: Used for categorical variables with two or more categories. | Z-test: Used for comparing proportions of a single categorical variable with two categories.
What are the differences between rejecting and failing to reject the null hypothesis?
Rejecting: There is enough evidence to support the alternative hypothesis. | Failing to reject: There is not enough evidence to support the alternative hypothesis.