All Flashcards
What is the formula for the Chi-Square test statistic?
What is the formula for Expected Frequency (E)?
What is the formula for Degrees of Freedom (df)?
Explain the concept of the Chi-Square Test for Independence.
It determines if there is a relationship between two categorical variables within a single population. It compares observed data to expected data assuming no relationship.
Explain the concept of the Chi-Square Test for Homogeneity.
It determines if the distribution of a categorical variable is the same across multiple populations. It compares the distribution of data across these populations.
Explain the 'Large Counts' condition for Chi-Square tests.
All expected counts must be at least 5. This ensures that the chi-square distribution is a good approximation for the test statistic's distribution.
Explain the meaning of a small p-value in a Chi-Square test.
A small p-value (typically less than 0.05) suggests that the observed data is unlikely if the null hypothesis were true, leading us to reject H0.
Explain how to draw a conclusion from a Chi-Square Test.
Compare the p-value to the significance level (alpha). If p-value ≤ alpha, reject the null hypothesis. Otherwise, fail to reject the null hypothesis. State the conclusion in the context of the problem.
Define Chi-Square Test.
A statistical test used to analyze categorical data to determine if there's a significant association between variables or if observed data fits an expected pattern.
Define Null Hypothesis (H0) in a Chi-Square test.
There is no association between the variables (for independence) or the distributions are the same (for homogeneity).
Define Alternative Hypothesis (Ha) in a Chi-Square test.
There is an association between the variables (for independence) or the distributions are different (for homogeneity).
Define Observed Frequency (O).
The actual count of data points falling into a specific category.
Define Expected Frequency (E).
The count we would expect in a cell if the null hypothesis were true.
Define P-value.
The probability of observing a test statistic as extreme as, or more extreme than, the one calculated, assuming the null hypothesis is true.