Introducing Statistics: Are My Results Unexpected?

Noah Martinez
7 min read
Study Guide Overview
This AP Statistics study guide covers chi-square tests, focusing on variation between observed and expected data. It explains p-values, the impact of sample size on standard deviation and power, and the Law of Large Numbers. It includes practice multiple-choice and free-response questions with solutions covering hypothesis testing, calculating expected values, the chi-square statistic, degrees of freedom, and interpreting results.
AP Statistics: Chi-Square Tests - Your Ultimate Review Guide
Hey there, future AP Stats master! Let's break down chi-square tests and make sure you're totally ready for the exam. We'll turn those tricky concepts into easy wins! ๐
Introduction to Variation and Expected Counts
In chi-square tests, we're all about understanding variation. We're looking at the difference between what we observe in our data and what we expect to see based on a claim. Is the variation just random, or is something else going on? ๐ค

Image: Variation between observed and expected counts
Random Chance vs. Incorrect Claim
When we conduct a statistical test, there's always a chance that the difference we see is just due to random variation. Our goal is to figure out if the difference is significant or just random luck.
The Role of the P-value
The p-value tells us the probability of seeing our observed results (or more extreme results) if the null hypothesis is true.
- Low p-value (e.g., < 0.05): The observed difference is unlikely due to chance. We reject the null hypothesis and conclude there's a significant relationship. ๐
- High p-value (e.g., > 0.05): The observed difference could easily be due to chance. We fail to reject the null hypothesis. ๐
P-value low, null must go! (Reject the null hypothesis) P-value high, null will fly! (Fail to reject the null hypothesis)
Example: Coin Flipping
Let's say we flip a coin 10 times. We expect 5 heads and 5 tails, but we might get 4 heads and 6 tails. Is that weird? Not really. But if we got 10 heads and 0 tails, we might start to doubt the fairness of the coin.
Remember, it's not about whether the observed counts exactly match the expected counts, but whether the difference is statistically significant.
The Impact of Sample Size
Sample size is HUGE! It impacts everything. A small sample might not show a significant difference, but a large sample could reveal a pattern.
- Small Sample: 10 coin flips, 4 heads, 6 tails - no big deal.
- Large Sample: 1000 coin flips, 400 heads, 600 tails - that's more suspicious! ๐ช
Inverse Relationship
As sample size increases, standard deviation decreases. This inverse relationship is super important for the exam! ๐ก
Power in Statistical Tests
Power is the probability of correctly detecting a difference if it exists. Larger sample sizes increase the power of a test because they lead to smaller standard deviations.
Don't confuse statistical significance with practical significance. A statistically significant result might not be meaningful in the real world if the effect size is very small.
Law of Large Numbers
The law of large numbers states that as our sample size increases, the sample mean gets closer and closer to the true population mean. ๐๏ธ
Coin Flip Example
- 10 flips: 6 heads? Maybe just luck.
- 1000 flips: 500 heads? We're pretty confident the coin is fair. ๐
Think of the law of large numbers like this: the more data you have, the clearer the picture becomes. It's like zooming in on a blurry photoโmore pixels, more clarity!
Final Exam Focus
Key Concepts
- Variation: Understanding the difference between observed and expected counts.
- P-value: Interpreting the probability of seeing our results due to chance.
- Sample Size: How it affects standard deviation and the power of a test.
- Law of Large Numbers: How sample means converge to population means.
Common Question Types
- Multiple Choice: Interpreting p-values, understanding the effect of sample size, and identifying appropriate tests.
- Free Response: Performing chi-square tests, stating hypotheses, checking conditions, and drawing conclusions in context.
Last-Minute Tips
- Time Management: Pace yourself! Don't spend too long on one question.
- Common Pitfalls: Double-check your calculations and make sure you're using the correct test.
- Strategies: Read the questions carefully, underline key words, and show all your work.
Practice Questions
Practice Question
Multiple Choice Questions
-
A researcher is testing the hypothesis that the distribution of colors of candies in a bag is different from what the manufacturer claims. Which of the following is the most appropriate test? (a) A one-sample t-test (b) A two-sample t-test (c) A chi-square goodness-of-fit test (d) A chi-square test for independence (e) A z-test for proportions
-
A chi-square test for independence is conducted, and the p-value is 0.02. Which of the following is the best interpretation of this result? (a) There is a 2% chance that the null hypothesis is true. (b) There is a 2% chance that the alternative hypothesis is true. (c) There is a 2% chance of observing the data if the null hypothesis is true. (d) There is a 98% chance that the null hypothesis is true. (e) There is a 98% chance of observing the data if the null hypothesis is true.
-
As the sample size increases, what happens to the standard deviation of the sampling distribution of a statistic? (a) It increases. (b) It decreases. (c) It stays the same. (d) It becomes more variable. (e) It cannot be determined.
Free Response Question
A company claims that the distribution of the four colors of candies in their bags is as follows: 30% red, 30% blue, 20% green, and 20% yellow. A student buys a bag of 200 candies and counts the number of each color. The results are 68 red, 52 blue, 44 green, and 36 yellow.
(a) State the null and alternative hypotheses for this test. (b) Calculate the expected number of candies of each color. (c) Calculate the chi-square test statistic. (d) Calculate the degrees of freedom. (e) Find the p-value for this test. Interpret the p-value in context. (f) What conclusion can be made about the company's claim at the alpha = 0.05 significance level?
Scoring Guide for FRQ
(a) Hypotheses (1 point)
- Null Hypothesis (H0): The distribution of candy colors is as claimed by the company (30% red, 30% blue, 20% green, and 20% yellow).
- Alternative Hypothesis (Ha): The distribution of candy colors is different from what the company claims.
(b) Expected Counts (1 point)
- Red: 200 * 0.30 = 60
- Blue: 200 * 0.30 = 60
- Green: 200 * 0.20 = 40
- Yellow: 200 * 0.20 = 40
(c) Chi-Square Test Statistic (2 points)
(d) Degrees of Freedom (1 point)
- df = Number of categories - 1 = 4 - 1 = 3
(e) P-value and Interpretation (2 points)
- Using a calculator or chi-square table, the p-value is approximately 0.401
- Interpretation: If the null hypothesis is true (i.e., the distribution of candy colors is as claimed by the company), there is a 40.1% chance of observing a sample result as extreme as or more extreme than the one obtained.
(f) Conclusion (1 point)
- Since the p-value (0.401) is greater than the significance level (0.05), we fail to reject the null hypothesis. There is not enough evidence to suggest that the distribution of candy colors is different from what the company claims.
Alright, you've got this! Remember to stay calm, think clearly, and trust your knowledge. You're going to rock the AP Stats exam! ๐ช

How are we doing?
Give us your feedback and let us know how we can improve
Question 1 of 11
๐ In a study, you expect 50 individuals to prefer Brand A, but you observe 60. What does this difference represent?
Random sampling error
Observed variation
Statistical bias
Population parameter