zuai-logo

Comparing Distributions of a Quantitative Variable

Ava Garcia

Ava Garcia

9 min read

Study Guide Overview

This AP Statistics study guide covers comparing groups using graphical methods. It focuses on comparing distributions with stem-and-leaf plots, histograms, and box plots. Key concepts include describing, summarizing, and representing data, calculating and interpreting medians, ranges, and interquartile ranges (IQR), and analyzing distribution shape, center, and spread. The guide provides practice questions and emphasizes interpreting results within the context of real-world scenarios.

AP Statistics: Comparing Groups - Your Ultimate Study Guide πŸš€

Hey there, future AP Stats superstar! Let's get you prepped and confident for your exam. This guide is designed to be your go-to resource, especially the night before the big day. We'll break down comparing groups using different graphical methods and make sure you're ready for anything the exam throws at you. Let's dive in!

Comparing Distributions

Key Concept

Key Concept: Describing, Summarizing, and Representing Data

Remember, we're not just looking at data; we're telling a story with it. Comparing groups involves looking at how different sets of data are distributed and using that to draw conclusions. We'll use stem-and-leaf plots, histograms, and box plots to do this. Let's get started!

Comparing Groups with Stem-and-Leaf Plots

Warm-Up: Stem Plot Comparison 🌳

Let's start with a quick warm-up using stem-and-leaf plots. These are great for visualizing the distribution of small datasets.

Question: The weight of two groups of eight animals, Group M and Group N, are recorded and the data is shown in the stem plots below (with each stem and leaf representing weight in kg). Use the stem plots to compare the weight of the animals in the two groups.

Group M:

1 | 4
2 | 3 4 8
3 | 2 6 8
4 |
5 | 0

Group N:

1 | 0
2 | 3 6
3 | 5
4 | 1
5 | 4 7
6 | 2

Analysis:

  • Range: Group M ranges from 14 to 50 kg, while Group N ranges from 10 to 62 kg. Group N has a wider range.
  • Distribution: Group M has a cluster in the 20s and 30s, while Group N is more evenly distributed.

Key Takeaway: Group N has a wider range and more diverse weights, while Group M is more clustered around the 20s and 30s.

Practice Question

Multiple Choice Question:

Two different groups of students were asked to rate their satisfaction with a new school program on a scale of 1 to 5, with 5 being the most satisfied. The stem plots below show the distribution of the satisfaction scores for each group.

Group A:

1 | 3
2 | 1 4
3 | 2 5 5
4 | 0 3

Group B:

1 | 1 2
2 | 3 4
3 | 0 1 2
4 | 5

Which of the following statements is supported by the data?

(A) The range of satisfaction scores is greater for Group A than for Group B. (B) The median satisfaction score is higher for Group B than for Group A. (C) The distribution of satisfaction scores for Group A is skewed to the right. (D) The distribution of satisfaction scores for Group B is skewed to the left. (E) The mean satisfaction score for Group A is lower than the mean satisfaction score for Group B.

Answer: (A) The range of satisfaction scores is greater for Group A than for Group B.

Explanation: The range for Group A is 4 - 1 = 3, and the range for Group B is 4 - 1 = 3. The range of satisfaction scores is the same for both groups. The median for Group A is 3, and the median for Group B is 2.5. The distribution of satisfaction scores for Group A is skewed to the left, and the distribution of satisfaction scores for Group B is skewed to the right. Therefore, the only answer that is supported by the data is (A).

Comparing Groups with Histograms

Practice AP-Style Problem: P-T Ratios 🏫

Let's tackle a more complex problem using histograms. Remember, histograms are great for showing the distribution of larger datasets.

Scenario: We're comparing the pupil-to-teacher (P-T) ratios in states east and west of the Mississippi River.

markdown-image

a. Estimate the median P-T ratio for both groups.

  • West: Median falls between the 12th and 13th value, around 15-16. * East: Median falls between the 13th and 14th value, also around 15-16. b. Compare the distributions of P-T ratios for the two groups.

  • Shape: West is unimodal and skewed right; East is unimodal and nearly symmetric.

  • Center: Medians are similar for both groups (15-16).

  • Spread: West has a larger range (approx. 10) than East (approx. 7).

c. Compare the mean P-T ratios for the two groups.

  • West: Skewed right, so the mean will likely be greater than the median.
  • East: Fairly symmetric, so the mean will be close to the median.
  • Conclusion: The mean for the West group will probably be greater than the mean for the East group.
Exam Tip

Remember to always discuss shape, center, and spread when comparing distributions. This is a common rubric requirement.

Practice Question

Free Response Question:

The following histograms show the distribution of test scores for two different classes, Class A and Class B. The test was graded out of 100 points.

markdown-image

(a) Estimate the median test score for each class.

(b) Write a few sentences comparing the distributions of test scores for the two classes.

(c) Based on your answers in parts (a) and (b), explain how you think the mean test score will compare for the two classes.

Answer:

(a) Median Estimation:

  • Class A: The median appears to be between 70 and 80. * Class B: The median appears to be between 80 and 90. (b) Distribution Comparison:
  • Shape: Class A's distribution is roughly symmetric and unimodal, while Class B's distribution is skewed to the left and unimodal.
  • Center: The median score for Class B is higher than the median score for Class A.
  • Spread: The range of scores is larger for Class A than for Class B. The scores in Class B are more concentrated than in Class A.

(c) Mean Comparison:

  • Since Class A's distribution is roughly symmetric, the mean will be close to the median. Class B's distribution is skewed to the left, so the mean will be lower than the median. Therefore, the mean for Class A will likely be lower than the mean for Class B.

Scoring Rubric:

  • (a) 1 point for each correct median estimate (2 points total).
  • (b) 1 point for each correct comparison of shape, center, and spread (3 points total).
  • (c) 1 point for a correct comparison of the means based on the shapes of the distributions (1 point total).

Comparing Groups with Box Plots

Practice AP-Style Problem: Basketball Visualization πŸ€

Box plots are fantastic for comparing medians and spreads, and for identifying outliers. Let's see how they work in action.

Scenario: We're comparing the number of attempts it takes basketball players to make two consecutive baskets, with and without visualization training.

markdown-image

Analysis:

  • Minimum: Both groups have the same minimum attempts.
  • 25th Percentile: Group 1 (visualization) is at 3 attempts, while Group 2 is at 4 attempts.
  • Median: The median for Group 1 is much lower (4 attempts) than for Group 2 (7 attempts).
  • Outlier: Group 1 has an outlier, but it's still less than the maximum of Group 2. * Conclusion: Players who received visualization training tend to need fewer attempts to make two consecutive baskets.
Memory Aid

Remember SOCS when comparing distributions: Shape, Outliers, Center, Spread. This will help you cover all the bases.

Practice Question

Multiple Choice Question:

The box plots below show the distribution of the number of hours spent studying per week for students in two different schools, School A and School B.

markdown-image

Which of the following statements is supported by the box plots?

(A) The median number of hours spent studying is higher for School A than for School B. (B) The range of the number of hours spent studying is greater for School B than for School A. (C) The interquartile range (IQR) of the number of hours spent studying is the same for both schools. (D) The distribution of the number of hours spent studying is symmetric for both schools. (E) The mean number of hours spent studying is higher for School B than for School A.

Answer: (B) The range of the number of hours spent studying is greater for School B than for School A.

Explanation: The median for School A is around 15, and the median for School B is around 12. The range for School A is 20-5=15, and the range for School B is 20-0=20. The IQR for School A is 18-10=8, and the IQR for School B is 15-8=7. The distribution of the number of hours spent studying is skewed for both schools. Therefore, the only answer that is supported by the data is (B).

Final Exam Focus

High-Priority Topics:

  • Describing Distributions: Always remember shape, center, and spread.
  • Comparing Groups: Use stem plots, histograms, and box plots effectively.
  • Interpreting Context: Connect your statistical findings to the real-world scenario.

Common Question Types:

  • Comparison Questions: Expect to compare distributions using multiple methods.
  • Interpretation Questions: Be ready to explain what your findings mean in the context of the problem.
  • Mean vs. Median: Know when to use each and how they relate to the shape of the distribution.

Last-Minute Tips:

  • Time Management: Don't get bogged down on one question. Move on and come back if needed.
  • Common Pitfalls: Watch out for misinterpreting skewness and confusing mean with median.
  • Challenging Questions: Break down complex problems into smaller, manageable parts.
Exam Tip

When comparing distributions, always use comparative language (e.g., "greater than", "less than", "more variable").

Common Mistake

Don't forget to always refer back to the context of the problem when interpreting your results. This is crucial for earning full marks.

You've got this! Stay calm, trust your preparation, and remember all the cool secrets you've uncovered. Go ace that AP Stats exam! πŸŽ‰

Question 1 of 11

πŸ“Š Consider two groups, A and B. Group A's stem plot shows values ranging from 12 to 35, while Group B's ranges from 15 to 42. What is the range of Group A?

20

23

27

30