Explain the concept of a confidence interval.
A range of values that likely contains the true population parameter, calculated from sample data and a chosen confidence level. It provides a measure of uncertainty around a point estimate.
Explain the concept of a significance test.
A procedure to assess the evidence against a null hypothesis. It involves calculating a test statistic and p-value to determine if the observed data is statistically significant.
Explain the concept of the p-value in hypothesis testing.
The probability of observing a sample statistic as extreme as, or more extreme than, the one observed if the null hypothesis is true. A small p-value suggests evidence against the null hypothesis.
Explain the importance of randomness in statistical inference.
Random sampling ensures that the sample is representative of the population, reducing bias and allowing for valid generalizations from the sample to the population.
Explain the relationship between sample size and the width of a confidence interval.
Larger sample sizes lead to narrower confidence intervals because they reduce the standard error of the sample proportion, providing a more precise estimate of the population proportion.
Explain the concept of Type I error in hypothesis testing.
A Type I error occurs when we reject the null hypothesis when it is actually true. The probability of making a Type I error is equal to the significance level (alpha).
What is the formula for the standard error of a sample proportion?
SE(pฬ) = $\sqrt{\frac{pฬ(1-pฬ)}{n}}$
What is the general form of a confidence interval?
Estimate ยฑ (Critical Value) * (Standard Error)
What is the formula for the test statistic (z) in a one-proportion z-test?
z = $\frac{pฬ - p_0}{\sqrt{\frac{p_0(1-p_0)}{n}}}$
What is the formula for the standard error of the difference between two sample proportions?
SE($pฬ_1 - pฬ_2$) = $\sqrt{\frac{pฬ_1(1-pฬ_1)}{n_1} + \frac{pฬ_2(1-pฬ_2)}{n_2}}$
What is the formula for the test statistic (z) in a two-proportion z-test?
z = $\frac{(pฬ_1 - pฬ_2)}{\sqrt{pฬ_c(1-pฬ_c)(\frac{1}{n_1} + \frac{1}{n_2})}}$ where $pฬ_c$ is the combined (pooled) sample proportion.
What are the differences between a confidence interval and a significance test?
Confidence Interval: Estimates a range for a population parameter; provides a margin of error. | Significance Test: Tests a claim about a population parameter; determines if there's enough evidence to reject the null hypothesis.
What are the differences between the null and alternative hypotheses?
Null Hypothesis: A statement of no effect or no difference; the hypothesis we are trying to disprove. | Alternative Hypothesis: A statement that contradicts the null hypothesis; represents what we suspect to be true.
What are the differences between a one-proportion z-test and a two-proportion z-test?
One-Proportion z-test: Used to test a claim about a single population proportion. | Two-Proportion z-test: Used to compare the proportions of two independent populations.
What are the differences between Type I and Type II errors?
Type I error: Rejecting a true null hypothesis (false positive). | Type II error: Failing to reject a false null hypothesis (false negative).
What are the differences between increasing the confidence level and decreasing the significance level?
Increasing confidence level: Increases the width of the confidence interval, making it more likely to capture the true parameter. | Decreasing significance level: Makes it harder to reject the null hypothesis, reducing the risk of a Type I error.