Probability, Random Variables, and Probability Distributions
If a sample mean is greater than the hypothesized population mean, what test statistic would you typically expect?
A positive test statistic indicating the difference significant enough to suggest a sample mean greater than pop mean
A zero test statistic suggesting no difference between sample means and hypothesized pop mean
A negative test statistic indicating the difference not significant and sample mean likely close to pop mean
An undetermined test statistic since we haven’t discussed the population standard deviation
Which graphical display would be most appropriate for displaying frequency distribution of single categorical variable?
Scatterplot
Boxplot
Bar graph
Histogram
If you find that residuals from a linear regression show a pattern when plotted against fitted values but no such pattern exists when plotted against any individual explanatory variables, what does this suggest about your model?
Incorrect answer 2.
The presence of interaction terms or nonlinearity that hasn't been accounted for in the model.
Incorrect answer 3.
Incorrect answer 1.
If you wish to generalize findings from your experiment involving exercise impact on sleep quality among adults over age 50, what feature must your study's sample have?
Randomness alone without considering if those sampled truly represent diverse segments within this demographic group.
Representativeness, meaning it should accurately reflect characteristics relevant to all adults over age 50.
Size, assuming quantity will automatically ensure generalizability regardless of how well those chosen mirror broader demographics.
Uniformity, meaning all subjects should share exactly similar physical attributes and lifestyle factors.
A study yields five different correlation coefficients between pairs of variables; which coefficient suggests both strong association and presence of possible influential points?
-0.45 suggests moderate positive association but doesn’t imply influential points specifically
-0.02 indicates virtually no linear relationship hence unlikely to detect influential points
-0.99 shows strong positive association yet doesn’t alone signal presence of influential points
-0.98 indicates strong negative association where few extreme pairs might heavily influence regression results
In multiple regression analysis, what problem may arise if two predictor variables are highly correlated with each other?
Standard deviation always decreases as predictor correlations increase.
The correlation guarantees higher R-squared values improving model fit.
Multicollinearity can inflate standard errors leading to less reliable estimates.
Predictor variable coefficients become more stable with higher correlation levels.
If we want to assess whether there's evidence that household income influences students' standardized test scores differently based on their parents' level of education, which technique should we employ?
Nonparametric tests analyzing rank order differences in standardized test scores among income brackets within educational categories.
Point-biserial correlation coefficients calculated separately within each parental education category against income groups.
A series of independent samples t-tests comparing test scores across different household incomes stratified by parent education levels.
Moderated multiple regression including income, parents' education level, and their interaction term.

How are we doing?
Give us your feedback and let us know how we can improve
How might researchers strengthen their claim that physical activity causes decreased risk for heart disease despite observing only correlational data?
Relying on cross-sectional surveys showing higher self-reported rates of exercising among people with fewer cardiac events suggests correlational not causal links.
Citing anecdotal instances where individuals with high levels of physical activity experience heart issues casts doubt on any causal link when anecdotal evidence cannot establish causal relations.
Presenting case studies of patients with little to no physical exercise having low rates of heart disease contests the possible connection between exercise and cardiac health.
Channeling effects through multiple well-controlled observational studies comparing similar populations with varying levels of physical activity while accounting for known confounders strengthens causal inference..
When conducting a hypothesis test for a population proportion, which assumption is necessary for using the normal distribution model?
The sample proportion must be equal to the hypothesized population proportion under .
There must be at least 30 individuals in the population from which we are sampling.
The sample size should be large enough so that and are both greater than 10.
The standard deviation must be known and remain constant across samples.
What statistical evidence would weaken an argument for causality between daily chocolate consumption and improved cognitive function?
An experiment where participants randomly assigned to eat chocolate show significant cognitive improvements compared to non-chocolate eaters supports causality directly rather than indirectly weakening it.
A low correlation coefficient between chocolate intake frequency and measures of cognitive ability suggests a strong linear relationship exists.
A large p-value for the slope coefficient in a regression analysis predicting cognitive function from chocolate consumption indicates weak evidence against the null hypothesis.
Longitudinal studies showing consistent improvement in cognition among individuals who consume chocolate frequently reinforce potential causality rather than undermining it.