Exploring Two–Variable Data
What does r² represent in a linear regression model?
Total sum of squares in a dataset
Strength and direction of a linear relationship
Proportion of variation explained by the model
Slope of the least squares regression line
What is the goal of linear regression?
To connect all the data points on a scatterplot with a straight line.
To maximize the sum of the squared differences between observed and predicted values.
To model the linear relationship between a dependent variable and one or more independent variables.
To find the average of the dependent variable values.
In performing regression diagnostics, which graph would allow you to investigate whether there are influential points that may unduly affect slope or intercept estimations?
Time series plot displaying trends over time for residual errors after fitting model.
Histogram showing frequency distribution of independent variable values.
Normal probability plot comparing standardized residuals against theoretical quantiles.
Scatterplot with Cook's distance superimposed on each point.
When performing hypothesis testing for (slope) in simple linear regression, which alternative hypothesis correctly tests if there’s evidence of positive association between variables?
When comparing the strength of a linear relationship in two different scatterplots, which statistical value should be compared?
The correlation coefficient ().
The y-intercept of the least squares regression line.
The slope of the least squares regression line.
The sum of squared residuals from each plot.
Which factor would most likely violate the independence assumption necessary when constructing linear regression models?
Using computer-generated random numbers to assign individuals to experimental conditions before starting the collection process.
Ensuring equal gender distribution across different treatment levels studying their effect on outcome.
Randomly assigning subjects into different treatment groups during experiment design.
Collecting multiple observations from each participant within your study.
What does an r² value close to "1" suggest about a linear model?
The independent variable explains most of the variability in the dependent variable.
There is almost no correlation between independent and dependent variables.
The slope of the regression line is approaching zero, suggesting weak association.
The variance around the regression line increases significantly with each unit increase in x-variable.

How are we doing?
Give us your feedback and let us know how we can improve
What are the components of the equation for the least squares regression line?
Coefficient of determination and correlation coefficient.
Median and mode.
Slope and y-intercept.
Mean and standard deviation.
Question #4: In a normal distribution used for constructing a t-distribution based confidence interval around a slope coefficient from linear regression analysis, how will an increase in skewness likely affect inference assuming large samples are not available?
Skewness doesn't influence outcomes as long Central Limit Theorem applies regardlessly ensuring sampling distributions maintain their expected bell-shape form.
Inference precision improves automatically whenever skewness appears regardlessly provided condition known asymptotic properties retain their relevance across all circumstances still persisting here too.
It enhances precision inference since skewness adds directionality interpretation aiding hypothesis testing procedures regarding slope coefficients.
It can compromise validity due its effect on t-distribution approximation accuracy without large samples involved allowing Central Limit Theorem application fully counteracting it.
What refers to obtaining samples by asking participants to recommend others for studies?
Snowball Sampling
Multi-Stage Sampling
Random Digit Dialing
Proportional Allocation