Exploring Two–Variable Data
What does an r² value close to "1" suggest about a linear model?
The independent variable explains most of the variability in the dependent variable.
There is almost no correlation between independent and dependent variables.
The slope of the regression line is approaching zero, suggesting weak association.
The variance around the regression line increases significantly with each unit increase in x-variable.
What does r² represent in a linear regression model?
Total sum of squares in a dataset
Strength and direction of a linear relationship
Proportion of variation explained by the model
Slope of the least squares regression line
In performing regression diagnostics, which graph would allow you to investigate whether there are influential points that may unduly affect slope or intercept estimations?
Time series plot displaying trends over time for residual errors after fitting model.
Histogram showing frequency distribution of independent variable values.
Normal probability plot comparing standardized residuals against theoretical quantiles.
Scatterplot with Cook's distance superimposed on each point.
What is the goal of linear regression?
To connect all the data points on a scatterplot with a straight line.
To maximize the sum of the squared differences between observed and predicted values.
To model the linear relationship between a dependent variable and one or more independent variables.
To find the average of the dependent variable values.
When performing hypothesis testing for (slope) in simple linear regression, which alternative hypothesis correctly tests if there’s evidence of positive association between variables?
When comparing the strength of a linear relationship in two different scatterplots, which statistical value should be compared?
The correlation coefficient ().
The y-intercept of the least squares regression line.
The slope of the least squares regression line.
The sum of squared residuals from each plot.
Which factor would most likely violate the independence assumption necessary when constructing linear regression models?
Using computer-generated random numbers to assign individuals to experimental conditions before starting the collection process.
Ensuring equal gender distribution across different treatment levels studying their effect on outcome.
Randomly assigning subjects into different treatment groups during experiment design.
Collecting multiple observations from each participant within your study.

How are we doing?
Give us your feedback and let us know how we can improve
Considering that multicollinearity can inflate standard errors in multiple regression analysis, what technique could best minimize its effects when predicting house prices using floor area and number of rooms?
Principal component analysis (PCA) which transforms correlated variables into a set of uncorrelated components.
Stepwise selection by gradually adding or removing predictors based on their statistical significance.
Lasso regression which performs variable selection and regularization simultaneously to enhance prediction accuracy.
Ridge regression which adds bias but reduces variance through shrinkage parameters.
If a linear regression model for predicting annual income based on years of education has a residual plot with increasing variability as the number of years of education increases, which transformation should be applied to meet the assumption of equal variances?
Inverse transformation to reduce right skewness.
Cubic transformation to capture polynomial trends.
Logarithmic transformation to stabilize variance.
Square root transformation to linearize the relationship.
If a student increases the sample size used to create a linear regression model, what is likely to happen to the standard error of the estimate?
It will remain unchanged.
It will decrease.
It will become zero.
It will increase.