All Flashcards
What are the differences between a good residual plot and a bad residual plot?
Good: Random scatter, no pattern | Bad: Clear pattern (curve, funnel), indicates non-linearity.
What are the differences between a positive residual and a negative residual?
Positive: Model underestimated, y > ŷ | Negative: Model overestimated, y < ŷ
What are the key differences between a scatterplot and a residual plot?
Scatterplot: Shows relationship between x and y | Residual Plot: Shows residuals (y-ŷ) against x, assessing model fit.
Compare the implications of residuals clustered near zero versus residuals spread far from zero.
Near Zero: Model predictions are close to actual values, better fit | Far From Zero: Model predictions are less accurate, poorer fit.
What are the differences between using a linear model and a non-linear model based on residual plots?
Linear Model: Appropriate when residuals are randomly scattered | Non-Linear Model: More appropriate when residuals show a pattern.
Explain the concept of a residual plot and its purpose.
A residual plot is a graph that displays residuals on the y-axis and predictor variable on the x-axis. It helps assess the appropriateness of a linear model.
Explain the concept of 'least squares criterion'.
The 'least squares criterion' aims to minimize the sum of the squares of the residuals, ensuring the best fit for the regression line.
Explain what a pattern in a residual plot suggests.
A pattern (e.g., curve, funnel) suggests that a linear model is not appropriate for the data and a non-linear model might be a better fit.
Explain what a random scatter in a residual plot suggests.
A random scatter suggests that a linear model is a good fit for the data, as there's no systematic pattern in the residuals.
What does it mean if a linear model 'overestimates' a value?
The predicted value (ŷ) from the model is higher than the actual observed value (y). This results in a negative residual.
What does it mean if a linear model 'underestimates' a value?
The predicted value (ŷ) from the model is lower than the actual observed value (y). This results in a positive residual.
What is the definition of a residual?
The difference between the observed value (y) and the predicted value (ŷ).
What is the definition of a residual plot?
A scatterplot of the residuals (y - ŷ) against the predictor variable (x).
Define a positive residual.
The model underestimated the true value; actual value is higher than predicted.
Define a negative residual.
The model overestimated the true value; actual value is lower than predicted.
What does a random scatter of residuals indicate?
A linear model is appropriate.