# Regression — AP Statistics
Linear regression models the relationship between two quantitative variables. AP Statistics covers the least-squares regression line, correlation, residual analysis, and inference for the slope.
Key Concepts
Scatterplots
Describe: direction (positive/negative), form (linear/nonlinear), strength (weak/moderate/strong), unusual features (outliers).
Correlation ()
- .
- Measures linear association only.
- is unitless and unaffected by changes in units.
Least-Squares Regression Line (LSRL)
where and the line passes through .
Coefficient of Determination ()
= proportion of variability in explained by the linear relationship with .
Residuals
- Residual plot should show random scatter (no pattern).
- Patterns indicate the model is not appropriate.
Influential Points
- Outlier: large residual.
- High leverage: extreme -value.
- Influential: removing it substantially changes the regression line.
Inference for Slope
CI for : .
(no linear relationship).
Conditions (LINE):
- Linear relationship (check residual plot).
- Independence of observations.
- Normal distribution of residuals.
- Equal variance (constant spread in residual plot).
Transformations
If the relationship is nonlinear, transform one or both variables (e.g., vs. for exponential, vs. for power).
Worked Example
Problem: From computer output: , , . Test if the slope is significantly different from zero.
Solution:
, .
, .
p-value . At , reject . There is convincing evidence of a linear relationship.
Practice Questions
1. If , what is and what does it mean?
. 64% of the variability in is explained by the linear relationship with .
2. A residual plot shows a U-shaped pattern. What does this indicate?
The linear model is not appropriate; the relationship may be curved (try a quadratic or transformed model).
3. The LSRL is . Predict when , and find the residual if .
. Residual = .
Want to check your answers and get step-by-step solutions?
Summary
- LSRL: ; ; passes through .
- measures linear association; tells the explained proportion.
- Residual plots check model fit; patterns = bad fit.
- Inference for slope: ; check LINE conditions.
- Transform data for nonlinear relationships.
