The regression line is not as steep as the SD Line (line through point of averages, with slope = sd(y)/sd(x)). The difference is especially noticeable when the scatter plot is the result of a sample from a bivariate normal distribution. This app explains why we use the regression line to predict y from x, even though the SD line appears to be a better linear summary of the scatter plot. Can be used as a starting-point for a discussion of "regression to the mean."
ShallowReg(n=900,rho=0.5)
Arguments
n: Number of points in the scatter plot.
rho: Target correlation for the scatter plot. Points are selected from a standardized bivariate normal distribution, with correlation rho.
Returns
Graphical output.
Note
Uses manipulate, available only in R Studio, and mvrnorm from package MASS.
Examples
## Not run:if(require(manipulate)) ShallowReg()## End(Not run)