AP Statistics Name: Per: Date: 3. Least- Squares Regression p164 168 Ø What is the general form of a regression equation? What is the difference between y and ŷ? Example: Tapping on cans Don t you hate it when you open a can of soda and some of the contents spray out of the can? Two AP Statistics students, Kerry and Danielle, wanted to investigate if tapping on a can of soda would reduce the amount of soda expelled after the can has been shaken. For their experiment, they vigorously shook 40 cans of soda and randomly assigned each can to be tapped for 0 seconds, 4 seconds, 8 seconds, or 1 seconds. Then, after opening the can and cleaning up the mess, the students measured the amount of soda left in each can (in ml). Here is a scatterplot with the least- squares regression line soda! = 48.6 +.63 ( tapping time ). a) Interpret the slope and y intercept of a regression line. Amount of soda remaining (ml) 0 s 4 s 8 s 1 s 45 60 67 75 55 50 71 80 50 50 68 75 50 50 70 80 50 60 76 85 45 65 55 90 48 67 70 84 50 60 70 78 51 61 75 79 49 59 75 80 b) Predict the amount remaining for a can that has been tapped 10 seconds. c) Predict the amount remaining for a can that has been tapped for 60 seconds. How confident are you in this prediction? Ø What is extrapolation? Is it a good idea to extrapolate? HW #3.A page 193 (35 41 odd)
3. Residuals p168 17 Ø What is a residual? How do you interpret a residual? Ø Calculate and interpret the residual for the can that was tapped for 4 seconds and had 60 ml of soda remaining. Example: McDonald s Beef Sandwiches x= Carbs (g) 31 33 34 37 40 40 45 37 38 y = Fat (g) 9 1 3 19 6 4 9 4 8 (a) How can we determine the best regression line for a set of data? (b) Is the least- squares regression line resistant to outliers? (c) Calculate the equation of the least- squares regression line using technology. Make sure to define variables! Sketch the scatterplot with the graph of the least- squares regression line. (d) Interpret the slope and y- intercept in context. (e) Calculate and interpret the residual for the Big Mac, with 45g of carbs and 9g of fat.
p17 176 Ø How can we know a line is the right model to use?? Ø What is a residual plot? What is the purpose of a residual plot? Ø What do you look for in a residual plot? How can you tell if a linear model is appropriate? Ø Construct and interpret a residual plot for the McDonald s data. HW #3.B: page 193 (40, 4, 44, 46, 47, 49, 51)
3. How well the line fits the data: s and p177 181 r Ø What is the standard deviation of the residuals? How do you calculate and interpret it? Ø For the can tapping example, the standard deviation of the residuals is s = 5.00. Interpret this value. Ø Suppose that we wandered in during the can tapping experiment and found a partially- full can. Without measuring the contents, how could we predict how much soda is left in the can (assuming we had access to the data)? Ø How much better would our predictions be if we knew how long it had been tapped? Ø What is the coefficient of determination r? How do you calculate and interpret r? Ø How is r related to r? How is r related to s? HW #3.C: page 194 (48, 50, 55, 58)
3. Interpreting Computer Output, Regression to the Mean p181 18 Ø Discuss r and s Example: Mentos and Diet Coke When Mentos are dropped into a newly opened bottle of Diet Coke, carbon dioxide is released from the Diet Coke very rapidly, causing the Diet Coke to be expelled from the bottle. Will more Diet Coke be expelled when there is a larger number of Mentos dropped in the bottle? Two statistics students, Brittany and Allie, decided to find out. Using 16 ounce ( cup) bottles of Diet Coke, they dropped either, 3, 4, or 5 Mentos into a randomly selected bottle, waited for the fizzing to die down, and measured the number of cups remaining in the bottle. Then, they subtracted this measurement from the original amount in the bottle to calculate the amount of Diet Coke expelled (in cups). Output from a regression analysis is shown below. 1.45 1.40 0.10 1.35 0.05 Amount Expelled 1.30 1.5 1.0 1.15 Residual 0.00-0.05 1.10 1.05.0.5 3.0 3.5 Mentos 4.0 4.5 5.0-0.10 1.15 1.0 1.5 Fitted Value 1.30 1.35 (a) What is the equation of the least- squares regression line? Define any variables you use. Predictor Coef SE Coef T P Constant 1.0008 0.04511.1 0.000 Mentos 0.07083 0.018 5.77 0.000 (b) Interpret the slope of the least- squares regression line. S = 0.06744 R-Sq = 60.% R-Sq(adj) = 58.4% (c) What is the correlation? (d) Is a linear model appropriate for this data? Explain. (e) Would you be willing to use the linear model to predict the amount of Diet Coke expelled when 10 mentos are used? Explain.
(f) Calculate and interpret the residual for bottle of diet coke that had mentos and lost 1.5 cups. (g) Interpret the values of r and s. (h) If the amount expelled was measured in ounces instead of cups, how would the values of r and s be affected? Explain. p18 185 Ø How can you calculate the equation of the least- squares regression line using summary statistics? Ø What happens to the predicted value of y for each increase of 1 standard deviation in x? HW #3.D page 195 (56, 59, 61, 63, 65)
3. Putting it all Together: Regression and Correlation p185 191 Ø Does it matter which variable is x and which is y? Ø Which of the following has the highest correlation? Ø How do outliers affect the correlation, least- squares regression line, and standard deviation of the residuals? Are all outliers influential? Example: Here is a scatterplot showing the cost in dollars and the battery life in hours for a sample of netbooks (small laptop computers). What effect do the two netbooks that cost $500 have on the equation of the least- squares regression line, correlation, standard deviation of the residuals, and r? Explain. Example: Here is a scatterplot showing the relationship between the number of fouls and the number of points scored for NBA players in the 010-011 season. a) Describe the association. b) Should NBA players commit more fouls if they want to score more points? Explain. HW #3.E page 197 (67, 69, 71 78)