The scatterplot below shows the finishing times for the Olympic gold medalist in the men's 100-meter dash for many previous Olympic games. The least squares regression line is also shown. (Source: http://trackandfield.about.com/od/sprintsandrelays/qt/olym100medals.htm)

- Is a linear model a good fit for the data? Explain, commenting on the strength and direction of the association.
- The equation of the linear function that best fits the data (regression line) is $$\widehat{\mbox{Finishing time}} = 10.878 - 0.0106 \left( \mbox{Year after 1900} \right).$$ Given that the summer Olympic games only take place every four years, how should we expect the gold medalist's finishing time to change from one Olympic games to the next?
- What is the vertical intercept of the function's graph? What does it mean in context of the 100-meter dash?
- Note that the gold medalist finishing time for the 1940 Olympic games is not included in the scatterplot. Use the model to estimate the gold medalist's finishing time for that year.
- What is a realistic domain for the linear regression function? Comment on how your answer pertains to using this function to make predictions about future Olympic 100-m dash race times.
Commentary
The task asks students to identify when two quantitative variables show evidence of a linear association, and to describe the strength and direction of that association. Students then utilize a least-squares regression line to make predictions, and to make conjectures about the limitations of the model, which is a very important aspect of SMP4 - Model with Mathematics. They must apply their knowledge of slope and intercept of a linear function in context of the problem; i.e., understand that the slope of a regression line is the predicted change in the response variable per unit change of the explanatory variable, and that the vertical intercept corresponds to a value of zero in the explanatory variable.
Linear models are a very nice connection between statistics and functions in high school mathematics. Coherence in high school mathematics means drawing connections between topics that use the same mathematical concept. In this case we use linear functions to model the relationship between two quantitative variables. We can use the context of investigating if there is an association between two variables to strengthen our understanding of slope and intercept of a linear function.
This task is probably most appropriate for use in instruction. Consider having students work together in pairs or small groups on parts a - d. Part e could then be the basis for a whole class discussion.