Fitting Lines to Data Points

The table below gives the projected life expectancy (life span) of a person born in the United States for select birth years from 1920 and through 2018 according to the CDC.

Year 1920 1930 1940 1950 1960 1970 1980 1990 2000 2004 2010 2015 2018
Years since 1900
Life Span 54.1 59.7 62.9 68.2 69.7 70.8 73.7 75.4 76.8 77.5 78.7 78.7 78.8
  1. First, fill in the empty row of the table labelled Years since 1900 with the number of years since 1900. E.g. the year 1960 is 60 years since 1900. We will consider this row of data our independent variable \(x.\)

  2. Using technology, plot a scatter-plot of this data using Years since 1900 as the independent variable \(x\) and Life Span as the dependent variable \(y.\)
  3. Graphs these two lines on the plot containing the scatter-plot.
    \( y = \frac{1}{5}x+56 \)
    \( y = \frac{1}{4}x+50 \)
  4. You should see that each of these two lines appear to fit the data pretty well, but there must be a line that fits the data better. Manually play (experiment) with the parameters (slope and \(y\)-intercept ) of the equation of a line to find a line that, to your estimation, appears to fit the data best.