From the Statistical Abstract of the United States, we obtained data on percentage of gross domestic product (GDP) spent on health care and life expectancy,

vazelinahS

vazelinahS

Answered question

2021-03-06

From the Statistical Abstract of the United States, we obtained data on percentage of gross domestic product (GDP) spent on health care and life expectancy, in years, for selected countries. a) Obtain a scatterplot for the data. b) Decide whether finding a regression line for the data is reasonable. If so, then also do parts (c)-(f). c) Determine and interpret the regression equation for the data. d) Identify potential outliers and influential observations. e) In case a potential outlier is present, remove it and discuss the effect. f) In case a potential influential observation is present, remove it and discuss the effect.

Answer & Explanation

svartmaleJ

svartmaleJ

Skilled2021-03-07Added 92 answers

Step 1: Female n= Sample size =30

a) Health GDP is on the horizontal axis and Life expectancy is on the vertical axis.

b) It is reasonable to find a regression lien for the data if there is no strong curvature present in the scatterplot. We note that there is no strong curvature in the scatterplot of part (a) and thus it is reasonable to find a regression line for the data.

c) Let us first determine the necessary sums:

 xi=269.1
 xi2=2514.89
 yi=2441.7
 xiyi=21942.55

Next, we can determine Sxx and Sxy 

Sxx=  xi2  ( xi)2n=2514.89  269.1230=101.063
Sxy=  xiyi  ( xi)( yi)n=21942.55  269.1  2441.730=167.087

The estimate b of the slope β is the ratio of Sxy and Sxx: b= SxySxx= 167.087101.063=0.4008

The mean is the sum of all values divided by the number of values: x=  xin= 269.130=8.97
y=  yin= 2441.730=81.39

The estimate a of the intercept α is the average of y decreased by the product of the estimate of the slope and the average of x. a= y  b x=81.39  0.4008  8.97=77.7953 General least-squares equation: y^= α + β x.

Replace α by a=77.7953 and β by b=0.4008 in the general least-squares equation: y=a + bx=77.7953 + 0.4008x

d) There appear to be an outlier to the far right in the graph, because the point lies much far too the right than all other points in the scatterplot. The outlier appears to be an influential observation as well, because the regression line doesn't follow the general pattern in points excluding the outlier.

e) Let us first determine the necessary sums:  xi=253.8
 xi2=2280.8
 yi=2361.3
 xiyi=20712.43

Next, we can determine Sxx and Sxy

Do you have a similar question?

Recalculate according to your conditions!

New Questions in College Statistics

Ask your question.
Get an expert answer.

Let our experts help you. Answer in as fast as 15 minutes.

Didn't find what you were looking for?