Box Office Mojo collects and posts data on movie grosses. For a random sample of 50 movies, we obtained both the domestic (U.S.) and overseas grosses,

snowlovelydayM

snowlovelydayM

Answered question

2020-11-09

Box Office Mojo collects and posts data on movie grosses. For a random sample of 50 movies, we obtained both the domestic (U.S.) and overseas grosses, in millions of dollars. a) Obtain a scatterplot for the data. b) Decide whether finding a regressimz line for the data is reasonable. If so, then also do parts (c)-(f). c) Determine and interpret the regression equation for the data. d) Identify potential outliers and influential observations. e) In case a potential outlier is present, remove it and discuss the effect. f) In case a potential influential observation is present, remove it and discuss the effect.

Answer & Explanation

Tuthornt

Tuthornt

Skilled2020-11-10Added 107 answers

Given: n= Sample size =50 a) Domestic is on the horizontal axis and Overseas is on the vertical axis. image b) It is reasonable to find a regression lien for the data if there is no strong curvature present in the scatterplot. We note that there is no strong curvature in the scatterplot of part (a) and thus it is reasonable to find a regression line for the data. c) Let us first determine the necessary sums:  xi=3588.9
 xi2=712440.81
 yi=5233.5
 xiyi=968209.51 Next, we can determine Sxx and Sxy
Sxx=  xi2  ( xi)2n=712440.81  3588.9250=4548367458
Sxy=  xiyi  ( xi)( yi)n=968209.51  3588.9  5233.550=592559.347 The estimate b of the slope β is the ratio of Sxy and Sxx: b= SxySxx= 592559.374454836.7458=1.3028 The mean is the sum of all values divided by the number of values: x=  xin= 3588.950=71.778
y=  yin= 5233.550=104.67 The estimate a of the intercept α is the average of y decreased by the product of the estimate of the slope and the average of x. a= y  b x=104.67  1.3028  71.778=11.1579 General least-squares equation: y^= α + β x. Replace α by a=11.1579 and β by b=1.3028 in the general least-squares equation: y=a + bx=11.1579 + 1.3028x d) There appear to be two outliers, because the two rightmost points lie far from the group of other points. There appear to be an influential obsevation, because the point in the rightmost corner lies very close to the regression line while the point is a potential outlier. e) Let us first determine the necessary sums:  xi=2732.8
 xi2=345183.2
 yi=4314
 xiyi=567537.88 Next, we can determine Sxx and

Do you have a similar question?

Recalculate according to your conditions!

New Questions in College Statistics

Ask your question.
Get an expert answer.

Let our experts help you. Answer in as fast as 15 minutes.

Didn't find what you were looking for?