emancipezN
2021-03-02
Unusual points. Each of the four scatterplots that follow shows a cluster of points and one “stray” point. For each, answer these questions:
1) In what way is the point unusual? Does it have high leverage, a large residual, or both?
2) Do you think that point is an influential point?
3) If that point were removed, would the correlation become stronger or weaker? Explain.
4) If that point were removed, would the slope of the regression line increase or decrease? Explain.
hajavaF
Skilled, 2021-03-03, Added 90 answers
a.
1. Residual:
The residual for an observation is the difference between the actual value of the response variable and its predicted value. That is, $e = y - \hat{y}$, where $y$ is the actual value of the response variable and $\hat{y}$ is the value predicted for the same value of the predictor variable.
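As a rough illustration of the definition, the sketch below fits a least-squares line and computes $e = y - \hat{y}$ for each observation. The data values and the helper names `fit_line` and `residuals` are made up for this example; they do not come from the scatterplots in the question.

```python
# Hypothetical data, invented for illustration only.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 5.0, 4.0, 5.0]


def fit_line(xs, ys):
    """Return (b0, b1) for the least-squares line y_hat = b0 + b1 * x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    b1 = sxy / sxx
    b0 = y_bar - b1 * x_bar
    return b0, b1


def residuals(xs, ys):
    """Return the residuals e = y - y_hat for every observation."""
    b0, b1 = fit_line(xs, ys)
    return [y - (b0 + b1 * x) for x, y in zip(xs, ys)]


res = residuals(xs, ys)
print([round(e, 3) for e in res])
```

For a least-squares line with an intercept, the residuals sum to zero, which is a quick sanity check on the fit.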
Leverage:
An observation whose predictor value (x-value) is far from the mean of the x-values is called a leverage point. A leverage point pulls the regression line toward itself and can have a large effect on the line. Because of that pull, a high-leverage point often has a small residual, although it does not have to.
This point pulls the regression line toward itself and has a large effect on the line. In addition, the difference between the observed and predicted values of the response variable at this point is large.
Thus, the point has high leverage and a large residual.
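The answer describes leverage informally as distance from the mean of the x-values. One common way to put a number on it in simple linear regression is $h_i = \frac{1}{n} + \frac{(x_i - \bar{x})^2}{\sum_j (x_j - \bar{x})^2}$. This formula is not stated in the answer; it is brought in here as an assumption, and the x-values below are invented.

```python
# Hypothetical x-values: a cluster from 1 to 5 plus one stray point at x = 15.
xs = [1.0, 2.0, 3.0, 4.0, 5.0, 15.0]


def leverages(xs):
    """Leverage of each point in simple linear regression:
    h_i = 1/n + (x_i - x_bar)^2 / sum_j (x_j - x_bar)^2
    """
    n = len(xs)
    x_bar = sum(xs) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    return [1.0 / n + (x - x_bar) ** 2 / sxx for x in xs]


h = leverages(xs)
for x, h_i in zip(xs, h):
    print(f"x = {x:5.1f}  leverage = {h_i:.3f}")
```

The stray point gets by far the largest leverage because its x-value is farthest from the mean. The leverages depend only on the x-values, so a point can have high leverage whatever its residual is.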
2. Influential point:
A point that stands apart from the rest of the data, and whose omission results in a very different regression model, is called an influential point.
The point is far from the mean of the explanatory variable. Moreover, omitting the point results in a very different regression model, because the point reinforces the association: with the point included, the scatterplot shows an overall positive direction that the remaining points do not show.
Thus, the point is an influential point.
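One simple way to check for influence is to fit the line twice, with and without the point, and compare the slopes. The sketch below does that on invented data (a trendless cluster plus one stray point far to the right); the numbers and the helper name `slope` are assumptions for illustration, not values read from the scatterplots.

```python
# Hypothetical data: a cluster with no trend, plus one stray point.
cluster_x = [1.0, 2.0, 3.0, 4.0, 5.0]
cluster_y = [2.0, 3.0, 1.0, 3.0, 2.0]
stray = (15.0, 10.0)  # invented stray point, far from the mean of x


def slope(xs, ys):
    """Least-squares slope b1 = Sxy / Sxx."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    return sxy / sxx


slope_without = slope(cluster_x, cluster_y)
slope_with = slope(cluster_x + [stray[0]], cluster_y + [stray[1]])

print(f"slope without stray point: {slope_without:.3f}")
print(f"slope with stray point:    {slope_with:.3f}")
```

A large gap between the two slopes is what "a very different regression model" means in practice.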
3. Association:
Two variables are associated if the value of one variable gives information about the value of the other.
Correlation measures the strength of the linear relationship between two variables.
The point supports the positive association, so removing it would weaken the association.
Thus, removing the point results in a weaker correlation.
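The effect on the correlation can be checked the same way, by computing r with and without the stray point. The data below are invented for illustration, and `correlation` is a small helper written for this sketch.

```python
from math import sqrt

# Hypothetical data: a trendless cluster plus one stray point that
# creates a positive association.
cluster_x = [1.0, 2.0, 3.0, 4.0, 5.0]
cluster_y = [2.0, 3.0, 1.0, 3.0, 2.0]
stray = (15.0, 10.0)


def correlation(xs, ys):
    """Pearson correlation r = Sxy / sqrt(Sxx * Syy)."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


r_with = correlation(cluster_x + [stray[0]], cluster_y + [stray[1]])
r_without = correlation(cluster_x, cluster_y)

print(f"r with stray point:    {r_with:.3f}")
print(f"r without stray point: {r_without:.3f}")
```

In this made-up data set nearly all of the correlation comes from the single stray point, so removing it leaves almost no linear association.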
4. In the linear regression model $\hat{y} = b_0 + b_1 x$, $\hat{y}$ is the predicted value of the response variable, $x$ is the predictor variable, $b_1$ is the slope, and $b_0$ is the intercept of the line.
The slope gives the rate of change of $\hat{y}$ with respect to $x$, and the slope estimate is
$b_1 = r \dfrac{s_y}{s_x}$, where $r$ is the correlation between $x$ and $y$, $s_y$ is the standard deviation of $y$, and $s_x$ is the standard deviation of $x$.
Without the point, the slope of the regression line would be close to 0.
Thus, if the point were removed, the regression line would be nearly flat.
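The slope formula $b_1 = r \, s_y / s_x$ can be checked numerically against the least-squares slope computed directly from the data. The sketch below uses invented values and Python's standard `statistics.stdev` for the sample standard deviations.

```python
from math import sqrt
from statistics import stdev

# Hypothetical data, invented for illustration only.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 5.0, 4.0, 5.0]

n = len(xs)
x_bar = sum(xs) / n
y_bar = sum(ys) / n
sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
sxx = sum((x - x_bar) ** 2 for x in xs)
syy = sum((y - y_bar) ** 2 for y in ys)

r = sxy / sqrt(sxx * syy)                  # correlation between x and y
b1_formula = r * stdev(ys) / stdev(xs)     # b1 = r * s_y / s_x
b1_least_squares = sxy / sxx               # slope computed directly

print(f"r = {r:.4f}")
print(f"b1 from r * sy / sx: {b1_formula:.4f}")
print(f"b1 from Sxy / Sxx:   {b1_least_squares:.4f}")
```

The two slope values agree, which is why a change in r (with the standard deviations changing as well) carries over to a change in the slope when a point is removed.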
b.
1. The point pulls the regression line toward itself and has a large effect on the line.
Thus, the point has high leverage and a small residual.
2. The point lies far from the cluster of points, and the scatterplot has a positive direction only because of this point; the remaining points show no clear direction. Moreover, omitting the point results in a very different regression model.
Thus, the point is an influential point.
3. Removing this point would weaken the association. Without the point, there would be little evidence of a linear association.
Thus, removing the point results in a weaker correlation.
4. Because the point is influential, removing it results in a very different regression line.
Without the point, the slope of the regression line would be close to 0.
Thus, if the point were removed, the regression line would be nearly flat.
c.
1. The point does not pull the regression line toward itself and does not have a large effect on the line. The difference between the observed and predicted values of the response variable at this point is quite large.
Thus, the point has little leverage and a large residual.
2. The point is close to the mean of the explanatory variable. Moreover, omitting the point does not result in a very different regression model.
Thus, the point is not an influential point.
3. The point detracts from the overall pattern, so removing it would reinforce the association.
Thus, removing the point results in a slightly stronger correlation, with r decreasing to become more negative.
4. Because the point is not influential, removing it does not result in a very different regression line.
The slope of the regression line would not be affected.
Thus, if the point were removed, the slope of the regression line would remain the same.
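The case of a low-leverage point with a large residual can be sketched with invented data: a cluster lying exactly on a falling line, plus one stray point placed at the mean of the x-values but well above the line. The values and the helper name `slope_and_r` are assumptions for illustration only.

```python
from math import sqrt

# Hypothetical data: the cluster lies exactly on y = 10 - x.
cluster_x = [1.0, 2.0, 3.0, 4.0, 5.0]
cluster_y = [9.0, 8.0, 7.0, 6.0, 5.0]
stray = (3.0, 12.0)  # at the mean of x (low leverage), far above the line


def slope_and_r(xs, ys):
    """Return (least-squares slope, Pearson correlation)."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / sxx, sxy / sqrt(sxx * syy)


b1_with, r_with = slope_and_r(cluster_x + [stray[0]], cluster_y + [stray[1]])
b1_without, r_without = slope_and_r(cluster_x, cluster_y)

print(f"with stray point:    slope = {b1_with:.3f}, r = {r_with:.3f}")
print(f"without stray point: slope = {b1_without:.3f}, r = {r_without:.3f}")
```

Here the slope is the same with and without the stray point, while the correlation is weaker when the point is included, because the point adds scatter without adding any pull on the line.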
d.
1. The point pulls the regression line toward itself and has a large effect on the line.
Thus, the point has high leverage and a small residual.
2. The point is far from the mean of the explanatory variable, which gives it high leverage. However, omitting the point does not result in a very different regression model, because the point reinforces the association shown by the other points.
Thus, the point is not an influential point.
3. The point supports the negative association, so removing it would weaken the association.
Thus, removing the point results in a weaker correlation.
4. Because the point is not influential, removing it does not result in a very different regression line.
Thus, if the point were removed, the slope of the regression line would remain the same.