In using the powerful analysis techniques we have learned there is a potential problem for which we must be constantly vigilant. We know that SSE and estimators which minimize SSE are extremely sensitive to outliers.
Outliers are extreme observations that for one reason or another do not belong with the other observations in the DATA.
If least square regression estimators are routinely applied to data which contain a few wild observations, then the obtained estimates can be seriously misleading.
It is therefore critically important to investigate the data for the presence of outliers whenever least square regression procedures are used.

If a data point has any of these three characteristics, it can be considered an outlier which requires attention from the researcher.
We do not have formal rules for determining the cut off for the lever, but there are informal guidelines.
According to Judd and McClelland, Velleman & Welsch (1981) suggest that levers two or three times the average ought to be considered large and in need of further attention in the data analysis.
Again according to Judd and McClelland, Huber (1981) suggests that any lever over .2 deserves special attention whenever n is reasonably large.
Influential points ------------------------------------------------------ Mahalanobis Row Leverage Distance DFITS ------------------------------------------------------ 6 0.759815 33.8814 8.2275 ------------------------------------------------------ Average leverage = 0.15384615384615385 The table of influential data points lists all observations which have leverage values greater than 3 times that of an average data point, or which have an unusually large value of DFITS. Leverage is a statistic which measures how influential each observation is in determining the coefficients of the estimated model. DFITS is a statistic which measures how much the estimated coefficients would change if each observation was removed from the data set. In this case, an average data point would have a leverage value equal to 0.15384615384615385. There is one data point with more than 3 times the average leverage, but none with more than 5 times. There is one data point with an unusually large value of DFITS.
The first question is unusual with respect to what? Obviously, we want to answer the question of whether a criterion value is unusual with respect to the model we are using.
Residuals look at differences between the model and the criterion value.
Raw residuals have two major problems. Their values depend on the scale of the variables, and, unusual criterion values tend to "grab" the regression line -- producing smaller residuals than when the criterion point is left out of the analysis.
Studentized deleted residuals solve both of these problems. In addition, none of the other commonly reported transformed residual scores detect outliers any better than the studentized deleted residual, and finally, if you square the studentized deleted residual you have the appropriate F statistic (remember student developed the t statistic) for testing an outlier model where a single parameter is used for the case in question, against a model where that parameter is not used for that particular case.
In general, Judd & McClelland do not recommend that the squared studentized deleted residual or the F that results from evaluating the outlier model be used as a formal statistical test unless one has external information questioning the validity or reliability of a particular observation (don't go on a witch hunt).
Unusual residuals ---------------------------------------------------------------------- Predicted Studentized Row Y Y Residuals Residuals ---------------------------------------------------------------------- 6 86.0 72.6715 13.3285 4.63 ----------------------------------------------------------------------
Cook's D tests to see whether the error in the model changes when a specific data value is either included, or excluded from the model.
There are only informal guidelines for interpreting Cook's D.In summary, Cook's D assesses the global impact of each observation on the parameter estimates and, equivalently, on the predictions for all other observations. Large values of D identify those observations with large impacts. We therefore can use Cook's D to answer this third question.
DFITS A statistic computed when fitting a multiple regression model to measure the change in each predicted value which would occur if a single data value was deleted. Large values correspond to points which have a big influence on the fitted model.
The Statlet's application does calculate Cook's D values as shown in the Output tab below.

Judd & McClelland think that it is good practice to omit outliers from the analysis with the explicit admission in the report that there are some observations which are not understood. You can then report a good model for those observations which you do understand.
There is a good Exhibit (9.9) on page 231 in the text which shows what each type of outlier does to the regression research.

This plot displays observed values of SAT versus values predicted by the fitted model. The closer the points lie to the diagonal line, the better the model at predicting the observed data. You should look for various anomalies, such as increases in variability around the line as the value of SAT increases (heteroscedasticity), or individual data points which lie far away from the line (outliers).

This plot displays the Studentized residuals versus values of HSRANK. Any non-random pattern could indicate that the selected model does not adequately describe the observed data. In addition, any values outside the range of -3 to +3 could well be outliers.
While Statlets doesn't have a feature where scatter plots can be drawn where the size of the dot is proportional to the influence the data point has on the regression equation, other statistical packages can be used to produce these plots. On the left is a SYSTAT influence plot produced using Exhibit 9-2.