Title | Diagnosing multivariate outliers detected by robust estimators |
Publication Type | Journal Article |
Year of Publication | 2009 |
Authors | Willems, G, Joe, H, Zamar, R |
Journal | Journal of Computational and Graphical Statistics |
Volume | 18 |
Pagination | 73-91 |
Date Published | MAR |
Type of Article | Article |
ISSN | 1061-8600 |
Keywords | Outlier diagnostics, Robust distances, Visualization of multivariate data |
Abstract | We propose a number of diagnostic methods that can be used whenever multiple outliers are identified by robust estimates for multivariate location and scatter. Their main purpose is visualization of the multivariate data to help determine whether the detected outliers (a) form separate clusters or (b) are isolated or randomly scattered (such as heavy tails compared with Gaussian). We make use of Mahalanobis distances and linear projections, to check for separation and to reveal additional aspects of the data structure. Several real data examples are analyzed, and artificial examples are used to illustrate the diagnostic power of the proposed plots. Code to perform the diagnostics, datasets used as examples in the article and documentation are available in the online supplements. |
DOI | 10.1198/jcgs.2009.0005 |