Diagnosing Multivariate Outliers Detected by Robust Estimators

Subscribe to email list

Please select the email list(s) to which you wish to subscribe.

User menu

You are here

Diagnosing Multivariate Outliers Detected by Robust Estimators

TitleDiagnosing Multivariate Outliers Detected by Robust Estimators
Publication TypeJournal Article
Year of Publication2009
AuthorsWillems, G, Joe, H, Zamar, R
JournalJOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS
Volume18
Pagination73-91
Date PublishedMAR
Type of ArticleArticle
ISSN1061-8600
KeywordsOutlier diagnostics, Robust distances, Visualization of multivariate data
AbstractWe propose a number of diagnostic methods that can be used whenever multiple outliers are identified by robust estimates for multivariate location and scatter. Their main purpose is visualization of the multivariate data to help determine whether the detected outliers (a) form separate clusters or (b) are isolated or randomly scattered (such as heavy tails compared with Gaussian). We make use of Mahalanobis distances and linear projections, to check for separation and to reveal additional aspects of the data structure. Several real data examples are analyzed, and artificial examples are used to illustrate the diagnostic power of the proposed plots. Code to perform the diagnostics, datasets used as examples in the article and documention are available in the online supplements.
DOI10.1198/jcgs.2009.0005