Statistics, Department of

 

Document Type

Article

Date of this Version

3-2023

Citation

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS 2023, VOL. 00, NO. 0, 1–16 https://doi.org/10.1080/10618600.2023.2195462

Comments

Open access.

Abstract

Parallel Coordinate Plots (PCP) are a valuable tool for exploratory data analysis of high-dimensional numerical data. The use of PCPs is limited when working with categorical variables or a mix of categorical and continuous variables. In this article, we propose Generalized Parallel Coordinate Plots (GPCP) to extend the ability of PCPs from just numeric variables to dealing seamlessly with a mix of categorical and numeric variables in a single plot. In this process we find that existing solutions for categorical values only, such as hammock plots or parsets become edge cases in the new framework. By focusing on individual observations rather than a marginal frequency we gain additional flexibility. The resulting approach is implemented in the R package ggpcp. Supplementary materials for this article are available online.

Share

COinS