Featured image of post Parallel Sets

Parallel Sets

Parallel sets visualize multidimensional categorical data. Several categorical variables are arranged as parallel axes, and bands connect categories across adjacent axes. The band width represents frequency or proportion.

The method extends the idea of parallel coordinates from continuous variables to categorical combinations.

Background

Parallel sets were developed in the 2000s by visualization researchers Robert Kosara and Jorg Hauser. Their paper “Parallel Sets: Interactive Exploration and Visual Analysis of Categorical Data” introduced the method as a way to explore contingency-table data across many dimensions.

The method supports interaction such as reordering axes, rearranging categories, and highlighting subsets.

Data Structure

ElementMeaning
AxisCategorical variable
BlockCategory on an axis
BandCombination between adjacent variables
WidthFrequency or proportion

Purpose

Parallel sets are used to understand relationships among several categorical variables, such as gender, age group, education, occupation, and income. They make multivariate cross-tabulations easier to inspect visually.

How to Read It

The vertical axes represent variables. Blocks on each axis are categories. Bands connect category combinations, and wider bands indicate more observations.

Ribbon width represents both marginal and conditional probability

A key feature is that band width can be read as both:

  • a share of the whole dataset
  • a conditional share within a category

Design Notes

  • Axis order strongly affects readability.
  • Reduce or group categories when there are too many.
  • Use consistent color logic.
  • Interaction is valuable for filtering and highlighting.

Alternatives

MethodData typeFeature
Parallel coordinatesContinuous variablesNumeric correlation patterns
Sankey diagramFlows or categoriesDirectional flow
Mosaic plotCategorical variablesArea-based cross-tabulation
TreemapHierarchyNested category area

Difference from Sankey Diagrams

Sankey diagrams usually represent directional flows such as energy, money, or process movement. Parallel sets represent combinations of categories. They are not necessarily directional, and relationships can be read from either side.

Summary

Parallel sets are powerful for exploring categorical data across many variables. Their clarity depends on careful axis ordering, category reduction, color design, and often interaction.

References

Licensed under CC BY-NC-SA 4.0
Last updated on Jun 12, 2026 10:18 +0900
Built with Hugo
Theme Stack designed by Jimmy