Necessary but not Sufficient: Limitations of Projection Quality Metrics

dc.contributor.authorMachado, Alisteren_US
dc.contributor.authorBehrisch, Michaelen_US
dc.contributor.authorTelea, Alexandruen_US
dc.contributor.editorAigner, Wolfgangen_US
dc.contributor.editorAndrienko, Nataliaen_US
dc.contributor.editorWang, Beien_US
dc.date.accessioned2025-05-26T06:36:13Z
dc.date.available2025-05-26T06:36:13Z
dc.date.issued2025
dc.description.abstractHigh-dimensional data analysis often uses dimensionality reduction (DR, also called projection) to map data patterns to human-digestible visual patterns in a 2D scatterplot. Yet, DR methods may fail to show true data patterns and/or create visual patterns that do not represent any data patterns. Projection Quality Metrics (PQMs) are used as objective measures to gauge the above process: the higher a projection's scores in PQMs, the more it is deemed faithful to the data it represents. We show that, while PQMs can be used as exclusion criteria - low values usually mean poor projections - the converse does not always hold. For this, we develop a technique to automatically generate projections that score similar or even higher PQM values than projections created by well-known techniques, but show different, often confusing, visual patterns. Our results show that accepted PQMs cannot be used as an exclusive way to tell whether a projection yields accurate and interpretable visual patterns - in this sense, PQMs play a role akin to that of summary statistics in exploratory data analysis. We also show that not all studied metrics can be fooled equally well, suggesting a ranking of metrics in their ability to reliably capture quality.en_US
dc.description.sectionheadersDimensionality Reduction and High-Dimensional Data
dc.description.seriesinformationComputer Graphics Forum
dc.identifier.doi10.1111/cgf.70101
dc.identifier.issn1467-8659
dc.identifier.pages12 pages
dc.identifier.urihttps://doi.org/10.1111/cgf.70101
dc.identifier.urihttps://diglib.eg.org/handle/10.1111/cgf70101
dc.publisherThe Eurographics Association and John Wiley & Sons Ltd.en_US
dc.rightsAttribution 4.0 International License
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectCCS Concepts: Mathematics of computing → Dimensionality reduction; Computing methodologies → Machine learning; Humancentered computing → Information visualization
dc.subjectMathematics of computing → Dimensionality reduction
dc.subjectComputing methodologies → Machine learning
dc.subjectHumancentered computing → Information visualization
dc.titleNecessary but not Sufficient: Limitations of Projection Quality Metricsen_US
Files
Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
cgf70101.pdf
Size:
28.73 MB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
1019-file-i8.pdf
Size:
11.07 MB
Format:
Adobe Portable Document Format
Collections