Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3002151.3002162acmconferencesArticle/Chapter ViewAbstractPublication Pagessiggraph-asiaConference Proceedingsconference-collections
research-article
Public Access

Visual exploration of multiway dependencies in multivariate data

Published: 28 November 2016 Publication History

Abstract

Analyzing dependencies among variables within multivariate data is an important and challenging problem, especially when the number of data points is large, the number of variables is high, or multiway dependencies are of interest. Several visualization methods have been proposed to aid in the exploration of such information through the direct visualization of the summary statistics. These methods are typically limited to the study of all possible pairwise relationship but in a manner that does not scale to large multidimensional data. In cases where 3-way relationships are investigated, only subsets of dimensions are considered. In this paper, we propose a novel technique for analyzing multiway dependencies through an overview+detail visualization. In this approach, the overview represents all pairwise, 3-, and 4-way dependencies in the data using glyphs that provide a global visual exploration interface for selecting candidate relationships. Exploration is supported through interactive filtering, sorting, zooming, and selection operations. Once selected, the detailed view helps in developing an inference by providing specific information about those selected variables. Various use cases demonstrate how our approach helps to explore multiway dependencies efficiently in large datasets.

Supplementary Material

ZIP File (a2-nguyen.zip)
Supplemental files.

References

[1]
Allison, P. D. 1998. Multiple Regression: A Primer. Sage Publications.
[2]
Beham, M., Herzner, W., Gröller, M. E., and Kehrer, J. 2014. Cupid: Cluster-based exploration of geometry generators with parallel coordinates and radial trees. IEEE TVCG 20, 12, 1693--1702.
[3]
Benesty, J., Chen, J., and Huang, Y. 2008. On the importance of the pearson correlation coefficient in noise reduction. IEEE Trans. on ASLP 16, 4, 757--765.
[4]
Cao, N., Lin, Y.-R., and Gotz, D. 2015. Untangle map: Visual analysis of probabilistic multi-label data. IEEE TVCG PP, 99, 1--1.
[5]
Chan, Y.-H., Correa, C. D., and Ma, K.-L. 2010. Flow-based scatterplots for sensitivity analysis. In IEEE VAST, 43--50.
[6]
Chen, Y. A., Almeida, J. S., Richards, A. J., Muller, P., Carroll, R. J., and Rohrer, B. 2010. A nonparametric approach to detect nonlinear correlation in gene expression. Journal of CGS 19, 3, 552--568.
[7]
Chen, C.-K., Wang, C., Ma, K.-L., and Wittenberg, A. T. 2011. Static correlation visualization for large time-varying volume data. PacificVis, 27--34.
[8]
Elmqvist, N., Dragicevic, P., and Fekete, J.-D. 2008. Rolling the dice: Multidimensional visual exploration using scatterplot matrix navigation. IEEE TVCG 14, 6, 1141--1148.
[9]
Friendly, M. 2002. Corrgrams: Exploratory displays for correlation matrices. The American Statistician 1.
[10]
Glatter, M., Huang, J., Ahern, S., Daniel, J., and Lu, A. 2008. Visualizing temporal patterns in large multivariate data using modified globbing. IEEE TVCG 14, 6, 1467--1474.
[11]
Gosink, L., Anderson, J. C., Bethel, E. W., and Joy, K. I. 2007. Variable Interactions in Query-Driven Visualization. IEEE TVCG 13, 6, 1400--1407.
[12]
Gu, Y., and Wang, C. 2010. A study of hierarchical correlation clustering for scientific volume data. In Advances in Visual Computing, 437--446.
[13]
Harrison, L., Yang, F., Franconeri, S., and Chang, R. 2014. Ranking visualizations of correlation using weber's law. IEEE TVCG 20, 12, 1943--1952.
[14]
Hartigan, J. A. 1975. Printer graphics for clustering. Journal of Statistical Computation and Simulation 4, 3.
[15]
Heinrich, J., and Weiskopf, D. 2013. State of the art of parallel coordinates. In Eurographics STAR, 95--116.
[16]
Hogg, R. V., and Craig, A. T. 1995. Introduction to Mathematical Statistics, 5th ed. Macmillan.
[17]
Hogg, R. V., and Craig, A. T. 1998. Nonparametrics: Statistical Methods Based on Ranks. Prentice-Hall.
[18]
Huang, T.-H., Huang, M. L., and Zhang, K. 2012. An interactive scatter plot metrics visualization for decision trend analysis. In Int'l Conf. on Mach. Learning & App., 258--264.
[19]
Inselberg, A. 1985. The plane with parallel coordinates. Visual Computer 1, 2, 69--91.
[20]
Janicke, H., Bottinger, M., and Scheuermann, G. 2008. Brushing of attribute clouds for the visualization of multivariate data. IEEE TVCG 14, 6, 1459--1466.
[21]
Jen, D., Parente, P., Robbins, J., Weigle, C., Taylor, R., Burette, A., and Weinberg, R. 2004. Imagesurfer: a tool for visualizing correlations between two volume scalar fields. In IEEE VIS, 529--536.
[22]
Jeong, D. H., Ziemkiewicz, C., Fisher, B., Ribarsky, W., and Chang, R. 2009. ipca: An interactive system for pca-based visual analytics. In EuroVis, 767--774.
[23]
Kay, M., and Heer, J. 2016. Beyond weber's law: A second look at ranking visualizations of correlation. IEEE TVCG 22, 1, 469--478.
[24]
Ke, Y., Cheng, J., and Ng, W. 2008. Efficient correlation search from graph databases. IEEE Trans. Know. & Data Eng. 20, 12, 1601--1615.
[25]
Keim, D. A., Mansmann, F., Schneidewind, J., and Ziegler, H. 2006. Challenges in visual data analysis. In IEEE InfoVis, 9--16.
[26]
Keith, T. 2006. Multiple Regression and Beyond. Pearson Education.
[27]
Liu, X., and Shen, H. W. 2016. Association analysis for visual exploration of multivariate scientific data sets. IEEE TVCG 22, 1 (Jan), 955--964.
[28]
Magnello, E., and Vanloon, B. 2009. Introducing Statistics: A Graphic Guide. Icon Books Ltd.
[29]
Nagaraj, S., Natarajan, V., and Nanjundiah, R. S. 2011. A gradient-based comparison measure for visual analysis of multifield data. In EuroVis 2011, 1101--1110.
[30]
Nguyen, H., and Rosen, P. 2016. Improved identification of data correlations through correlation coordinate plots. In Int'l Conf. on Info. Vis. Theory & App.
[31]
Pfaffelmoser, Tobias, and Westermann. 2013. Correlation visualization for structural uncertainty analysis. International Journal for Uncertainty Quantification 3, 2.
[32]
Qu, H., Chan, W.-Y., Xu, A., Chung, K.-L., Lau, K.-H., and Guo, P. 2007. Visual analysis of the air pollution problem in hong kong. IEEE TVCG 13, 6, 1408--1415.
[33]
Sauber, N., Theisel, H., and Seidel, H. 2006. Multifield-graphs: An approach to visualizing correlations in multifield scalar data. IEEE TVCG 12, 5, 917--924.
[34]
Sukharev, J., Wang, C., Ma, K., and Wittenberg, A. T. 2009. Correlation study of time-varying multivariate climate data sets. In PacificVis, 161--168.
[35]
Teoh, S., and Ma, K.-L. 2005. Hifocon: Object and dimensional coherence and correlation in multidimensional visualization. In Advances in Visual Computing, vol. 3804, 235--242.
[36]
Thompson, S. 1992. Sampling. John Wiley, Sons, Inc.
[37]
Viau, C., McGuffin, M. J., Chiricota, Y., and Jurisica, I. 2010. The flowvizmenu and parallel scatterplot matrix: Hybrid multidimensional visualizations for network exploration. IEEE TVCG 16, 6, 1100--1108.
[38]
Wang, J., and Zheng, N. 2013. A novel fractal image compression scheme with block classification and sorting based on pearson's correlation coefficient. IEEE Trans. on Image Proc. 22, 9.
[39]
Yuan, X., Guo, P., Xiao, H., Zhou, H., and Qu, H. 2009. Scattering points in parallel coordinates. IEEE TVCG 15, 6, 1001--1008.

Cited By

View all
  • (2023)Automatic Scatterplot Design Optimization for Clustering IdentificationIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2022.318988329:10(4312-4327)Online publication date: 1-Oct-2023
  • (2021)Modeling the Influence of Visual Density on Cluster Perception in Scatterplots Using TopologyIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2020.303036527:2(1829-1839)Online publication date: Feb-2021
  • (2019)Efficient Optimal Overlap Removal: Algorithms and ExperimentsComputer Graphics Forum10.1111/cgf.1372238:3(713-723)Online publication date: 10-Jul-2019
  • Show More Cited By

Index Terms

  1. Visual exploration of multiway dependencies in multivariate data

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SA '16: SIGGRAPH ASIA 2016 Symposium on Visualization
    November 2016
    129 pages
    ISBN:9781450345477
    DOI:10.1145/3002151
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 28 November 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. correlation visualization
    2. multivariate data visualization
    3. statistical visualization
    4. variable dependency visualization

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    SA '16
    Sponsor:
    SA '16: SIGGRAPH Asia 2016
    December 5 - 8, 2016
    Macau

    Acceptance Rates

    Overall Acceptance Rate 178 of 869 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)41
    • Downloads (Last 6 weeks)11
    Reflects downloads up to 21 Sep 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Automatic Scatterplot Design Optimization for Clustering IdentificationIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2022.318988329:10(4312-4327)Online publication date: 1-Oct-2023
    • (2021)Modeling the Influence of Visual Density on Cluster Perception in Scatterplots Using TopologyIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2020.303036527:2(1829-1839)Online publication date: Feb-2021
    • (2019)Efficient Optimal Overlap Removal: Algorithms and ExperimentsComputer Graphics Forum10.1111/cgf.1372238:3(713-723)Online publication date: 10-Jul-2019
    • (2017)Visual Analytic Design for Characterizing Air-Sampling Sensor Performance and Operation2017 IEEE Conference on Visual Analytics Science and Technology (VAST)10.1109/VAST.2017.8585678(217-218)Online publication date: Oct-2017

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media