Quality-based guidance for exploratory dimensionality reduction

Fernstad, Sara Johansson, Shaw, Jane and Johansson, Jimmy (2013) Quality-based guidance for exploratory dimensionality reduction. Information Visualization, 12 (1). pp. 44-64. ISSN 1473-8716

Full text not available from this repository. (Request a copy)
Official URL: http://dx.doi.org/10.1177/1473871612460526

Abstract

High-dimensional data sets containing hundreds of variables are difficult to explore, as traditional visualization methods often are unable to represent such data effectively. This is commonly addressed by employing dimensionality reduction prior to visualization. Numerous dimensionality reduction methods are available. However, few reduction approaches take the importance of several structures into account and few provide an overview of structures existing in the full high-dimensional data set. For exploratory analysis, as well as for many other tasks, several structures may be of interest. Exploration of the full high-dimensional data set without reduction may also be desirable. This paper presents flexible methods for exploratory analysis and interactive dimensionality reduction. Automated methods are employed to analyse the variables, using a range of quality metrics, providing one or more measures of ‘interestingness’ for individual variables. Through ranking, a single value of interestingness is obtained, based on several quality metrics, that is usable as a threshold for the most interesting variables. An interactive environment is presented in which the user is provided with many possibilities to explore and gain understanding of the high-dimensional data set. Guided by this, the analyst can explore the high-dimensional data set and interactively select a subset of the potentially most interesting variables, employing various methods for dimensionality reduction. The system is demonstrated through a use-case analysing data from a DNA sequence-based study of bacterial populations.

Item Type: Article
Uncontrolled Keywords: High-dimensional data, dimensionality reduction, quality metrics, visual exploration, interactive visual analysis
Subjects: G400 Computer Science
Department: Faculties > Engineering and Environment > Computer Science and Digital Technologies
Depositing User: Sara Johansson Fernstad
Date Deposited: 19 Mar 2015 17:23
Last Modified: 10 Aug 2015 11:09
URI: http://nrl.northumbria.ac.uk/id/eprint/21649

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics


Policies: NRL Policies | NRL University Deposit Policy | NRL Deposit Licence