Campus Units

Genetics, Development and Cell Biology, Statistics, Biochemistry, Biophysics and Molecular Biology

Document Type

Article

Publication Version

Published Version

Publication Date

2007

Journal or Book Title

Journal of Data Science

Volume

5

Issue

2

First Page

151

Last Page

182

Abstract

This paper describes how to explore gene expression data using a combination of graphical and numerical methods. We start from the general methodology for multivariate data visualization, describing heatmaps, parallel coordinate plots and scatterplots. We propose new methods for gene expression data analysis using direct manipulation graphics. With linked scatterplots and parallel coordinate plots we explore gene expression data differently than many common practices. To check replicates in relation to treatments we introduce a new type of plot called a “replicate line” plot. There is a worked example, that focuses on an experimental study containing two two-level factors, genotype and cofactor presence, with two replicates.

Comments

This article is from Journal of Data Science 5 (2007): 151. Posted with permission.

Copyright Owner

The Authors

Language

en

File Format

application/pdf