Dirk Husmeier - Essays

small picture of Dirk subject logo

 

» Dirk's Home Page
Other Essays:-

Determining Gene Expression Profiles with cDNA Microarrays

August 2001
  1. Objective
  2. Applications
  3. Biological facts
  4. Normalisation
  5. Analysis
  6. References

Objective

Determine the pattern of gene expression to understand the following:

Applications

Biological facts

Normalisation

For further details, see Yang et al.

Analysis

The high dimensionality of gene expression data calls for new methods that automatically detect interesting structures in the data. After image pre-processing and normalisation, which in itself is the subject of current microarray research, one has to solve the following information extraction problems:

  1. Dimension reduction: We want to do better than PCA.
  2. Variable selection: Which genes are biologically relevant?
  3. Classification: Assign tissue samples to phenotypically characterised categories, e.g. different types of tumour.
  4. Clustering: Detect previously unknown relationships among genes, among tissues, or between genes and tissues.

An interesting approach addressing these problems was suggested by v. Heydebreck (MPI Berlin).
ad 2)
Use a score based on diagonal linear discriminant analysis to find genes whose expression levels strongly correlate with a known class distinction.
ad 1 and 3)
Project the expression profiles onto discriminant axes determined by subsets of selected genes and test if a priori known classes are well separated (supervised training).
ad 4)
Unsupervised exploration: Look for new, better separated clusters by combining a heuristic initialisation and a greedy search in the graph of all bipartitions.

The last step offers a "quality check" of existing classifications (by measuring the distance to the local maximum on the bipartition graph) and may detect new biologically relevant structures related to cell type, mutational status, response to a drug, tumour progression etc.

References

^ Top