Last data update: 2014.03.03

R: Process a list of S4 ExpressionSet objects.
processExpressionSetList R Documentation

Process a list of S4 ExpressionSet objects.

Description

A wrapper that applies the post-processing function processExpressionSet() to each element of a list of S4 ExpressionSet objects. Run this function after initial dataset normalization, such as quantile normalization of microarray datasets.
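The list-wrapper pattern described above can be sketched in base R with lapply(). This is only an illustration: processOne is a hypothetical stand-in for a per-dataset step, plain matrices stand in for ExpressionSet objects, and nothing here reflects the actual internals of processExpressionSet().

```r
# Hypothetical stand-in for a per-dataset post-processing step:
# drop rows (genes) that contain any missing value.
processOne <- function(exprs) {
  exprs[stats::complete.cases(exprs), , drop = FALSE]
}

# Plain matrices stand in for ExpressionSet objects in this sketch.
datasets <- list(
  studyA = matrix(c(1, 2, NA, 4, 5, 6), nrow = 3),
  studyB = matrix(1:8, nrow = 4)
)

# Apply the same step to every dataset; list names are preserved.
processed <- lapply(datasets, processOne)
sapply(processed, nrow)
```

Because lapply() returns a list with the same names as its input, each processed dataset can still be looked up by its original study name.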

Usage

processExpressionSetList(exprSetList, outputFileDirectory = "./", 
numTopVarGenes, minVarPercentile, maxVarPercentile = 1, minVar)

Arguments

exprSetList

List of S4 ExpressionSet objects.

outputFileDirectory

Directory where the status messages from post-processing the ExpressionSets are written.

minVarPercentile

Minimum variance percentile. Must be provided in conjunction with maxVarPercentile to use percentiles to threshold genes.

maxVarPercentile

Maximum variance percentile. Default is 1, i.e. 1%. Must be provided in conjunction with minVarPercentile to use percentiles to threshold genes.

minVar

If minVar is provided, as opposed to minVarPercentile and maxVarPercentile, genes whose variance falls below this magnitude are removed. This is helpful before running certain algorithms, such as the popular ComBat batch normalization technique, that can throw errors if genes with extremely low variances are in the data matrix. May be used in conjunction with maxVar or in isolation.

numTopVarGenes

A numeric value giving the number of genes (features) to select; the function keeps only this many genes, chosen as those with the highest variance.
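The variance-based arguments can be illustrated on a plain genes-by-samples matrix. The following is a minimal base R sketch, not the package's implementation: filterByVariance and the toy matrix are hypothetical, and only the absolute threshold (minVar) and top-N selection (numTopVarGenes) are shown.

```r
# Hypothetical helper illustrating variance-based gene filtering on a plain
# genes-by-samples matrix; this is NOT the package's implementation.
filterByVariance <- function(exprs, numTopVarGenes = NULL, minVar = NULL) {
  # Per-gene variance across samples (rows are genes).
  geneVars <- apply(exprs, 1, var)
  keep <- rep(TRUE, nrow(exprs))

  # Drop genes whose variance falls below an absolute threshold.
  if (!is.null(minVar)) {
    keep <- keep & (geneVars >= minVar)
  }

  # Among the remaining genes, keep the numTopVarGenes most variable ones.
  if (!is.null(numTopVarGenes)) {
    ranked <- order(geneVars, decreasing = TRUE)
    ranked <- ranked[keep[ranked]]
    top <- head(ranked, numTopVarGenes)
    keep <- seq_len(nrow(exprs)) %in% top
  }

  exprs[keep, , drop = FALSE]
}

set.seed(1)
toy <- matrix(rnorm(20 * 6), nrow = 20,
              dimnames = list(paste0("gene", 1:20), paste0("s", 1:6)))
toy[1, ] <- 5  # a constant gene, so its variance is exactly zero

top5 <- filterByVariance(toy, numTopVarGenes = 5)  # 5 most variable genes
noFlat <- filterByVariance(toy, minVar = 1e-8)     # removes the constant gene
dim(top5)
dim(noFlat)
```

Applying the absolute threshold before the top-N ranking means a near-constant gene can never be selected, even when numTopVarGenes exceeds the number of informative genes.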

Value

A list of processed S4 ExpressionSet objects.

Author(s)

Katie Planey <katie.planey@gmail.com>

See Also

processExpressionSet

Examples

## Not run: 
# Warning: this takes a while to run, because it processes all datasets in the package.
# Load the datasets.
data(curatedBreastDataExprSetList)

# Keep only the top 5000 genes by variance.
# This post-processes every dataset in the package
# to make it ready for downstream analyses.
proc_curatedBreastDataExprSetList <- processExpressionSetList(
  exprSetList = curatedBreastDataExprSetList,
  outputFileDirectory = "./", numTopVarGenes = 5000)

## End(Not run)
