Last data update: 2014.03.03

R: Get dataset in ExpressionSet format
getDatasetR Documentation

Get dataset in ExpressionSet format

Description

Get dataset for a given dataset identifier and platform identifier in a specific format.

Usage

getDataset(dataset, platform, ...);

Arguments

dataset

Valid dataset identifier: the GEO or InSilico DB accession code.

platform

The platform of the chip on which a measurement was done. A platform is identified by its GEO accession code.

...

Optional arguments can be passed to the function for more specific requests:

format = "CURESET"

File format in which the dataset should be returned.

Possible values:

  • CURESET : Curated R/Bioconductor ExpressionSet.

  • ESET : R/Bioconductor ExpressionSet without curation.

features = "PROBE"

Type of features in which data is represented. Two possible values: "GENE" or "PROBE". By default an expression matrix containing probes is returned.

norm = "ORIGINAL"

Type of normalization to use when pre-processing a dataset.

Possible normalizations: "ORIGINAL", "FRMA"[1], "SCAN"[2].

curation

Specific curation used to annotate a dataset. A curation is identified by a InSilico DB Curation identifier. You can annotate datasets yourself at https://insilicodb.com/app/browse If not defined, the prefered curation of specified dataset will be used.

timeout = 120

Time (in seconds) to wait for the dataset, if it has to be generated.

Value

A Bioconductors ExpressionSet.

References

[1] MN. McCall, BM. Bolstad, and RA. Irizarry, Frozen Robust Multi-Array Analysis (fRMA), Biostatistics, vol. 11, no. 2, pp. 242-253 , 2008.

[2] SR. Piccolo, Y. Sun, JD. Campbell, ME. Lenburg, AH. Bild, and WE. Johnson, A single-sample microarray normalization method to facilitate personalized-medicine workflows, Genomics, vol. 100, no. 6, pp. 337-344, 2012.

See Also

getDatasetInfo

Examples

InSilicoLogin("rpackage_tester@insilicodb.com", "5c4d0b231e5cba4a0bc54783b385cc9a");
dataset.id <- "GSE4635";
platform.id <- getPlatforms(dataset.id);
eset <- getDataset(dataset.id, platform.id[[1]],
                   norm = "ORIGINAL", features = "PROBE");
print(dim(eset));
## Features  Samples 
##    22215        8 
eset <- getDataset( dataset.id, platform.id[[1]],
                   norm = "ORIGINAL", features = "GENE");
print(dim(eset));
## Features  Samples 
##    12698        8 

Results


R version 3.3.1 (2016-06-21) -- "Bug in Your Hair"
Copyright (C) 2016 The R Foundation for Statistical Computing
Platform: x86_64-pc-linux-gnu (64-bit)

R is free software and comes with ABSOLUTELY NO WARRANTY.
You are welcome to redistribute it under certain conditions.
Type 'license()' or 'licence()' for distribution details.

R is a collaborative project with many contributors.
Type 'contributors()' for more information and
'citation()' on how to cite R or R packages in publications.

Type 'demo()' for some demos, 'help()' for on-line help, or
'help.start()' for an HTML browser interface to help.
Type 'q()' to quit R.

> library(inSilicoDb)
Loading required package: rjson
Loading required package: Biobase
Loading required package: BiocGenerics
Loading required package: parallel

Attaching package: 'BiocGenerics'

The following objects are masked from 'package:parallel':

    clusterApply, clusterApplyLB, clusterCall, clusterEvalQ,
    clusterExport, clusterMap, parApply, parCapply, parLapply,
    parLapplyLB, parRapply, parSapply, parSapplyLB

The following objects are masked from 'package:stats':

    IQR, mad, xtabs

The following objects are masked from 'package:base':

    Filter, Find, Map, Position, Reduce, anyDuplicated, append,
    as.data.frame, cbind, colnames, do.call, duplicated, eval, evalq,
    get, grep, grepl, intersect, is.unsorted, lapply, lengths, mapply,
    match, mget, order, paste, pmax, pmax.int, pmin, pmin.int, rank,
    rbind, rownames, sapply, setdiff, sort, table, tapply, union,
    unique, unsplit

Welcome to Bioconductor

    Vignettes contain introductory material; view with
    'browseVignettes()'. To cite Bioconductor, see
    'citation("Biobase")', and for packages 'citation("pkgname")'.

Loading required package: RCurl
Loading required package: bitops
> png(filename="/home/ddbj/snapshot/RGM3/R_BC/result/inSilicoDb/getDataset.Rd_%03d_medium.png", width=480, height=480)
> ### Name: getDataset
> ### Title: Get dataset in ExpressionSet format
> ### Aliases: getDataset
> 
> ### ** Examples
> 
> InSilicoLogin("rpackage_tester@insilicodb.com", "5c4d0b231e5cba4a0bc54783b385cc9a");
  INSILICODB: Welcome RPackage Tester
[1] 5296
> dataset.id <- "GSE4635";
> platform.id <- getPlatforms(dataset.id);
> eset <- getDataset(dataset.id, platform.id[[1]],
+                    norm = "ORIGINAL", features = "PROBE");
  INSILICODB: The dataset you requested could not be computed. We are sorry for the inconvenience.
Error: Stopped because of previous errors
Execution halted