R: P-values and test-statistics from the Hedenfalk et al. (2001)...
hedenfalk
R Documentation
P-values and test-statistics from the Hedenfalk et al. (2001) gene
expression dataset
Description
The data from the breast cancer gene expression study of
Hedenfalk et al. (2001) were obtained and analyzed. A comparison was made
between 3,226 genes of two mutation types, BRCA1 (7 arrays) and BRCA2 (8
arrays). The data included here are p-values, test-statistics, and
permutation null test-statistics obtained from a two-sample t-test analysis
on a set of 3170 genes, as described in Storey and Tibshirani (2003).
Usage
data(hedenfalk)
Value
A list called hendfalk containing:
p
Vector of 3,170
p-values of tests comparing BRCA1 to BRCA2.
stat
Vector of 3,170
absolute two-sample t-statistics comparing BRCA1 to BRCA2.
stat0
A
3,170 by 100 matrix of absolute two-sample t-statistics from 100 independent
permutations of the BRCA1 and BRCA2 labels; the row stat0[i,].
contains the permutation statistics corresponding to observed statistic
stat[i].
References
Hedenfalk I et al. (2001). Gene expression profiles in hereditary breast cancer.
New England Journal of Medicine, 344: 539-548.
Storey JD and Tibshirani R. (2003). Statistical significance for genome-wide
studies. Proceedings of the National Academy of Sciences, 100: 9440-9445. http://www.pnas.org/content/100/16/9440.full
See Also
qvalue, empPvals
Examples
# import data
data(hedenfalk)
stat <- hedenfalk$stat
stat0 <- hedenfalk$stat0 #vector from null distribution
p.pooled <- empPvals(stat=stat, stat0=stat0)
p.testspecific <- empPvals(stat=stat, stat0=stat0, pool=FALSE)
#compare pooled to test-specific p-values
qqplot(p.pooled, p.testspecific); abline(0,1)
# calculate q-values and view results
qobj <- qvalue(p.pooled)
summary(qobj)
hist(qobj)
plot(qobj)
Results
R version 3.3.1 (2016-06-21) -- "Bug in Your Hair"
Copyright (C) 2016 The R Foundation for Statistical Computing
Platform: x86_64-pc-linux-gnu (64-bit)
R is free software and comes with ABSOLUTELY NO WARRANTY.
You are welcome to redistribute it under certain conditions.
Type 'license()' or 'licence()' for distribution details.
R is a collaborative project with many contributors.
Type 'contributors()' for more information and
'citation()' on how to cite R or R packages in publications.
Type 'demo()' for some demos, 'help()' for on-line help, or
'help.start()' for an HTML browser interface to help.
Type 'q()' to quit R.
> library(qvalue)
> png(filename="/home/ddbj/snapshot/RGM3/R_BC/result/qvalue/hedenfalk.Rd_%03d_medium.png", width=480, height=480)
> ### Name: hedenfalk
> ### Title: P-values and test-statistics from the Hedenfalk et al. (2001)
> ### gene expression dataset
> ### Aliases: hedenfalk
> ### Keywords: dataset, hedenfalk
>
> ### ** Examples
>
> # import data
> data(hedenfalk)
> stat <- hedenfalk$stat
> stat0 <- hedenfalk$stat0 #vector from null distribution
>
> p.pooled <- empPvals(stat=stat, stat0=stat0)
> p.testspecific <- empPvals(stat=stat, stat0=stat0, pool=FALSE)
>
> #compare pooled to test-specific p-values
> qqplot(p.pooled, p.testspecific); abline(0,1)
>
> # calculate q-values and view results
> qobj <- qvalue(p.pooled)
> summary(qobj)
Call:
qvalue(p = p.pooled)
pi0: 0.669926
Cumulative number of significant calls:
<1e-04 <0.001 <0.01 <0.025 <0.05 <0.1 <1
p-value 15 76 265 424 605 868 3170
q-value 0 0 1 73 162 319 3170
local FDR 0 0 3 30 85 167 2241
> hist(qobj)
> plot(qobj)
>
>
>
>
>
>
> dev.off()
null device
1
>