|
|||||||||||||||||||||||
DetailsPartial Least Squares Discriminant Analysis (PLS-DA) is a technique specifically appropriate for analysis of high dimensionality data sets and multicollinearity (Perez-Enciso, 2013). PLS-DA is a supervised method (i.e. makes use of class labels) with the aim to provide a dimension reduction strategy in a situation where we want to relate a binary response variable (in our case young or old status) to a set of predictor variables. Dimensionality reduction procedure is based on orthogonal transformations of the original variables (miRNAs/isomiRs) into a set of linearly uncorrelated latent variables (usually termed as components) such that maximizes the separation between the different classes in the first few components (Xia, 2011). We used sum of squares captured by the model (R2) as a goodness of fit measure. We implemented this method using the
Read more about the parameters related to the PLS-DA directly from plsDA function. ValueA If the option ReferencesPerez-Enciso, Miguel and Tenenhaus, Michel. Prediction of clinical outcome with microarray data: a partial least squares discriminant analysis (PLS-DA) approach. Human Genetics. 2003. Xia, Jianguo and Wishart, David S. Web-based inference of biological patterns, functions and pathways from metabolomic data using MetaboAnalyst. Nature Protocols. 2011. Examplesdata(mirData) # Only miRNAs with > 10 reads in all samples. ids <- isoCounts(mirData, minc=10, mins=6) ids <- isoNorm(ids) pls.ids = isoPLSDA(ids, "condition", nperm = 2) cat(paste0("pval:",pls.ids$p.val)) cat(paste0("components:",pls.ids$components)) ResultsR version 3.3.1 (2016-06-21) -- "Bug in Your Hair" Copyright (C) 2016 The R Foundation for Statistical Computing Platform: x86_64-pc-linux-gnu (64-bit) R is free software and comes with ABSOLUTELY NO WARRANTY. You are welcome to redistribute it under certain conditions. Type 'license()' or 'licence()' for distribution details. R is a collaborative project with many contributors. Type 'contributors()' for more information and 'citation()' on how to cite R or R packages in publications. Type 'demo()' for some demos, 'help()' for on-line help, or 'help.start()' for an HTML browser interface to help. Type 'q()' to quit R. > library(isomiRs) Loading required package: DiscriMiner Loading required package: IRanges Loading required package: BiocGenerics Loading required package: parallel Attaching package: 'BiocGenerics' The following objects are masked from 'package:parallel': clusterApply, clusterApplyLB, clusterCall, clusterEvalQ, clusterExport, clusterMap, parApply, parCapply, parLapply, parLapplyLB, parRapply, parSapply, parSapplyLB The following objects are masked from 'package:stats': IQR, mad, xtabs The following objects are masked from 'package:base': Filter, Find, Map, Position, Reduce, anyDuplicated, append, as.data.frame, cbind, colnames, do.call, duplicated, eval, evalq, get, grep, grepl, intersect, is.unsorted, lapply, lengths, mapply, match, mget, order, paste, pmax, pmax.int, pmin, pmin.int, rank, rbind, rownames, sapply, setdiff, sort, table, tapply, union, unique, unsplit Loading required package: S4Vectors Loading required package: stats4 Attaching package: 'S4Vectors' The following objects are masked from 'package:base': colMeans, colSums, expand.grid, rowMeans, rowSums Loading required package: GenomicRanges Loading required package: GenomeInfoDb Loading required package: SummarizedExperiment Loading required package: Biobase Welcome to Bioconductor Vignettes contain introductory material; view with 'browseVignettes()'. To cite Bioconductor, see 'citation("Biobase")', and for packages 'citation("pkgname")'. > png(filename="/home/ddbj/snapshot/RGM3/R_BC/result/isomiRs/isoPLSDA.Rd_%03d_medium.png", width=480, height=480) > ### Name: isoPLSDA > ### Title: Partial Least Squares Discriminant Analysis for 'IsomirDataSeq' > ### Aliases: isoPLSDA > > ### ** Examples > > data(mirData) > # Only miRNAs with > 10 reads in all samples. > ids <- isoCounts(mirData, minc=10, mins=6) > ids <- isoNorm(ids) converting counts to integer mode -- note: fitType='parametric', but the dispersion trend was not well captured by the function: y = a/x + b, and a local regression fit was automatically substituted. specify fitType='local' or 'mean' to avoid this message next time. > pls.ids = isoPLSDA(ids, "condition", nperm = 2) > cat(paste0("pval:",pls.ids$p.val)) pval:0> cat(paste0("components:",pls.ids$components)) components:16.6858092276317 components:3.04045594908574 components:9.34212467094332 components:-9.24066935893392 components:-9.70619388142206 components:-10.1215266073048 components:-12.841132088321 components:15.8274781050384 components:7.15087127544249 components:-2.0753270359708 components:-3.46336377223972 components:-4.59852648394939 components:2.49048849972438 components:5.62559676754491 components:-7.01897559719785 components:-3.9154729400621 components:-1.79013597554926 components:4.60849924553993> > > > > > dev.off() null device 1 > |
|||||||||||||||||||||||
|