R: Annotate clusters with respect to transcript features
annotateClusters
R Documentation
Annotate clusters with respect to transcript features
Description
Carries out strand-specific annotation of clusters with respect to distinct
transcript features, particularly introns, coding sequences, 3'-UTRs,
5'-UTRs. Mapping to multiple features and to those outside the above
mentioned ones are reported. Unmapped clusters are then futher further
analyzed and annotated with respect to features localizing on the anti-sense
strand. Results can be plotted as dotchart and annotations are returned as
clusters metadata.
GRanges object containing individual clusters as identified
by the getClusters function
txDB
TranscriptDb object obtained through a call to the
makeTxDbFromUCSC function in the GenomicFeatures
package. Default is NULL, namely the object will be fetched internally
genome
A character specifying the genome abbreviation used by UCSC.
Available abbreviations are returned by a call to ucscGenomes()[ ,
"db"]. Default is "hg19" (human genome)
tablename
A character specifying the name of the UCSC table
containing the transcript annotations to retrieve. Available table names are
returned by a call to supportedUCSCtables(). Default is "ensGene",
namely ensembl gene annotations
plot
Logical, if TRUE a dotchart with cluster annotations is produced
verbose
Logical, if TRUE processing steps are printed
Value
Same as the input GRanges object, with an additional metadata column
containing the following character encoding of the genomic feature each
cluster maps to:
"CDS ss"
Coding Sequence Sense Strand
"Introns ss"
Intron Sense Strand
"3' UTR ss"
3' UTR Sense
Strand
"5' UTR ss"
5' UTR Sense Strand
"Multiple"
More than
one of the above
"CDS as"
Coding Sequence Antisense Strand
"Introns as"
Intron Antisense Strand
"3' UTR as"
3' UTR
Antisense Strand
"5' UTR as"
5' UTR Antisense Strand
"Other"
None of the above
If plot=TRUE, a dotchart is
produced in addition.
Author(s)
Federico Comoglio
References
M. Carlson and H. Pages and P. Aboyoun and S. Falcon and M.
Morgan and D. Sarkar and M. Lawrence, GenomicFeatures: Tools for making and
manipulating transcript centric annotations, R package version 1.12.4
Comoglio F, Sievers C and Paro R (2015) Sensitive and highly resolved identification
of RNA-protein interaction sites in PAR-CLIP data, BMC Bioinformatics 16, 32.