Last data update: 2014.03.03

R: Functionality related to DNA/RNA sequences
getGenomeFaFileR Documentation

Functionality related to DNA/RNA sequences

Description

Utility functions related to RNA/DNA sequences, such as extracting RNA/DNA sequences for features defined in Ensb.

Usage


## S4 method for signature 'EnsDb'
getGenomeFaFile(x, pattern="dna.toplevel.fa")



Arguments

(In alphabetic order)

pattern

For method getGenomeFaFile: the pattern to be used to identify the fasta file representing genomic DNA sequence.

x

For all other methods an EnsDb instance.

Value

For getGenomeFaFile: a FaFile-class object with the genomic DNA sequence.

Methods and Functions

getGenomeFaFile

Returns a FaFile-class (defined in Rsamtools) with the genomic sequence of the genome build matching the Ensembl version of the EnsDb object. The file is retrieved using the AnnotationHub package, thus, at least for the first invocation, an internet connection is required to locate and download the file; subsequent calls will load the cached file instead. If no fasta file for the actual Ensembl version is available the function tries to identify a file matchint the species and genome build version of the closest Ensembl release and returns that instead. See the vignette for an example to work with such files.

Author(s)

Johannes Rainer

See Also

BasicFilter transcripts exonsBy

Examples


## Loading an EnsDb for Ensembl version 75 (genome GRCh37):
library(EnsDb.Hsapiens.v75)
edb <- EnsDb.Hsapiens.v75

## Not run: 
    ## Retrieve a FaFile with the gneomic DNA sequence matching the organism,
    ## genome release version and, if possible, the Ensembl version of the
    ## EnsDb object.
    Dna <- getGenomeFaFile(edb)
    ## Extract the transcript sequence for all transcripts encoded on chromosome
    ## Y.
    ##extractTranscriptSeqs(Dna, edb, filter=SeqnameFilter("Y"))


## End(Not run)

Results


R version 3.3.1 (2016-06-21) -- "Bug in Your Hair"
Copyright (C) 2016 The R Foundation for Statistical Computing
Platform: x86_64-pc-linux-gnu (64-bit)

R is free software and comes with ABSOLUTELY NO WARRANTY.
You are welcome to redistribute it under certain conditions.
Type 'license()' or 'licence()' for distribution details.

R is a collaborative project with many contributors.
Type 'contributors()' for more information and
'citation()' on how to cite R or R packages in publications.

Type 'demo()' for some demos, 'help()' for on-line help, or
'help.start()' for an HTML browser interface to help.
Type 'q()' to quit R.

> library(ensembldb)
Loading required package: BiocGenerics
Loading required package: parallel

Attaching package: 'BiocGenerics'

The following objects are masked from 'package:parallel':

    clusterApply, clusterApplyLB, clusterCall, clusterEvalQ,
    clusterExport, clusterMap, parApply, parCapply, parLapply,
    parLapplyLB, parRapply, parSapply, parSapplyLB

The following objects are masked from 'package:stats':

    IQR, mad, xtabs

The following objects are masked from 'package:base':

    Filter, Find, Map, Position, Reduce, anyDuplicated, append,
    as.data.frame, cbind, colnames, do.call, duplicated, eval, evalq,
    get, grep, grepl, intersect, is.unsorted, lapply, lengths, mapply,
    match, mget, order, paste, pmax, pmax.int, pmin, pmin.int, rank,
    rbind, rownames, sapply, setdiff, sort, table, tapply, union,
    unique, unsplit

Loading required package: GenomicRanges
Loading required package: S4Vectors
Loading required package: stats4

Attaching package: 'S4Vectors'

The following objects are masked from 'package:base':

    colMeans, colSums, expand.grid, rowMeans, rowSums

Loading required package: IRanges
Loading required package: GenomeInfoDb
Loading required package: GenomicFeatures
Loading required package: AnnotationDbi
Loading required package: Biobase
Welcome to Bioconductor

    Vignettes contain introductory material; view with
    'browseVignettes()'. To cite Bioconductor, see
    'citation("Biobase")', and for packages 'citation("pkgname")'.

> png(filename="/home/ddbj/snapshot/RGM3/R_BC/result/ensembldb/EnsDb-sequences.Rd_%03d_medium.png", width=480, height=480)
> ### Name: getGenomeFaFile
> ### Title: Functionality related to DNA/RNA sequences
> ### Aliases: getGenomeFaFile getGenomeFaFile,EnsDb-method
> ### Keywords: classes
> 
> ### ** Examples
> 
> 
> ## Loading an EnsDb for Ensembl version 75 (genome GRCh37):
> library(EnsDb.Hsapiens.v75)
> edb <- EnsDb.Hsapiens.v75
> 
> ## Not run: 
> ##D     ## Retrieve a FaFile with the gneomic DNA sequence matching the organism,
> ##D     ## genome release version and, if possible, the Ensembl version of the
> ##D     ## EnsDb object.
> ##D     Dna <- getGenomeFaFile(edb)
> ##D     ## Extract the transcript sequence for all transcripts encoded on chromosome
> ##D     ## Y.
> ##D     ##extractTranscriptSeqs(Dna, edb, filter=SeqnameFilter("Y"))
> ##D 
> ## End(Not run)
> 
> 
> 
> 
> 
> 
> dev.off()
null device 
          1 
>