R: From a transcript identifier to 'GRanges' object
etrid2grl    R Documentation

From a transcript identifier to GRanges object


This function takes one or more Ensembl transcript identifiers, queries Biomart and constructs a GRangesList object, as Gviz::BiomartGeneRegionTrack would for a genomic region. Currently, most of the code is taken from Gviz::.fetchBMData, and Gviz::.chrName is used to validate chromosome names.


etrid2grl(etrid, ens, use.names = FALSE)



etrid: A vector of Ensembl transcript identifiers.


ens: An instance of class Mart from biomaRt. If missing, useMart("ensembl", "hsapiens_gene_ensembl") is used.


use.names: If set to TRUE and etrid has names, then the latter are used to name the output.
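
As a minimal sketch of the ens and use.names arguments, the snippet below builds the Mart explicitly and passes a named vector of identifiers. The labels tx1 and tx2 are made up for illustration. The query only runs when biomaRt is installed and etrid2grl is already available in the session (this page does not name the package that provides it); a network connection to Biomart is also assumed.

```r
## Named vector of Ensembl transcript identifiers; "tx1" and "tx2" are
## hypothetical labels chosen for this illustration.
id <- c(tx1 = "ENST00000612959", tx2 = "ENST00000317091")

## Only query Biomart when the required pieces are present.
if (requireNamespace("biomaRt", quietly = TRUE) && exists("etrid2grl")) {
    ## Same Mart as the documented default, built explicitly.
    ens <- biomaRt::useMart("ensembl", "hsapiens_gene_ensembl")
    grl <- etrid2grl(id, ens, use.names = TRUE)
    ## Per the argument description, the output should be named after
    ## names(id) instead of the identifiers themselves.
    print(names(grl))
}
```

Building the Mart once and reusing it across calls avoids relying on the default being constructed on every invocation.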


A GRangesList object of length length(etrid).
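
To illustrate how such a return value can be inspected, here is a small sketch that builds a stand-in GRangesList by hand. It assumes the GenomicRanges package is installed; the coordinates are copied from the first three ranges of the example output on this page, and the object is not obtained from Biomart.

```r
library(GenomicRanges)

## Hand-built stand-in for one transcript (first three ranges of the
## ENST00000612959 example output); an illustration only.
gr <- GRanges(seqnames = "chr17",
              ranges = IRanges(start = c(30116807, 30172542, 30172590),
                               end = c(30116863, 30172589, 30172598)),
              strand = "+",
              feature = c("utr5", "utr5", "protein_coding"))
grl <- GRangesList(ENST00000612959 = gr)

length(grl)   # one list element per transcript identifier
names(grl)    # here, the transcript identifier

## Extract one transcript and keep only its coding ranges.
tx <- grl[[1]]
cds <- tx[tx$feature == "protein_coding"]
sum(width(cds))   # total width of the coding ranges in this toy object
```

The same accessors (length, names, [[, width) apply to the object returned by etrid2grl, since it is documented to be a GRangesList.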


Laurent Gatto


id <- c("ENST00000612959", "ENST00000317091")
grl1 <- etrid2grl(id[1])
grl <- etrid2grl(id)
stopifnot(all.equal(id, names(grl)))


> id <- c("ENST00000612959", "ENST00000317091")
> grl1 <- etrid2grl(id[1])
> grl1
GRangesList object of length 1:
GRanges object with 8 ranges and 7 metadata columns:
      seqnames               ranges strand |        feature            gene
         <Rle>            <IRanges>  <Rle> |    <character>     <character>
  [1]    chr17 [30116807, 30116863]      + |           utr5 ENSG00000126653
  [2]    chr17 [30172542, 30172589]      + |           utr5 ENSG00000126653
  [3]    chr17 [30172590, 30172598]      + | protein_coding ENSG00000126653
  [4]    chr17 [30178071, 30178199]      + | protein_coding ENSG00000126653
  [5]    chr17 [30179090, 30179297]      + | protein_coding ENSG00000126653
  [6]    chr17 [30180908, 30181016]      + | protein_coding ENSG00000126653
  [7]    chr17 [30184615, 30185674]      + | protein_coding ENSG00000126653
  [8]    chr17 [30185675, 30186474]      + |           utr3 ENSG00000126653
                 exon      transcript      symbol      rank     phase
          <character>     <character> <character> <numeric> <integer>
  [1] ENSE00003738663 ENST00000612959       NSRP1         1        -1
  [2] ENSE00003689679 ENST00000612959       NSRP1         2        -1
  [3] ENSE00003689679 ENST00000612959       NSRP1         2        -1
  [4] ENSE00003519306 ENST00000612959       NSRP1         3         0
  [5] ENSE00003659893 ENST00000612959       NSRP1         4         0
  [6] ENSE00003562538 ENST00000612959       NSRP1         5         1
  [7] ENSE00003744594 ENST00000612959       NSRP1         6         0
  [8] ENSE00003744594 ENST00000612959       NSRP1         6        -1

seqinfo: 1 sequence from an unspecified genome; no seqlengths
> grl <- etrid2grl(id)
> stopifnot(all.equal(id, names(grl)))
