Last data update: 2014.03.03
|
R: 1000genomes Reference Genome Sequence (hs37d5)
BSgenome.Hsapiens.1000genomes.hs37d5 | R Documentation |
1000genomes Reference Genome Sequence (hs37d5)
Description
Full 1000genomes Phase2 Reference Genome Sequence (hs37d5), based on NCBI GRCh37.
Note
This BSgenome data package was made from the following source data file:
ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/reference/phase2_reference_assembly_sequence/hs37d5.fa.gz
The genome is composed of:
Integrated reference sequence from the GRCh37 primary assembly
(chromosomal plus unlocalized and unplaced contigs)
The rCRS mitochondrial sequence (AC:NC_012920)
Human herpesvirus 4 type 1 (AC:NC_007605)
Concatenated decoy sequences (hs37d5cs.fa.gz)
For details, please see
ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/reference/phase2_reference_assembly_sequence/README_human_reference_20110707.
Author(s)
Julian Gehring <julian.gehring@embl.de>
See Also
-
BSgenome objects and the
available.genomes function
in the BSgenome software package.
-
DNAString objects in the Biostrings
package.
The BSgenomeForge vignette (vignette("BSgenomeForge") )
in the BSgenome software package for how to make a BSgenome
data package.
Examples
BSgenome.Hsapiens.1000genomes.hs37d5
genome <- BSgenome.Hsapiens.1000genomes.hs37d5
seqlengths(genome)
Results
R version 3.3.1 (2016-06-21) -- "Bug in Your Hair"
Copyright (C) 2016 The R Foundation for Statistical Computing
Platform: x86_64-pc-linux-gnu (64-bit)
R is free software and comes with ABSOLUTELY NO WARRANTY.
You are welcome to redistribute it under certain conditions.
Type 'license()' or 'licence()' for distribution details.
R is a collaborative project with many contributors.
Type 'contributors()' for more information and
'citation()' on how to cite R or R packages in publications.
Type 'demo()' for some demos, 'help()' for on-line help, or
'help.start()' for an HTML browser interface to help.
Type 'q()' to quit R.
> library(BSgenome.Hsapiens.1000genomes.hs37d5)
Loading required package: BSgenome
Loading required package: BiocGenerics
Loading required package: parallel
Attaching package: 'BiocGenerics'
The following objects are masked from 'package:parallel':
clusterApply, clusterApplyLB, clusterCall, clusterEvalQ,
clusterExport, clusterMap, parApply, parCapply, parLapply,
parLapplyLB, parRapply, parSapply, parSapplyLB
The following objects are masked from 'package:stats':
IQR, mad, xtabs
The following objects are masked from 'package:base':
Filter, Find, Map, Position, Reduce, anyDuplicated, append,
as.data.frame, cbind, colnames, do.call, duplicated, eval, evalq,
get, grep, grepl, intersect, is.unsorted, lapply, lengths, mapply,
match, mget, order, paste, pmax, pmax.int, pmin, pmin.int, rank,
rbind, rownames, sapply, setdiff, sort, table, tapply, union,
unique, unsplit
Loading required package: S4Vectors
Loading required package: stats4
Attaching package: 'S4Vectors'
The following objects are masked from 'package:base':
colMeans, colSums, expand.grid, rowMeans, rowSums
Loading required package: IRanges
Loading required package: GenomeInfoDb
Loading required package: GenomicRanges
Loading required package: Biostrings
Loading required package: XVector
Loading required package: rtracklayer
> png(filename="/home/ddbj/snapshot/RGM3/R_BC/result/BSgenome.Hsapiens.1000genomes.hs37d5/package.Rd_%03d_medium.png", width=480, height=480)
> ### Name: BSgenome.Hsapiens.1000genomes.hs37d5
> ### Title: 1000genomes Reference Genome Sequence (hs37d5)
> ### Aliases: BSgenome.Hsapiens.1000genomes.hs37d5-package
> ### BSgenome.Hsapiens.1000genomes.hs37d5 hs37d5
> ### Keywords: package data
>
> ### ** Examples
>
> BSgenome.Hsapiens.1000genomes.hs37d5
Human genome:
# organism: Homo sapiens (Human)
# provider: 1000genomes
# provider version: hs37d5
# release date: 2011-07-07
# release name: hs37d5
# 86 sequences:
# 1 2 3 4 5 6 7
# 8 9 10 11 12 13 14
# 15 16 17 18 19 20 21
# 22 X Y MT GL000207.1 GL000226.1 GL000229.1
# GL000231.1 GL000210.1 GL000239.1 GL000235.1 GL000201.1 GL000247.1 GL000245.1
# ... ... ... ... ... ... ...
# GL000198.1 GL000208.1 GL000191.1 GL000227.1 GL000228.1 GL000214.1 GL000221.1
# GL000209.1 GL000218.1 GL000220.1 GL000213.1 GL000211.1 GL000199.1 GL000217.1
# GL000216.1 GL000215.1 GL000205.1 GL000219.1 GL000224.1 GL000223.1 GL000195.1
# GL000212.1 GL000222.1 GL000200.1 GL000193.1 GL000194.1 GL000225.1 GL000192.1
# NC_007605 hs37d5
# (use 'seqnames()' to see all the sequence names, use the '$' or '[[' operator
# to access a given sequence)
> genome <- BSgenome.Hsapiens.1000genomes.hs37d5
> seqlengths(genome)
1 2 3 4 5 6 7
249250621 243199373 198022430 191154276 180915260 171115067 159138663
8 9 10 11 12 13 14
146364022 141213431 135534747 135006516 133851895 115169878 107349540
15 16 17 18 19 20 21
102531392 90354753 81195210 78077248 59128983 63025520 48129895
22 X Y MT GL000207.1 GL000226.1 GL000229.1
51304566 155270560 59373566 16569 4262 15008 19913
GL000231.1 GL000210.1 GL000239.1 GL000235.1 GL000201.1 GL000247.1 GL000245.1
27386 27682 33824 34474 36148 36422 36651
GL000197.1 GL000203.1 GL000246.1 GL000249.1 GL000196.1 GL000248.1 GL000244.1
37175 37498 38154 38502 38914 39786 39929
GL000238.1 GL000202.1 GL000234.1 GL000232.1 GL000206.1 GL000240.1 GL000236.1
39939 40103 40531 40652 41001 41933 41934
GL000241.1 GL000243.1 GL000242.1 GL000230.1 GL000237.1 GL000233.1 GL000204.1
42152 43341 43523 43691 45867 45941 81310
GL000198.1 GL000208.1 GL000191.1 GL000227.1 GL000228.1 GL000214.1 GL000221.1
90085 92689 106433 128374 129120 137718 155397
GL000209.1 GL000218.1 GL000220.1 GL000213.1 GL000211.1 GL000199.1 GL000217.1
159169 161147 161802 164239 166566 169874 172149
GL000216.1 GL000215.1 GL000205.1 GL000219.1 GL000224.1 GL000223.1 GL000195.1
172294 172545 174588 179198 179693 180455 182896
GL000212.1 GL000222.1 GL000200.1 GL000193.1 GL000194.1 GL000225.1 GL000192.1
186858 186861 187035 189789 191469 211173 547496
NC_007605 hs37d5
171823 35477943
>
>
>
>
>
> dev.off()
null device
1
>
|