Two dimensional data in original and log scale and their hierarchical modal
clustering. This dataset demonstrates the fact that modal clustering
techniques can be used to cluster untransformed data as it does not
depend on parametric assumptions. The clustering results,
before and after the log transformation both produce nice separation of
the three clusters.
cta20 and logcta20 are two dimensional
matrices. cta20.hmac and logcta20.hmac are objects of class hmac
obtained from applying phmac on cta20 and logcta20 respectively
Details
The dataset is generated by illumina technology for high
throughput genotyping named GOLDEN GATE (
http://www.illumina.com/technology/goldengate_genotyping_assay.ilmn).
The data values are actual measurements made by the machine (intensity), after these are normalized (background subtracted etc).
The data set is used for making genotype calls by Illumina. The data
around X- and Y-axes represents the two homozygous
genotypes (e.g. AA and TT), while the cluster along the 45-degree line represents the
heterozygous (e.g. AT) genotype. Due to noisy reads, the data points often
lie in-between the axes, and cluster detection is used for making
automatic genotype calls.
R version 3.3.1 (2016-06-21) -- "Bug in Your Hair"
Copyright (C) 2016 The R Foundation for Statistical Computing
Platform: x86_64-pc-linux-gnu (64-bit)
R is free software and comes with ABSOLUTELY NO WARRANTY.
You are welcome to redistribute it under certain conditions.
Type 'license()' or 'licence()' for distribution details.
R is a collaborative project with many contributors.
Type 'contributors()' for more information and
'citation()' on how to cite R or R packages in publications.
Type 'demo()' for some demos, 'help()' for on-line help, or
'help.start()' for an HTML browser interface to help.
Type 'q()' to quit R.
> library(Modalclust)
Loading required package: mvtnorm
Loading required package: zoo
Attaching package: 'zoo'
The following objects are masked from 'package:base':
as.Date, as.Date.numeric
Loading required package: class
> png(filename="/home/ddbj/snapshot/RGM3/R_CC/result/Modalclust/cta20.Rd_%03d_medium.png", width=480, height=480)
> ### Name: cta20
> ### Title: Two dimensional data in original and log scale
> ### Aliases: cta20 logcta20 cta20.hmac logcta20.hmac
> ### Keywords: data
>
> ### ** Examples
>
> data(logcta20)
> data(logcta20.hmac)
> plot(logcta20)
> plot(logcta20.hmac)
> plot(logcta20.hmac,level=4)
>
>
>
>
>
> dev.off()
null device
1
>