Last data update: 2014.03.03

R: Two dimensional data in original and log scale
cta20R Documentation

Two dimensional data in original and log scale

Description

Two dimensional data in original and log scale and their hierarchical modal clustering. This dataset demonstrates the fact that modal clustering techniques can be used to cluster untransformed data as it does not depend on parametric assumptions. The clustering results, before and after the log transformation both produce nice separation of the three clusters.

Usage

data(cta20)
data(cta20.hmac)
data(logcta20)
data(logcta20.hmac)

Format

cta20 and logcta20 are two dimensional matrices. cta20.hmac and logcta20.hmac are objects of class hmac obtained from applying phmac on cta20 and logcta20 respectively

Details

The dataset is generated by illumina technology for high throughput genotyping named GOLDEN GATE ( http://www.illumina.com/technology/goldengate_genotyping_assay.ilmn). The data values are actual measurements made by the machine (intensity), after these are normalized (background subtracted etc). The data set is used for making genotype calls by Illumina. The data around X- and Y-axes represents the two homozygous genotypes (e.g. AA and TT), while the cluster along the 45-degree line represents the heterozygous (e.g. AT) genotype. Due to noisy reads, the data points often lie in-between the axes, and cluster detection is used for making automatic genotype calls.

Author(s)

Surajit Ray and Yansong Cheng

Examples

data(logcta20)
data(logcta20.hmac)
plot(logcta20)
plot(logcta20.hmac)
plot(logcta20.hmac,level=4)

Results


R version 3.3.1 (2016-06-21) -- "Bug in Your Hair"
Copyright (C) 2016 The R Foundation for Statistical Computing
Platform: x86_64-pc-linux-gnu (64-bit)

R is free software and comes with ABSOLUTELY NO WARRANTY.
You are welcome to redistribute it under certain conditions.
Type 'license()' or 'licence()' for distribution details.

R is a collaborative project with many contributors.
Type 'contributors()' for more information and
'citation()' on how to cite R or R packages in publications.

Type 'demo()' for some demos, 'help()' for on-line help, or
'help.start()' for an HTML browser interface to help.
Type 'q()' to quit R.

> library(Modalclust)
Loading required package: mvtnorm
Loading required package: zoo

Attaching package: 'zoo'

The following objects are masked from 'package:base':

    as.Date, as.Date.numeric

Loading required package: class
> png(filename="/home/ddbj/snapshot/RGM3/R_CC/result/Modalclust/cta20.Rd_%03d_medium.png", width=480, height=480)
> ### Name: cta20
> ### Title: Two dimensional data in original and log scale
> ### Aliases: cta20 logcta20 cta20.hmac logcta20.hmac
> ### Keywords: data
> 
> ### ** Examples
> 
> data(logcta20)
> data(logcta20.hmac)
> plot(logcta20)
> plot(logcta20.hmac)

> plot(logcta20.hmac,level=4)

> 
> 
> 
> 
> 
> dev.off()
null device 
          1 
>