Last data update: 2014.03.03

R: HPC Job Scheduling Data
schedulingDataR Documentation

HPC Job Scheduling Data

Description

These data consist of information on 4331 jobs in a high performance computing environment. Seven attributes were recorded for each job along with a discrete class describing the execution time.

The predictors are: Protocol (the type of computation), Compounds (the number of data points for each jobs), InputFields (the number of characteristic being estimated), Iterations (maximum number of iterations for the computations), NumPending (the number of other jobs pending at the time of launch), Hour (decimal hour of day for launch time) and Day (of launch time).

The classes are: VF (very fast), F (fast), M (moderate) and L (long).

Usage

data(schedulingData)

Value

schedulingData

a data frame with 4331 rows and 8 columns

Examples

data(schedulingData)

library(caret)

set.seed(1104)
inTrain <- createDataPartition(schedulingData$Class, p = .8, list = FALSE)

schedulingData$NumPending <- schedulingData$NumPending + 1

trainData <- schedulingData[ inTrain,]
testData  <- schedulingData[-inTrain,]

modForm <- as.formula(Class ~ Protocol + log10(Compounds) +
                      log10(InputFields)+ log10(Iterations) +
                      log10(NumPending) + Hour + Day)


Results


R version 3.3.1 (2016-06-21) -- "Bug in Your Hair"
Copyright (C) 2016 The R Foundation for Statistical Computing
Platform: x86_64-pc-linux-gnu (64-bit)

R is free software and comes with ABSOLUTELY NO WARRANTY.
You are welcome to redistribute it under certain conditions.
Type 'license()' or 'licence()' for distribution details.

R is a collaborative project with many contributors.
Type 'contributors()' for more information and
'citation()' on how to cite R or R packages in publications.

Type 'demo()' for some demos, 'help()' for on-line help, or
'help.start()' for an HTML browser interface to help.
Type 'q()' to quit R.

> library(AppliedPredictiveModeling)
> png(filename="/home/ddbj/snapshot/RGM3/R_CC/result/AppliedPredictiveModeling/jobScheduling.Rd_%03d_medium.png", width=480, height=480)
> ### Name: schedulingData
> ### Title: HPC Job Scheduling Data
> ### Aliases: schedulingData
> ### Keywords: datasets
> 
> ### ** Examples
> 
> data(schedulingData)
> 
> library(caret)
Loading required package: lattice
Loading required package: ggplot2
> 
> set.seed(1104)
> inTrain <- createDataPartition(schedulingData$Class, p = .8, list = FALSE)
> 
> schedulingData$NumPending <- schedulingData$NumPending + 1
> 
> trainData <- schedulingData[ inTrain,]
> testData  <- schedulingData[-inTrain,]
> 
> modForm <- as.formula(Class ~ Protocol + log10(Compounds) +
+                       log10(InputFields)+ log10(Iterations) +
+                       log10(NumPending) + Hour + Day)
> 
> 
> 
> 
> 
> 
> 
> dev.off()
null device 
          1 
>