The dataset contains an outcome variable Y, a covariate X, an environmental exposure E, time, and genotype data Z. In time, the first column is the subject ID and the second column is the exam at which the measurement was taken. Y, X, E and time are all in long form, with one row per subject-exam observation. Z is a genotype matrix in which each row corresponds to one subject.
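
To make the layout concrete, here is a minimal sketch in Python with NumPy that builds a toy dataset of the same shape. The sizes, the random values, the 0/1/2 genotype coding, and the assumption that the rows of Z follow the sorted order of the subject IDs are all illustrative choices, not properties of the actual dataset.

```python
import numpy as np

# Hypothetical toy sizes: 3 subjects, 2 exams each, 4 genetic variants.
n_subjects, n_exams, n_variants = 3, 2, 4
rng = np.random.default_rng(0)

# time (long form): column 0 = subject ID, column 1 = exam number.
subject_id = np.repeat(np.arange(1, n_subjects + 1), n_exams)
exam = np.tile(np.arange(1, n_exams + 1), n_subjects)
time = np.column_stack([subject_id, exam])

# Y, X and E are long form too: one value per (subject, exam) row of time.
n_obs = time.shape[0]
Y = rng.normal(size=n_obs)
X = rng.normal(size=n_obs)
E = rng.normal(size=n_obs)

# Z: one row per subject. The 0/1/2 coding is an assumption for this sketch.
Z = rng.integers(0, 3, size=(n_subjects, n_variants))

# Align Z with the long-form rows. This assumes row i of Z belongs to the
# i-th distinct subject ID in sorted order.
ids, row_of_obs = np.unique(time[:, 0], return_inverse=True)
Z_long = Z[row_of_obs]

print(time.shape, Y.shape, Z.shape, Z_long.shape)
```

The point of the last step is that Z has fewer rows than Y, X, E and time, so any per-observation analysis first needs to map each long-form row back to its subject's genotype row through the subject ID column.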