This data set was constructed from a very small subset of the
Enron email corpus (Klimt & Yang, 2004). A large set of email messages
was made public during the legal investigation concerning the
Enron corporation. The full corpus contained 619,446 emails from
158 users. This data set contains only ten emails and includes
the body of the email, the email's subject line, and the date.
Usage
data(enron)
Format
A data frame with 10 observations on the following 3 variables.
email
A character vector of the email's body.
date
The email's timestamp as a 'Date' type.
subject
A character vector containing the email's subject line.
Source
Klimt, Bryan, and Yiming
Yang. "The enron corpus: A new dataset for email classification research."
In Machine learning: ECML 2004, pp. 217-226. Springer Berlin Heidelberg,
2004.