Last data update: 2014.03.03

R: Import documents from a directory into Mallet format
mallet.read.dir    R Documentation

Import documents from a directory into Mallet format

Description

This function takes a directory path as its only argument and returns a data.frame with two columns, id and text, which can be passed to the mallet.import function. The data.frame has one row per file in Dir.

Usage

mallet.read.dir(Dir)

Arguments

Dir

The path to a directory containing one document per file.

Note

This function was contributed to RMallet by Dan Bowen.

See Also

mallet.import

Examples

## Not run: 
documents <- mallet.read.dir(Dir)
mallet.instances <- mallet.import(documents$id, documents$text, "en.txt",
                                  token.regexp = "\\p{L}[\\p{L}\\p{P}]+\\p{L}")

## End(Not run)
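
The example above is not run, so it does not show what the returned data.frame looks like. The following is a minimal base-R sketch of the behavior stated in the Description, not the package's own source. The helper name read_dir_sketch and the demo file names are hypothetical, and using the full file path as the id is an assumption. It also assumes the directory holds only regular files, with no subdirectories.

```r
# Hypothetical stand-in for mallet.read.dir(), written in base R only.
# Assumption: each document's id is its file path.
read_dir_sketch <- function(dir) {
  # One entry per file; assumes `dir` contains no subdirectories.
  files <- list.files(dir, full.names = TRUE)
  # Collapse each file's lines into a single string.
  texts <- vapply(
    files,
    function(f) paste(readLines(f, warn = FALSE), collapse = "\n"),
    character(1)
  )
  data.frame(id = files, text = unname(texts), stringsAsFactors = FALSE)
}

# Build a throwaway directory with one document per file.
demo_dir <- file.path(tempdir(), "mallet_read_dir_demo")
dir.create(demo_dir, showWarnings = FALSE)
writeLines("The quick brown fox.", file.path(demo_dir, "doc1.txt"))
writeLines(c("Topic models", "find themes."), file.path(demo_dir, "doc2.txt"))

documents <- read_dir_sketch(demo_dir)
str(documents)  # two columns, id and text; one row per file
```

The id and text columns of such a data.frame are the two vectors that the example above passes to mallet.import as documents$id and documents$text.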
