Last data update: 2014.03.03

Results 1 - 10 of 55 found.

meta (Package: tm) : Metadata Management

Accessing and modifying metadata of text documents and corpora.
● Data Source: CranContrib
● Alias: DublinCore, DublinCore<-, meta, meta.PCorpus, meta.PlainTextDocument, meta.VCorpus, meta.XMLTextDocument, meta<-.PCorpus, meta<-.PlainTextDocument, meta<-.VCorpus, meta<-.XMLTextDocument

tm_map (Package: tm) : Transformations on Corpora

Interface to apply transformation functions (also denoted as mappings) to corpora.
● Data Source: CranContrib
● Alias: tm_map, tm_map.PCorpus, tm_map.VCorpus
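
As a minimal sketch of what applying a mapping to a corpus looks like, the example below runs tm's built-in removePunctuation transformation over a small in-memory corpus. The sample texts are made up for illustration, and accessor details may differ between tm releases.

```r
library(tm)

# Hypothetical sample texts, not taken from the package.
texts <- c("Hello, world!", "Text mining: an example.")
corpus <- VCorpus(VectorSource(texts))

# Apply the same transformation to every document in the corpus.
clean <- tm_map(corpus, removePunctuation)

# Inspect the first transformed document as plain text.
as.character(clean[[1]])
```

tm_map returns a new corpus, so the original corpus object is left unchanged.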

removeWords (Package: tm) : Remove Words from a Text Document

Remove words from a text document.
● Data Source: CranContrib
● Alias: removeWords, removeWords.PlainTextDocument, removeWords.character
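
A short sketch of the character method, assuming a made-up sentence and word list. Removed words leave their surrounding spaces behind, so the example tidies the result with base R rather than relying on any further tm function.

```r
library(tm)

# Hypothetical input sentence and words to drop.
text <- "the quick brown fox jumps over the lazy dog"
result <- removeWords(text, c("the", "over"))

# Collapse the gaps left behind by the removed words.
tidy <- gsub("\\s+", " ", trimws(result))
tidy
```

In practice the word list is often a stop word list, but any character vector of whole words can be passed.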

Docs (Package: tm) : Access Document IDs and Terms

Access the document IDs and terms of a term-document matrix or document-term matrix, along with the number of documents and terms.
● Data Source: CranContrib
● Alias: Docs, Terms, nDocs, nTerms
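
The four accessors can be sketched on a tiny document-term matrix. The two sample texts are invented for illustration, and the matrix is built with default settings, so the exact terms kept depend on the tm version's defaults.

```r
library(tm)

# Hypothetical two-document corpus.
texts <- c("cats chase mice", "dogs chase cats")
corpus <- VCorpus(VectorSource(texts))
dtm <- DocumentTermMatrix(corpus)

nDocs(dtm)   # number of documents
nTerms(dtm)  # number of distinct terms
Docs(dtm)    # document IDs
Terms(dtm)   # the terms themselves
```

The same accessors work on a TermDocumentMatrix, where terms are rows and documents are columns.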

readXML (Package: tm) : Read In an XML Document

Return a function which reads in an XML document. The structure of the XML document is described with a specification.
● Data Source: CranContrib
● Alias: readXML

Corpus (Package: tm) : Corpora

Representing and computing on corpora.
● Data Source: CranContrib
● Alias: Corpus

PCorpus (Package: tm) : Permanent Corpora

Create permanent corpora.
● Data Source: CranContrib
● Alias: PCorpus

readDOC (Package: tm) : Read In a MS Word Document

Return a function which reads in a Microsoft Word document extracting its text.
● Data Source: CranContrib
● Alias: readDOC

tm_combine (Package: tm) : Combine Corpora, Documents, Term-Document Matrices, and Term Frequency Vectors

Combine several corpora into a single one, combine multiple documents into a corpus, combine multiple term-document matrices into a single one, or combine multiple term frequency vectors into a single term-document matrix.
● Data Source: CranContrib
● Alias: c.TermDocumentMatrix, c.TextDocument, c.VCorpus, c.term_frequency

getTokenizers (Package: tm) : Tokenizers

Predefined tokenizers.
● Data Source: CranContrib
● Alias: getTokenizers