R: Brown Corpus Frequency Data (zipfR)
Description
Brown.tfl, Brown.spc and Brown.emp.vgc are
zipfR objects of classes tfl,
spc and vgc, respectively.
These data were extracted from the Brown corpus (see Kucera and
Francis 1967).
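As a rough illustration of what a type frequency list (tfl) and a frequency spectrum (spc) hold, the following base R sketch builds both from a toy token vector. The tokens and the names toy_tfl and toy_spc are hypothetical; the code does not use zipfR or the Brown data, and the real objects carry additional structure.

```r
# Toy token sequence (hypothetical data, not taken from the Brown corpus)
tokens <- c("the", "cat", "sat", "on", "the", "mat", "the", "end")

# Type frequency list: one frequency per distinct word type
toy_tfl <- sort(table(tokens), decreasing = TRUE)

# Frequency spectrum: for each frequency m, the number of types
# that occur exactly m times
toy_spc <- table(as.vector(toy_tfl))

print(toy_tfl)
print(toy_spc)
```

In this toy sample, five types occur once and one type occurs three times, so the spectrum has entries only for m = 1 and m = 3.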
Details
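Brown.emp.vgc is the empirical vocabulary growth
curve. It records how the vocabulary size V and the number of hapax
legomena V(1) develop as the sample size grows, with the tokens taken
in their original order (the corpus was not randomized).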
We removed numbers and other forms of non-linguistic material before
collecting word counts from the Brown corpus.
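To make the idea of an empirical vocabulary growth curve concrete, here is a minimal base R sketch that tracks V and V(1) over a toy token sequence read in its original order. The token vector and the name toy_vgc are hypothetical, and this is not how the Brown data were produced; it only mirrors the kind of N, V and V(1) values such a curve records.

```r
# Toy token sequence in its original (non-randomized) order;
# hypothetical data, not taken from the Brown corpus
tokens <- c("the", "cat", "sat", "on", "the", "mat", "the", "end")

# Sample sizes: the first 1, 2, ..., length(tokens) tokens
N <- seq_along(tokens)

# V(N): number of distinct types among the first N tokens
V <- vapply(N, function(n) length(unique(tokens[1:n])), integer(1))

# V1(N): number of types occurring exactly once among the first N tokens
V1 <- vapply(N, function(n) sum(table(tokens[1:n]) == 1), integer(1))

toy_vgc <- data.frame(N = N, V = V, V1 = V1)
print(toy_vgc)
```

V never decreases as N grows, whereas V1 can drop whenever a former hapax is seen a second time, as happens here at the fifth token.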
References
Kucera, H. and Francis, W.N. (1967). Computational analysis of
present-day American English. Brown University Press, Providence.
See Also
The datasets documented in BrownSubsets pertain to
various subsets of the Brown corpus (e.g., informative prose or
adjectives only).
Examples
library(zipfR)
# type frequency list
data(Brown.tfl)
summary(Brown.tfl)
# frequency spectrum
data(Brown.spc)
summary(Brown.spc)
# empirical vocabulary growth curve
data(Brown.emp.vgc)
summary(Brown.emp.vgc)