Objects of classes spc and vgc that
contain frequency data for a collection of Dickens's works from
Project Gutenberg, and for 3 novels (Oliver Twist, Great
Expectations and Our Mutual Friends).
Details
Dickens.spc has a frequency spectrum derived from a
collection of Dickens' works downloaded from the Gutenberg archive
(A Christmas Carol, David Copperfield, Dombey and Son, Great
Expectations, Hard Times, Master Humphrey's Clock, Nicholas
Nickleby, Oliver Twist, Our Mutual Friend, Sketches by BOZ, A Tale
of Two Cities, The Old Curiosity Shop, The Pickwick Papers, Three
Ghost Stories). Dickens.emp.vgc contains the corresponding
observed vocabulary growth (V and V(1)).
DickensOliverTwist.spc and DickensOliverTwist.emp.vgc
contain spectrum and observed growth curve (V and V(1)
of the early novel Oliver Twist (1837-1839).
DickensGreatExpectations.spc and
DickensGreatExpectations.emp.vgc contain spectrum and
observed growth curve (V and V(1)) of the late novel
Great Expectations (1860-1861).
DickensOurMutualFriend.spc and
DickensOurMutualFriend.emp.vgc contain spectrum and observed
growth curve (V and V(1)) of Our Mutual Friend, the
last novel completed by Dickens (1864-1865).
Notice that we removed numbers and other forms of non-linguistic
material before collecting the frequency data.