This function gives general statistics for a character vector,
e.g. obtained by loading a text file with the
readLines or stri_read_lines function,
where each text line is represented by a separate string.
Usage
stri_stats_general(str)
Arguments
str
character vector to be aggregated
Details
None of the strings may contain \r or \n characters;
otherwise you will get an error.
Below, 'white space' means the Unicode binary property
WHITE_SPACE; see stringi-search-charclass.
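If the text arrives as a single string with embedded line breaks, it has to be split into one string per line before it is passed to this function. A minimal sketch, assuming stringi's stri_split_lines1 is used for the split (the variable names txt and lines are illustrative only):

```r
library(stringi)

# Hypothetical input: one string with mixed line endings
txt <- "first line\nsecond line\r\n\nlast line"

# Split into one string per text line, so that no element
# contains a line-break character any more
lines <- stri_split_lines1(txt)

# The statistics can now be computed on the per-line vector
stats <- stri_stats_general(lines)
stats
```

Text read with readLines or stri_read_lines is already in this form and needs no extra step.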
Value
Returns an integer vector with the following named elements:
Lines - number of lines (number of
non-missing strings in the vector);
LinesNEmpty - number of lines with at least
one non-WHITE_SPACE character;
Chars - total number of Unicode code points detected;
CharsNWhite - number of Unicode code points
that are not WHITE_SPACEs;
... (other elements may appear in future releases of stringi).
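Because the result is a named integer vector, individual statistics can be extracted by name and combined. The sketch below derives two quantities that are not returned directly; the input vector and the names blank_lines and white_chars are illustrative only:

```r
library(stringi)

# Illustrative input: one blank (white-space-only) line and one missing value
x <- c("alpha beta", "   ", NA, "gamma")

st <- stri_stats_general(x)

# Missing strings are not counted as lines
st[["Lines"]]

# Lines consisting only of white space (or nothing at all)
blank_lines <- st[["Lines"]] - st[["LinesNEmpty"]]

# Code points that have the WHITE_SPACE property
white_chars <- st[["Chars"]] - st[["CharsNWhite"]]
```

Accessing elements by name rather than by position keeps such code working if further elements are added in later releases.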
See Also
Other stats: stri_stats_latex
Examples
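library(stringi)

# Three lines of text followed by one empty line
s <- c("Lorem ipsum dolor sit amet, consectetur adipisicing elit.",
       "nibh augue, suscipit a, scelerisque sed, lacinia in, mi.",
       "Cras vel lorem. Etiam pellentesque aliquet tellus.",
       "")
# The empty string counts towards Lines but not towards LinesNEmpty
stri_stats_general(s)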