Text From Files
Contents
- This has the source_file struct, which is sort of the intermediary parse representation.
- It feeds entire files into the Lexer (except for the generated source).
- It does some Unicode handling here to make characters neater.
- The word count is tracked separately from the number of “word” tokens, because quoted text counts as multiple words.
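
The last two points can be illustrated with a rough sketch. This is not the module's actual code: Python is used only as a stand-in, and the names `neaten`, `tokenize`, and `word_count`, the character mapping, and the token pattern are all hypothetical. It assumes that "neater" means mapping a few typographic characters to plain ones, and that a quoted string is lexed as a single token.

```python
import re
import unicodedata

# Hypothetical mapping: the real module may tidy a different set of characters.
_NEATEN = str.maketrans({
    "\u201c": '"', "\u201d": '"',   # curly double quotes -> straight
    "\u2018": "'", "\u2019": "'",   # curly single quotes -> straight
    "\u00a0": " ",                  # no-break space -> plain space
})

def neaten(text: str) -> str:
    """Normalize to NFC, then swap a few typographic characters for plain ones."""
    return unicodedata.normalize("NFC", text).translate(_NEATEN)

# Assumed token shape: a double-quoted string, or a run of non-space characters.
_TOKEN = re.compile(r'"[^"]*"|\S+')

def tokenize(text: str) -> list[str]:
    """Split neatened text into tokens; a quoted string stays one token."""
    return _TOKEN.findall(neaten(text))

def word_count(tokens: list[str]) -> int:
    """Count words; a quoted token contributes one word per word inside it."""
    total = 0
    for tok in tokens:
        if len(tok) >= 2 and tok.startswith('"') and tok.endswith('"'):
            total += len(tok[1:-1].split())
        else:
            total += 1
    return total

tokens = tokenize("say \u201chello big world\u201d now")
print(len(tokens), word_count(tokens))  # 3 5
```

Here the quoted text is one token but three words, which is why a token count and a word count can disagree for the same file.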