I am trying to do some text statistics, like word frequency, average word length, average sentence length, and average paragraph length, I managed to do the word frequency and the average sentence and word length. What I need to do next is preprocess the text file by removing some words, "listed in some other text file", and then do my statistics. And if some one can tell me how to do the average paragraph length too, please.
Any help is appreciated.
Any help is appreciated.
Comment