Professional Documents
Culture Documents
1. File method
2. PlaintextCorpusReader
Q2: Pre-process the corpus loaded in step 1(apply normalization, tokenization, stopword removal,
stemming)
Q3: Convert the corpus into Bag-of-Words and tf-idf feature matrix using: