topic model 预处理步骤

1. del punctuation

2. lower case

3. del stopword

4. len(s)>1

5. del infrequent word (optional)

7. stemming

你可能感兴趣的:(topic model 预处理步骤)