I gave a talk at my local Cleveland R User Group about text processing and document vectorization. You can view the talk here:
Note that I’m using the tm package, which is the traditional way to work with a document collection in R. Newer approaches such as tidytext are gaining popularity, and I may do a follow-up talk on that.
Feedback and More Videos
Enjoy, and feedback is welcome! And if you are interested in more video content on machine learning in R, check out this post.