sanitizing tweets
Problem
You have the text of a tweet and you want to get rid of the bullshit (smileys, emojis, etc.)
Solution
See https://github.com/s/preprocessor. It’s customizable, you can select what to remove, e.g. URLs, smileys, etc.
Categories: python
sanitize, text preprocessing, tweet, twitter
Comments (0)
Leave a comment
