twitter_sentiment_analysis_part4 icon indicating copy to clipboard operation
twitter_sentiment_analysis_part4 copied to clipboard

Data split, feature extraction with count vectorizer

Another Twitter Sentiment Analysis with Python - Part 4

Attached Jupyter Notebook is the part 4 of the Twitter Sentiment Analysis project I implemented as a capstone project for General Assembly's Data Science Immersive course.

Accompanying blog posts can be found from my Medium account: https://medium.com/@rickykim78

Below implementations can be found in the attached notebook.

Train/Dev/Test Split

Baseline

  • null accuracy
  • TextBlob sentiment polarity score (prerequisite: TextBlob pip install textblob)

Count Vectorizer

experiment with

  • stop words removal
  • varying vocabulary size
  • n-grams