AAA2011-Tweets
AAA2011-Tweets copied to clipboard
R code for analyzing tweets relating to #AAA2011 (text mining, topic modelling, network analysis, clustering and sentiment analysis)
R code for obtaining and analysing tweets from the 2011 meeting of the American Anthropological Association
The code details ten steps in the analysis and visualisation of the tweets:
- acquiring the raw Twitter data
- calculating some basic statistics with the raw Twitter data
- calculating some basic retweet statistics
- calculating the ratio of retweets to tweets
- calculating some basic statistics about URLs in tweets
- basic text mining for token frequency and token association analysis
- calculating senitment scores of tweets, including on subsets containing tokens of interest
- hierarchical clustering of tokens based on multiscale bootstrap resampling
- topic modelling the tweet corpus using latent Dirichlet allocation
- network analysis of tweeters based on retweets
Author: Ben Marwick Contact: http://faculty.washington.edu/bmarwick/how-contact-me Licence: http://creativecommons.org/licenses/by-nc-sa/2.0/ Date: Dec 2011