
2016 Election Tweets: Hillary vs Trump

Data
The data in this tab is comprised of tweets from then presidential candidate Donald Trump and former Secretary of State Hillary Clinton. The tweets are from Jan 2016 to Sept 2016 and have much information available, such as the author, time/date information, whether or not it was a retweet, and of course, the tweet itself.
MAGA - Positive or Negative?
“Make America Great Again,” or MAGA, is a slogan Donald Trump used during his 2016 presidential campaign. It suggests a goal of returning America to past greatness, focusing on boosting the economy and strengthening security. However, the slogan has sparked controversy. Some people criticize MAGA, saying it looks back to a time when people were treated less fairly. This has made MAGA a divisive term, representing very different ideas about what America should be.
This histogram then shows the distribution of the AFINN scores of all of the words from tweets that contained the phrase “MAGA”. Each word is given a rating from -5 (worst connotation) to 5 (best connotation). For further information about AFINN, see the figure below the histogram, which has a few words from the AFINN data and their associated scores.
# A tibble: 6 × 2
# Groups: value [6]
word value
<chr> <dbl>
1 lunatics -3
2 lethargy -2
3 manipulation -1
4 solution 1
5 peace 2
6 lovely 3
Capitalized Words
Anyone who has ever come across even a few of Donald Trump’s tweets will be familiar with his affinity for capital letters. In the context of this data, it makes sense to compare Trump’s usage of capital letters to Hillary’s usage. Before arriving at my final result, I had to not count words like “I”, which are always capitalized, as well as abbreviations like “NC” for North Carolina. The following output is
Trump_Prop
1 0.006820328
Hillary_Prop
1 0.005009851
These proportions are nearly identical, differing by not even a percentage point. This is an interesting result for sure, suggesting that perhaps Trump isn’t as much of a loudmouth as people make him out to be (at least on Twitter). If I were to do further analysis, I would like to compare some metric tied to whether or not a tweet was a retweet or create a time vs frequency of words graph.