Dictionary-Based Text Analysis

Chris Bail
Duke University

Word Counting

Word Counting w/Trump Tweets

 

library(tidytext)
library(dplyr)

load(url("https://cbail.github.io/Trump_Tweets.Rdata"))

tidy_trump_tweets<- trumptweets %>%
    select(created_at,text) %>%
      unnest_tokens("word", text)

Without Stop words

data("stop_words")

top_words<-
   tidy_trump_tweets %>%
      anti_join(stop_words) %>%
        filter(!(word=="https"|
                 word=="rt"|
                 word=="t.co"|
                 word=="amp")) %>%
            count(word) %>%
              arrange(desc(n))

So many Sentiments...

From Ribiero et al. (2016)

So many Sentiments...

From Goncalves et al. (2013)

When Should You Use Dictionary-Based Analysis?