First corpus building function
assemble_corpus(data, stopwords, remove_twitter)
| data | The data from which the corpus is drawn with documents in first column |
|---|---|
| stopwords | Words to exclude from the clustering |
| remove_twitter | Whether to remove text associated with Twitter content, useful for when analyzing data from this source (defaults to FALSE) |
| ... | The metadata columns following the text column |
Puts together the corpus and dfm from the data frame provided