Text Mining using 'dplyr', 'ggplot2', and Other Tidy Tools


Documentation for package ‘tidytext’ version 0.1.4

Help Pages

tidytext-package tidytext: Text Mining using 'dplyr', 'ggplot2', and Other Tidy Tools
augment.jobjRef Tidiers for Latent Dirichlet Allocation models from the mallet package
augment.LDA Tidiers for LDA objects from the topicmodels package
augment.STM Tidiers for Structural Topic Models from the stm package
bind_tf_idf Bind the term frequency and inverse document frequency of a tidy text dataset to the dataset
cast_dfm Casting a data frame to a DocumentTermMatrix, TermDocumentMatrix, or dfm
cast_dtm Casting a data frame to a DocumentTermMatrix, TermDocumentMatrix, or dfm
cast_sparse Create a sparse matrix from row names, column names, and values in a table
cast_tdm Casting a data frame to a DocumentTermMatrix, TermDocumentMatrix, or dfm
corpus_tidiers Tidiers for a corpus object from the quanteda package
dictionary_tidiers Tidy dictionary objects from the quanteda package
get_sentiments Get a tidy data frame of a single sentiment lexicon
glance.corpus Tidiers for a corpus object from the quanteda package
glance.LDA Tidiers for LDA objects from the topicmodels package
glance.STM Tidiers for Structural Topic Models from the stm package
lda_tidiers Tidiers for LDA objects from the topicmodels package
mallet_tidiers Tidiers for Latent Dirichlet Allocation models from the mallet package
parts_of_speech Parts of speech for English words from the Moby Project
sentiments Sentiment lexicons from three sources
stm_tidiers Tidiers for Structural Topic Models from the stm package
stop_words Various lexicons for English stop words
tdm_tidiers Tidy DocumentTermMatrix, TermDocumentMatrix, and related objects from the tm package
tidy.Corpus Tidy a Corpus object from the tm package
tidy.corpus Tidiers for a corpus object from the quanteda package
tidy.dfmSparse Tidy DocumentTermMatrix, TermDocumentMatrix, and related objects from the tm package
tidy.dictionary2 Tidy dictionary objects from the quanteda package
tidy.DocumentTermMatrix Tidy DocumentTermMatrix, TermDocumentMatrix, and related objects from the tm package
tidy.jobjRef Tidiers for Latent Dirichlet Allocation models from the mallet package
tidy.LDA Tidiers for LDA objects from the topicmodels package
tidy.simple_triplet_matrix Tidy DocumentTermMatrix, TermDocumentMatrix, and related objects from the tm package
tidy.STM Tidiers for Structural Topic Models from the stm package
tidy.TermDocumentMatrix Tidy DocumentTermMatrix, TermDocumentMatrix, and related objects from the tm package
tidytext tidytext: Text Mining using 'dplyr', 'ggplot2', and Other Tidy Tools
tidy_triplet Utility function to tidy a simple triplet matrix
unnest_tokens Split a column into tokens using the tokenizers package