Skip to contents

The function creates a lexical dispersion plot for corpus objects based on keywords or dictionaries.

Usage

plotLexDiv(corpus,
           keywords,
           docvar=NULL,         
           value=NULL,
           aggr.by.var=NULL,
           rem.accent = TRUE,
           rem.punct = TRUE,
           case.insensitive = TRUE,
           lang = "es", 
           title="Lexical Dispersion Plot", 
           caption="Own elaboration.",
           subtitle="Keywords",
           legend.title="Group",
           legend.rows=1,
           palette=c("#DD8D29","#E2D200","#46ACC8","#E58601","#B40F20"),
           custom.color="black",
           panel.bg.fill="grey98",
           hline.color="white",
           hline.width=0.5,
           na.rm=FALSE)

Arguments

corpus

A quanteda corpus object.

keywords

Keywords or dictionary employed to search for terms in texts.

docvar

Metadata variable associated to the texts on a quanteda corpus object.

value

List of values contained in the metadata variable (docvar).

aggr.by.var

Establish the grouping variable to be used in the creation of a new aggregated corpus.

rem.accent

Remove accents. The default is TRUE.

rem.punct

Remove punctuation. The default is TRUE.

case.insensitive

Search for both upper and lowercase words. The default is TRUE.

lang

The language for removing stopwords. The default is Spanish: "es".

title

The title of the graph. The default ist "Lexical Dispersion Plot".

caption

The caption, note or source of the graph. The default is "Own elaboration".

subtitle

The subtitle of the graph. The default is "Keywords".

legend.title

The title for the legend. The default is "Group".

legend.rows

The number of rows organizing groups in the legend. The default is 1.

palette

The list of color names of hexadecimal codes to represent groups.

custom.color

The custom color to be used when no groups or dictionary entries are defined. The default is "black".

panel.bg.fill

The background color for the text panels. The default color is "grey98".

hline.color

Horizontal line color. The default is "white".

hline.width

Horizontal line width. The default is 0.5.

na.rm

Logical. Removes texts that do not contain the keywords or categories indicated. The default is FAlSE.

Details

The function generates a Lexical Dispersion Plot for keywords, dictionaries or metadata variables. It represents the position of the selected keywords, dictionary categories, or metadata values in each text forming a corpus object.

Value

A chart representing the dispersion of terms or concepts.

Examples

if (FALSE) {
# Retrieve a corpus of text 
tx <- quanteda::data_corpus_inaugural

# find the relative position of keywords
plotLexDiv(tx, keywords = c("democ","liber","freedom"))

# creates a dictionary
dic <- dictionary(list(
                Goverment=c("govern",
                            "executive",
                            "policy"),
                Nation=c("nation",
                         "homeland",
                         "patriot"),
                Democracy=c("democ",
                            "freedom",
                            "liberty",
                            "rights")
                ))

# Creates the graph                
plotLexDiv(tx, 
           keywords = dic)

# Aggregate the corpus according to party
plotLexDiv(tx, 
           docvar="President", 
           value=c("Nixon","Bush","Kennedy","Roosevelt"), 
           aggr.by.var = "Party")
}