Skip to contents

An R data package containing a collection of Russian lexicons, polarity tables, stop words lists, lemmatization tables and etc. for text analysis.

Naming convention

The naming convention based on the idea of lexicon package, and help to categorize the data types.

Prefix

Description

key_

A data.frame with a lookup and return value

hash_

A keyed data.table hash table

hash_lemmas_

A lemmatization hash table

hash_sentiment_

A sentiment polarity hash table

profanity_

A profane words vector

sw_

A stopword vector

freq_

A data.table of terms with frequencies

_en An English version of dictionary

Installation

You can install this package from GitHub with:

# install.packages("devtools")
devtools::install_github("dmafanasyev/rulexicon")