text2vec (0.6)

Modern Text Mining Framework for R.


Fast and memory-friendly tools for text vectorization, topic modeling (LDA, LSA), word embeddings (GloVe), and similarity computation. The package provides a source-agnostic streaming API that lets researchers analyze document collections larger than available RAM. All core functions are parallelized to benefit from multicore machines.
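As a rough illustration of the vectorization workflow described above, the following minimal sketch builds a document-term matrix from a token iterator. The toy corpus and the variable names (docs, it, vocab, dtm) are assumptions made for this example. An in-memory character vector stands in for a real data source here, so the sketch does not itself demonstrate larger-than-RAM processing.

```r
library(text2vec)

# Hypothetical toy corpus; the names are used as document ids.
docs <- c(
  doc1 = "The quick brown fox",
  doc2 = "jumps over the lazy dog",
  doc3 = "The dog barks"
)

# Token iterator: lowercases each document and splits it into words.
it <- itoken(
  docs,
  preprocessor = tolower,
  tokenizer = word_tokenizer,
  ids = names(docs),
  progressbar = FALSE
)

# First pass over the iterator: collect the vocabulary.
vocab <- create_vocabulary(it)

# Second pass: map each document onto the vocabulary,
# giving a sparse document-term matrix (one row per document).
vectorizer <- vocab_vectorizer(vocab)
dtm <- create_dtm(it, vectorizer)

dim(dtm)
```

The same two-pass pattern should carry over to other sources by swapping the character vector for a different iterator, which is presumably what "source-agnostic" refers to.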

Maintainer: Dmitriy Selivanov
Author(s): Dmitriy Selivanov [aut, cre, cph], Manuel Bickel [aut, cph] (Coherence measures for topic models), Qing Wang [aut, cph] (Author of the WarpLDA C++ code)

License: GPL (>= 2) | file LICENSE

Uses: digest, lgr, Matrix, mlapi, R6, Rcpp, rsparse, stringi, proxy, glmnet, testthat, knitr, magrittr, rmarkdown, covr, udpipe
Reverse suggests: lime, quanteda, textrecipes

Released 3 months ago.