stringdist (

0 users

Approximate String Matching and String Distance Functions.

Implements an approximate string matching version of R's native 'match' function. Can calculate various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal sting alignment), qgrams (q- gram, cosine, jaccard distance) or heuristic metrics (Jaro, Jaro-Winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors while taking proper care of encoding or between integer vectors representing generic sequences. This package is built for speed and runs in parallel by using 'openMP'. An API for C or C++ is exposed as well.

Maintainer: Mark van der Loo
Author(s): Mark van der Loo [aut, cre] (<>), Jan van der Laan [ctb], R Core Team [ctb], Nick Logan [ctb], Chris Muir [ctb]

License: GPL-3

Uses: tinytest
Reverse depends: AurieLSHGaussian, blink, VDJgermlines, vwr
Reverse suggests: c14bazAAR, googleLanguageR, rlist, sjmisc, spew, sprint, statar, tidycells

Released 7 months ago.

26 previous versions



  (0 votes)


  (0 votes)

Log in to vote.


No one has written a review of stringdist yet. Want to be the first? Write one now.

Related packages: RJDemetra, sdcSpatial, tau, textrank, micEconIndex, sdcHierarchies, PxWebApiData, weights, emdi, MatchThem, tokenizers.bpe, RcmdrPlugin.temis, qdap, textir, stringi, SnowballC, movMF, lda, languageR, koRpus(20 best matches, based on common tags.)

Search for stringdist on google, google scholar, r-help, r-devel.

Visit stringdist on R Graphical Manual.