Lucene.Net 3.0.3: How does fuzzy search similarity correlate to the edit distance used in later versions (e.g. 4.x)?
In versions prior to 4.x, you set the similarity for a fuzzy search as a float between 0.1 and 1.0. Later versions use a value between 0 and 2 edit distances.
How are these values correlated? I cannot find anywhere in the documentation what the float range 0.1 to 1.0 actually means.
I'm using Lucene.Net 3.0.3.
From version 4.0 onward, fuzzy search uses the Damerau-Levenshtein edit distance directly.
Version 3.0.3 instead compares the edit distance against the length of the term: if length(term) * minSimilarity >= edit distance
(where minSimilarity is the float argument you are referring to), the term is considered a match.
So, if you set it to 0.5, a term of length 4 can have an edit distance of up to 2, while a term of length 6 can have a distance of up to 3, and still match.
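To make the arithmetic concrete, here is a minimal sketch of the 3.0.3 rule exactly as stated in this answer. It is plain Python, not Lucene.Net code. The function names `max_edits_3x` and `matches_3x` are hypothetical, and the real implementation may differ in details such as prefix handling or whether the comparison is strict.

```python
import math

# Hypothetical helpers modelling the rule as stated in this answer:
#   length(term) * minSimilarity >= edit distance
# This is an illustration, not Lucene.Net's actual source code.

def max_edits_3x(term_length: int, min_similarity: float) -> int:
    """Largest whole edit distance that still satisfies the stated rule."""
    return math.floor(term_length * min_similarity)

def matches_3x(term_length: int, min_similarity: float, edit_distance: int) -> bool:
    """True if a candidate at the given edit distance counts as a match."""
    return term_length * min_similarity >= edit_distance

# 4.x and later accept an edit distance between 0 and 2 (per the question).
MAX_EDITS_4X = 2

for length in (4, 6, 8):
    allowed = max_edits_3x(length, 0.5)
    note = "within the 4.x range" if allowed <= MAX_EDITS_4X else "beyond the 4.x range"
    print(f"term length {length}: up to {allowed} edits at minSimilarity 0.5 ({note})")
```

The point of the loop is that under this rule a single float does not map to a single edit distance: the allowed distance grows with the term length, so at 0.5 longer terms are allowed more edits than the maximum of 2 that 4.x accepts.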