On the size of the neighborhoods of a word
On the size of the neighborhoods of a word
The d-neighborhood of a word W in the Levenshtein distance is the set of all words at distance at most d from W. Generating the neighborhood of a word W, or related sets of words such as the condensed neighborhood or the super-condensed neighborhood has applications in the design of approximate pattern matching algorithms. It follows that bounds on the maximum size of the neighborhood of words of a given length can be used in the complexity analysis of such approximate pattern matching algorithms. In this note, we present exact formulas for the size of the condensed and super condensed neighborhoods of a unary word, a novel upper bound for the maximum size of the condensed neighborhood of an arbitrary word of a given length, and we prove a conjectured upper bound again for the maximum size of the condensed neighborhood of an arbitrary word of a given length.
Cedric Chauve、Louxin Zhang
计算技术、计算机技术
Cedric Chauve,Louxin Zhang.On the size of the neighborhoods of a word[EB/OL].(2025-05-19)[2025-07-16].https://arxiv.org/abs/2505.13796.点此复制
评论