Neighbor Levenshtein DC
Description:
This column lists orthographical neighbors to the downcased types in the Type DC (downcased) column in the same table. The definition of neighborhood applied here is like the one in the Neighbors Levenshtein table, but it is applied to a downcased version of the corpus: by ignoring case distinctions, the number of neighbors for a type is generally higher than in the case-sensitive table.
Example: Hans is downcased to hans and has, in the downcased corpus, higher frequency Levenshtein neighbors haus and hand, and less frequent Levenshtein neighbors ans, hals, hang, gans, hanns and more.
Please note that the number and the cumulative frequency of a downcased type's downcased neighbors are given in the Types DC table. They need not be calculated manually. The list of neighbors given here is just for reference and verification of the neighborhood measures given in the Types DC table.
Data type:
- String
- case-insensitive
- eq, regex
- no null value
Available in tables:
Contents
Current version
- 0.3
- New tables: all measures in case-insensitive variant.