string_metrics

deduplipy.string_metrics.adjusted_partial_ratio(x_1: str, x_2: str)float

Length adjusted version of partial_ratio metric. When at least one of the strings is short, the metric is reduced.

Parameters
  • x_1 – string to compare

  • x_2 – string to compare

Returns

adjusted partial_ratio metric

deduplipy.string_metrics.adjusted_ratio(x_1: str, x_2: str)float

Length adjusted version of ratio metric. When at least one of the strings is short, the metric is reduced.

Parameters
  • x_1 – string to compare

  • x_2 – string to compare

Returns

adjusted ratio metric

deduplipy.string_metrics.adjusted_token_set_ratio(x_1: str, x_2: str)float

Length adjusted version of token_set_ratio metric. When at least one of the strings is short, the metric is reduced.

Parameters
  • x_1 – string to compare

  • x_2 – string to compare

Returns

adjusted token_set_ratio metric

deduplipy.string_metrics.adjusted_token_sort_ratio(x_1: str, x_2: str)float

Length adjusted version of token_sort_ratio metric. When at least one of the strings is short, the metric is reduced.

Parameters
  • x_1 – string to compare

  • x_2 – string to compare

Returns

adjusted token_sort_ratio metric