active_learning

class deduplipy.active_learning.ActiveStringMatchLearner(col_names, interaction=False, coef_diff_threshold=0.05, verbose=0)

Bases: object

fit(X)

Fit ActiveStringMatchLearner instance on pairs of strings

Args:

X: Pandas dataframe containing pairs of strings

predict(X)

Predict on new data whether the pairs are a match or not

Args:

X: Pandas dataframe to predict on

Returns: predictions

predict_proba(X)

Predict probabilities on new data whether the pairs are a match or not

Args:

X: Pandas dataframe to predict on

Returns: match probabilities