Fuzzy Join

Fuzzy Join combines two datasets into a single table, based on a similar shared column.

This tool is currently in Beta and is still being tested. Want to learn more? Like to provide feedback? Please reach out to support@cascade.io

Fuzzy Join allows for the combination of two datasets into a single table, based on a similar shared column.

Note: Fuzzy Join always functions as a left join type

Input/Output

Input

Output

Left Table - any table

Right Table - any table

Single table with merged data from Left Table and Right Table and an additional match_score column

Options

Option

Description

Left Field

Column in the Left table that shares similar values with the Right table

Right Field

Column in the Right table that shares values with the Left table

Threshold

Threshold allows you to limit results to only rows with match scores above a set value. The match_score determines the best match for each combination when joining.

The FuzzyJoin tool will not automatically update while in the tool view. When you are finished selecting your prompts, simply click the 'Update' button in the bottom right corner to run the tool.

Last updated