Fuzzy Join
Fuzzy Join combines two datasets into a single table, based on a similar shared column.
This tool is currently in Beta and is still being tested. Want to learn more? Like to provide feedback? Please reach out to support@cascade.io
Fuzzy Join allows for the combination of two datasets into a single table, based on a similar shared column.
Note: Fuzzy Join always functions as a left join type
Input/Output
Input
Output
Left Table - any table
Right Table - any table
Single table with merged data from Left Table and Right Table and an additional match_score column
Options
Option
Description
Left Field
Column in the Left table that shares similar values with the Right table
Right Field
Column in the Right table that shares values with the Left table
Threshold
Threshold allows you to limit results to only rows with match scores above a set value. The match_score determines the best match for each combination when joining.
The FuzzyJoin tool will not automatically update while in the tool view. When you are finished selecting your prompts, simply click the 'Update' button in the bottom right corner to run the tool.
Last updated