Run Stitch

Stitch uses patented algorithms to evaluate massive volumes of data and discover the hidden connections in your customer records that identify unique individuals. Stitch outputs a unified collection of data in which each individual discovered in your customer records is assigned a unique identifier.

Add tables to Stitch run

Stitch runs only against selected domain tables. To make a domain table available to Stitch, enable the Make available to Stitch setting in the Feed Editor.

A domain table that is made available to Stitch must also be selected from the list of domain tables in the Stitch settings dialog box.

Each selected table is processed and compared for identity resolution, after which an Amperity ID is assigned to each unique customer discovered across all domain tables included in the Stitch run.
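The idea of one identifier per individual across several tables can be illustrated with a small sketch. This is not Amperity's patented algorithm. It is a generic union-find grouping that assumes the matching step has already produced a list of record pairs judged to be the same person. The table names, the `assign_ids` function, and the `person-N` identifier format are all hypothetical.

```python
def assign_ids(records, links):
    """Give every record an ID shared by all records linked to it.

    records: list of (table_name, row_key) tuples (hypothetical shape).
    links:   pairs of records assumed to refer to the same individual.
    """
    parent = {r: r for r in records}

    def find(r):
        # Follow parent pointers to the group's root, shortening the path as we go.
        while parent[r] != r:
            parent[r] = parent[parent[r]]
            r = parent[r]
        return r

    for a, b in links:
        parent[find(a)] = find(b)  # merge the two groups

    ids, result = {}, {}
    for r in records:
        root = find(r)
        if root not in ids:
            ids[root] = f"person-{len(ids) + 1}"  # illustrative ID format only
        result[r] = ids[root]
    return result


records = [("loyalty", 1), ("orders", 7), ("email_list", 3), ("orders", 8)]
links = [
    (("loyalty", 1), ("orders", 7)),
    (("orders", 7), ("email_list", 3)),
]
result = assign_ids(records, links)
for record, person_id in result.items():
    print(record, person_id)
```

In this sketch the first three records end up with the same identifier even though they come from different tables, and the unlinked record gets its own.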

Note

Only tables with the Make available to Stitch setting enabled in the Feed Editor will be available in the Stitch Settings dialog box.

To add tables to the Stitch run

From the Stitch tab, click Settings. This opens the Stitch settings dialog box. On the Stitched tables tab, select each of the tables to include in Stitch results, and then click Save.

The Stitched tables tab in the Stitch settings dialog box.

After you have selected the list of tables to include in Stitch results, return to the Stitch page, and then click Run.

Run Stitch

The duration of a Stitch run depends on the size of the data analyzed and the number of potential pairs in the data. In general, expect a Stitch run to take at least 20 minutes and as much as 2 hours to complete.
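To see why the number of potential pairs matters, consider how quickly pair counts grow in any record-comparison process. The sketch below is a general illustration, not a description of how Stitch counts or generates pairs. The function names and the single-key grouping are assumptions made for the example.

```python
from collections import Counter


def all_pairs(n):
    """Number of distinct pairs among n records: n * (n - 1) / 2."""
    return n * (n - 1) // 2


def pairs_within_groups(group_keys):
    """Pairs that remain when records are compared only within the same group."""
    return sum(all_pairs(size) for size in Counter(group_keys).values())


# Hypothetical grouping keys, one per record.
keys = ["smith", "smith", "smith", "jones", "jones", "lee"]

print(all_pairs(len(keys)))       # 15 pairs if every record is compared with every other
print(pairs_within_groups(keys))  # 4 pairs if comparisons stay within each group
print(all_pairs(1_000_000))       # 499999500000 pairs for one million ungrouped records
```

The pair count grows roughly with the square of the record count, which is why two data sets of similar size can take noticeably different amounts of time to process.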

You can navigate to other areas of Amperity and do other tasks while waiting for Stitch to complete. Stitch has two notifications: one for the Stitch run, followed by one for the Stitch report. The UI will refresh with updates when the Stitch report has been generated.

Click the Run button on the Stitch page to manually start a Stitch run.

Important

Before starting the Stitch run, verify that all tables that should be analyzed by Stitch have been made available to Stitch via the Feed Editor, that all processes that load data to Amperity (including couriers, feeds, and domain tables) have finished processing, and that all of these domain tables are selected in the Stitch settings dialog box.
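The checks above are performed by hand in the Amperity UI. For teams that track table status in their own notes or tooling, the same checklist can be written down as a small sketch. Nothing here calls an Amperity API: the `DomainTable` class, its fields, and `preflight_problems` are hypothetical names used only to make the three conditions explicit.

```python
from dataclasses import dataclass


@dataclass
class DomainTable:
    # All fields are filled in manually; they mirror the three checks in the text.
    name: str
    available_to_stitch: bool   # Make available to Stitch enabled in the Feed Editor
    selected_in_settings: bool  # selected on the Stitched tables tab
    load_finished: bool         # couriers and feeds for this table have finished


def preflight_problems(tables):
    """Return a human-readable list of reasons not to start the run yet."""
    problems = []
    for t in tables:
        if not t.available_to_stitch:
            problems.append(f"{t.name}: not made available to Stitch")
        elif not t.selected_in_settings:
            problems.append(f"{t.name}: not selected in Stitch settings")
        if not t.load_finished:
            problems.append(f"{t.name}: data load still in progress")
    return problems


tables = [
    DomainTable("loyalty", True, True, True),
    DomainTable("orders", True, False, True),
    DomainTable("email_list", False, False, False),
]
problems = preflight_problems(tables)
for line in problems:
    print(line)
```

An empty list means every table that should be analyzed has passed all three checks.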

Configure Stitch

A list of Stitch configuration settings is available from the Stitch page. Do not change these settings until your first Stitch run is complete and the Stitch QA process has started.

It is common to make some configuration changes after analyzing how Amperity has processed your data. Typical changes include adjusting how Amperity handles certain types of semantic tags, changing the blocking strategy, adjusting the thresholds for match categories, and updating the maximum number of allowed trivial duplicates.

In rare cases, adjustments may be made to how Amperity applies the results of blocking or to the clustering algorithm.