Glossary

Welcome to the Amperity glossary.

A

ActiveCampaign

ActiveCampaign is an automation platform for ecommerce, business-to-consumer, and business-to-business that helps you use email and SMS marketing to reach, nurture, convert, and grow your customer base.

active customer

An active customer is one whose likelihood of returning to make another purchase is above 50%.

active database

An active database is fully configured to use stitched output tables, is ready to run, is ready for queries and segmentation, and is usable with scheduled workflows.

active segment

An active segment runs from the Segments page or run automatically using an orchestration. An active segment:

  • Must have a unique name

  • Remains active until deleted

  • Must be a draft segment for editing; a draft segment has the same name as the active segment, appended with “–Draft”

  • Remains active until activated

Acxiom

Acxiom collects, analyzes, and sells information about customers and businesses for use with targeted advertising campaigns.

ad suppression

Ad suppression is an approach to marketing that intentionally limits or prevents the display of advertisements to certain users or in specific contexts. For example: preventing users from seeing an ad too often, removing users from an audience when they are not relevant, or ensuring compliance with advertising guidelines or regulatory policies.

address

The address connected with the location of a customer, such as “123 Main Street”.

-or-

The address connected with the location of a unique Amperity ID. For example: 123 Main Street.

address-based householding

Address-based householding is a process that standardizes addresses, pairs them with a last name, and then assigns that pair a Household ID.

address2

Address information, such as an apartment number or a post office box, connected with the location of a customer, such as “Apt #9”.

-or-

Address information, such as an apartment number or a post office box, connected with the location of a unique Amperity ID. For example: Apt #9.

Adjust

Adjust is an analytics platform that helps marketers grow their mobile apps with solutions for measuring and optimizing campaigns and protecting user data.

Adobe Analytics

Adobe Analytics offers useful intelligence about customer activity on Web sites and mobile devices. Marketers can analyze clickstream data to understand what their customers are doing in real-time, and then optimize customer experiences across brands.

Adobe Campaign

Adobe Campaign is a cloud-based service that manages direct marketing campaigns, leads, resources, customer data, and analytics.

Adobe Commerce

Adobe Commerce is a digital commerce and order management platform for building multichannel ecommerce experiences on a single platform. Adobe Commerce uses MySQL for database management.

Adobe Customer Attributes

Adobe Customer Attributes is a part of Adobe Experience Cloud. Upload visitor information as a set of customer attributes, and then make those attributes available to Adobe Target and Adobe Analytics.

Adobe Experience Platform

Adobe Experience Platform is a part of Adobe Experience Cloud that transforms data into robust customer profiles. Use updated customer profiles to optimize customer experiences across every channel.

Adobe Marketo

Adobe Marketo is marketing automation software that streamlines, automates, and measures marketing tasks and workflows.

AdRoll

AdRoll is an advertising platform that helps companies grow revenue through seamless marketing and ads.

Affinity table

An Affinity table associates individual customers to the products they are most likely to purchase. Use an Affinity table to help deliver personalized experiences to your customers.

Airship

Airship is a customer engagement platform that helps marketers connect with their customers across every stage of the customer lifecycle.

Airtable

Airtable works like a spreadsheet but gives you the power of a database to organize anything.

All Opted In Emails

The All Opted In Emails table has all opted-in email addresses and associated Amperity IDs. Use the Email Opt Status table to build this table, after which you can extend it to support brands, regions, email programs, and language preferences.

Amazon Ads

Amazon Ads is an online advertising platform that helps brands meet their audiences where they shop, read, listen, and watch.

Amazon DSP

Amazon DSP is a demand-side platform for paid media advertising that lets your brand programmatically buy ads to reach new and existing audiences on and off Amazon.

Amazon Kinesis Data Firehose, AWS Kinesis

Amazon Kinesis Data Firehose is a fully managed service for delivering real-time streaming data to Amazon S3.

Amazon Marketing Cloud

Amazon Marketing Cloud (AMC) is a secure, privacy-safe, and cloud-based clean room solution. Advertisers can perform analytics and build audiences across pseudonymized signals, including Amazon Ads signals as well as their own inputs.

Amazon Pinpoint

Amazon Pinpoint is a flexible and scalable communications service for inbound and outbound marketing. Use Amazon Pinpoint to connect with your customers over a variety of channels, such as email, SMS, push, voice, in-app messaging.

Amazon QuickSight

Amazon QuickSight is a cloud-based, self-service BI tool for creating and publishing interactive dashboards for retail, ecommerce, manufacturing, and more.

Amazon Redshift

Amazon RedShift is a data warehouse located within Amazon Web Services that can handle massive sets of column-oriented data.

Amazon S3, Amazon Simple Storage Service

Amazon Simple Storage Service, more commonly referred to as Amazon S3, stores customer data files of any size in any file formats.

Amperity

Amperity is a cloud data platform analyzes all types of source data, and then builds accurate customer profiles even when source data is incomplete, inconsistent, or lacks linking keys. Amperity merges behavioral, contextual, and transactional data to form useful profile views with out-of-the-box attributes, affinities, and insights. Use queries and segments to build audiences, and then send those audiences to any downstream workflow.

Amperity Bridge

Amperity Bridge enables sharing data between Amperity and a data lakehouse using industry-standard data formats. Each bridge configures inbound or outbound connections that give you access to shared tables without replicating data.

Amperity Data Warehouse (ADW)

See Business Intelligence Connect.

Amperity ID

An Amperity ID is a patented unique identifier assigned to clusters of customer profiles. A single Amperity ID represents a single individual.

Note

The Amperity ID is a universally unique identifier (UUID) that is 36 characters spread across five groups separated by hyphens: 8-4-4-4-12.

For example:

123e4567-e89b-12d3-a456-426614174000
Amperity.js

Amperity.js is a JavaScript-based software development kit that integrates directly with any website or web application to support real-time streaming of data from that website to Amperity.

Amplitude

Amplitude is a self-service digital analytics platform to understand your users, drive conversions, and increase engagement, growth and revenue.

anonymous visitor

An anonymous visitor is an individual who visits a website, but does not sign up to receive information and does not purchase anything from the website. Anonymous visitors are interesting from a marketing perspective because they have shown some form of intent to engage with a brand. Anonymous visitors are a significant percentage–often more than 95%–of website traffic.

Apache Avro

Apache Avro is a row-oriented remote procedure call and data serialization framework developed within the Apache Hadoop ecosystem. Avro uses JSON to define data types and protocols, and serializes data in a compact binary format.

Apache Parquet

Apache Parquet is a free and open source column-oriented data storage format developed within the Apache Hadoop ecosystem. It is similar to RCFile and ORC, but offers more efficient data compression and encoding schemes with enhanced performance and can better handle large amounts of complex bulk data.

Apple Search Ads

Apple Search Ads helps people discover your app on the App Store, matching customers with your app at the right moments.

AppsFlyer

AppsFlyer is a mobile app tracking and attribution analytics platform that helps drive predictable app growth, protects customer privacy and delivers exceptional mobile experiences.

Attentive Mobile

Attentive is a personalized messaging platform that improves brand engagement with mobile consumers.

attributes explorer

The attributes explorer is an expandable list of tables and column names that you can reference while building queries and segments. The attributes explorer is located on the right-side of the Segments page and shows the list of available attributes for each table.

attributes table

An attributes table consolidates a set of fields for use with a specific downstream workflow. Assign names to fields to correctly map them with the naming patterns required by a workflow or downstream system.

audience

An audience is a list of customers who match the attributes, filters, and conditions that define a segment.

audience sizes

Audience sizes represent the total number of customers that are necessary for a percentage of purchases to occur within a given time window.

  • A large audience represents the percentage of the total audience required to capture 90% of total purchases 30 days ago.

  • A medium audience represents the percentage of the total audience required to capture 70% of total purchases 30 days ago.

  • A small audience represents the percentage of the total audience required to capture 50% of total purchases 30 days ago.

The purchase curve and audience sizes.
AUR, average unit retail

The average selling price for a retail item.

Australian Privacy Principles (APP)

Australian Privacy Principles (APP) is a law that covers data protection and privacy in Australia. It governs a broad set of standards, including rights and obligations around the collection, use, and disclosure of personal information and the rights of individuals to access their personal information.

authentication

Authentication is the process of establishing that a user is who they say they are. For example, by asking a user to give a username and password.

authorization

Authorization is the process of establishing that a user can perform a given action. Authorization map to roles in Amperity.

automatic courier

An automatic courier is an active courier configured to run as part of a scheduled workflow. Configure an automatic courier from the Sources page.

automatic query

An automatic query is an active query configured to run whenever upstream data changes. Configure an automatic query from the Queries page.

automatic segment

An automatic segment is an active segment configured to run whenever upstream data changes. Configure an automatic segment from the Segments tab.

AWS Connect

Amazon Connect is a contact center as a service solution that offers self-service configuration and enables dynamic, personal, and natural customer engagement at any scale.

average order value

Average order value is the average order value for a unique customer’s transactions.

average unit retail

See AUR.

AWS Lambda

AWS Lambda runs code for any application or service configured to run automatically from within Amazon Web Services to support any downstream workflow.

Azure Blob Storage

Azure Blob Storage is an object storage solution for the cloud that stores massive amounts of unstructured data.

Azure Cloud Functions

Azure Functions is a serverless compute service that offers code on-demand services without having to provision or manage infrastructure.

Azure Data Factory

Azure Data Factory is a service in Azure that can convert any data format into another data format, such as converting Apache Parquet to CSV.

Azure Data Share

Azure Data Share is a service for sharing data in any format and any size with Amperity. Azure Data Share requires no infrastructure setup or management and uses underlying Azure security measures as they apply to both Azure accounts. Snapshot-based sharing does not require a special access key.

Azure Synapse Analytics

Azure Synapse Analytics is a limitless analytics service and data warehouse. Azure Synapse Analytics has four parts: SQL analytics, Apache Spark, hybrid data integration, and a unified user experience.

B

bad-values blocklist

A bad-values blocklist has known values that appear often in data. The Stitch process should exclude values from the bad-values blocklist.

Bazaarvoice

Bazaarvoice allows retailers to manage user-generated content on their website.

BigCommerce

BigCommerce is an all-in-one tool used to build storefronts, optimize product searches, and convert shoppers into customers.

birthdate

The date of birth connected with a customer.

-or-

The date of birth connected with a unique Amperity ID.

block

A block is a group of records that match the characteristics defined by the blocking strategy.

blocking

Blocking is a process that uses rules to divide massive datasets into smaller blocks. Smaller and offer higher probabilities of discovering matching records.

blocking key, bk-[semantic]

A blocking key defines a specific combination of characters for a blocking strategy. For example, the first three characters in given-name, the first character in surname, and birthdate represent a blocking key.

blocking strategy

A blocking strategy acts like a filter against large datasets. Each blocking strategy applies its filter. All matching records group together into a block. Each record that matches a blocking strategy is a blocking key.

Bluecore

Bluecore is a marketing technology company that intelligently connects casual shoppers to products with the goal of transforming those casual shoppers into lifetime customers.

Bluekai

See Oracle DMP.

brand

The brand associated with a customer interaction.

Braze

Braze is a leading marketing automation platform for creating custom experiences based on sophisticated customer attributes and segments, and then mapping those experiences to campaigns.

Brightloom

Brightloom is a customer growth platform for restaurants, retailers, and consumer brands that leverages customer transaction history and marketing campaign data to predict individual customer buying preferences.

Business Intelligence Connect

Business Intelligence Connect is an Amperity-managed cloud data warehouse that offers a location from which you can use any BI tool to access your Amperity data.

byte order mark

A byte order mark (BOM) is an optional usage of the special Unicode character “U+FEFF BYTE ORDER MARK”. This character starts the file and identifies byte order and character encoding within the file.

C

California Consumer Privacy Act (CCPA)

The California Consumer Privacy Act (CCPA) is a law that covers data protection and privacy in the state of California. It gives control to individuals over their personal data and addresses the transfer of personal data, including providing for the ability to request removal of data.

Camelot SMM

Camelot SMM offers marketing strategies and media services to top brands of all sizes, leveraging data, technology, and industry-leading experience.

campaign

A campaign is a message or offer sent to a specific group of customers or recipients.

campaign type

A campaign may be one of the following types:

  • One-time

  • Recurring

Campaign Recipients

The Campaign Recipients table has a list of Amperity IDs associated with campaigns sent from Amperity, along with details about the campaign. These details include control and treatment groups, audience segments, destinations, and launch dates.

Campaigns tab

The Campaigns page uses segments to build audiences, and configure how Amperity send audiences to any downstream marketing workflow.

cardinality

Cardinality is a measure of how many unique values are present in data. A higher cardinality indicates a larger percentage of unique values, whereas a lower cardinality indicates a higher percentage of repeat values.

CASS, coding accuracy support system

Coding accuracy support system (CASS) is an address standardization concept that helps clean address to make them more effective for direct mail campaigns.

channel

A channel is the purchase method used by a customer. For example: online, point-of-sale, loyalty, and in-store.

Cheetah Digital by Marigold

Cheetah Digital by Marigold is a customer engagement platform that helps marketers deliver personalized experiences, create effective cross-channel messaging, and increase customer loyalty.

churn

Churn represents the likelihood of a customer not making a purchase during the next year.

churn prevention

Churn prevention is a process of identifying, and then marketing to users who are likely to churn.

churn propensity

Churn propensity is a predictive model that finds the likelihood that a customer is active based on their purchase history. The churn propensity model outputs a score between 0 and 1 that represents a customer’s probability of returning to make a purchase.

  • A score closer to 0 indicates a low probability of churn, suggesting the customer is likely to remain active.

  • A score closer to 1 indicates a high probability of churn, suggesting the customer is at risk of leaving.

city

The city connected with the location of a customer.

-or-

The city connected with the location of a unique Amperity ID.

claim

A claim is a set of information shared by an identity provider (IDP) with a service provider (Amperity). Each individual claim key specifies a single claim, such as a user’s email address, name, or the user’s role in Amperity.

clienteling

Clienteling is a technique used by retail sales associates to establish long-term relationships with key customers based on data about their preferences, behaviors and purchases.

cluster graph

A cluster graph is one of the outcomes of the Stitch process. It is a visual representation of every pairwise connection in a cluster of records.

The data explorer, showing the cluster graph.

See pairwise connection and stitched records.

cluster transition

A cluster transition occurs when records move from one cluster to another during the Stitch process.

clustering

Clustering is the process of deciding which records to include in a customer profile. A matching threshold defines the lowest threshold at which two records match, and then included in a cluster. Lower quality matches are a transitive connection. Distinct customer profiles emerge as a cluster of record pairs.

CLV

Customer lifetime value (CLV) measures how valuable a customer is to your brand.

COGS, cost of goods sold

Cost of goods sold (COGS) are the direct costs of producing goods sold by a brand, including the costs of materials and labor to produce the item, but excluding indirect expenses like distribution or sales.

combined requests table

The Combined Requests table consists of all the data subject access request (DSAR) data pulled from one or more tables with compliance semantics assigned to them.

common table expression (CTE)

A common table expression (CTE) is a named subquery defined by the WITH clause. A CTE defines a set of variables that act as a temporary view, an optional list of column names, and a query expression. The results of the query expression behaves like a table.

company

The company, typically an employer or small business, connected with a customer.

-or-

The company, typically an employer or small business, connected with a unique Amperity ID.

completed query

A completed query is an active query that has run successfully through Amperity as part of an orchestration within the past 24 hours.

completed segment

A completed segment is an active segment that has run successfully through Amperity as part of an orchestration within the past 24 hours.

completion

Completion is the percentage of non-NULL values within a column.

compliance detail report table

The Compliance Detail Report passthrough table has one row for each found record.

compliance overview report table

The Compliance Overview Report passthrough table has a row for each request.

component ID

An identifier that represents a set of records that are transitively connected with a score above threshold as an outcome of blocking and initial scoring. Records that share a component ID, but have different Amperity IDs, are split during hierarchical comparison.

Concise Binary Object Representation (CBOR)

CBOR is a binary data serialization format based on JSON. Like JSON it allows the transmission of data objects that contain name-value pairs, but in a more concise manner. This increases processing and transfer speeds at the cost of human-readability.

conversions

A conversion occurs when a customer takes a desired action in response to a call to action on an online website or platform. For example, signing up for a newsletter through a pop-up on a website, making a purchase after clicking on an ad, or tapping on a push notification on a mobile device.

conversion rate

Conversion rate is the percentage of website visitors who convert by taking a desired action, such as making a purchase or subscribing to a newsletter. A higher percentage represents a higher conversion rate.

Connected TV, CTV

A connected TV (CTV), is a physical device that a customer uses to watch and stream video content online. Examples of physical devices include:

  • Smart TVs, such Apple TV, Fire TV, and Android TV

  • Streaming sticks, such as Amazon FireStick and Chromecast

  • Gaming consoles, such as XBox and PlayStation

consumer packaged goods (CPG)

Consumer packaged goods are items that require routine replacement or replenishment, such as food, beverages, clothes, personal items, and household products, on a daily basis.

control group

A control group is the percentage of an audience who do not receive communications related to a marketing campaign. Use control groups to establish a baseline against which you can measure the success of a campaign over time.

Cordial

Cordial is a cross-channel marketing and data platform that collects customer data into one platform for use with audience segments, trends, and automated customer experiences.

core table

Core tables represent a normalized foundation for data built from the results of the Amperity identity resolution process. Use individual core tables with any database your brand builds within Amperity.

See standard core table and custom core table.

cost

Cost represents the total cost of all goods sold (COGS) for a product.

country

The country connected with the location of a customer.

-or-

The country connected with the location of a unique Amperity ID.

Coupa

Coupa is a cloud platform for business spend management. Coupa delivers measurable value through real-time spend visibility, control, compliance, and agility.

courier

A courier brings data from an external system to Amperity.

courier fileset

See fileset.

courier group

A courier group is a list of one or more couriers that run as a group. A courier group can act as a constraint on downstream workflows and can run automatically as part of a scheduled workflow.

courier group schedule

A schedule defines the frequency at which a courier group runs. All couriers in the same courier group run as a unit and all tasks must complete before a downstream process starts. Define a schedule using cron.

Criteo

Criteo is a commerce media platform that helps marketers and media owners manage and scale campaigns. Send audiences to Criteo, and then advertise to customers across paid media, including connected TV (CTV), banner ads, and video ads.

Criteo Retail Media

Criteo Retail Media allows marketers to build audiences using their own first-party data, and then monetize those audiences by allowing other brands to use them for their own marketing campaigns.

cron

Cron is a time-based job scheduler that uses cron syntax to automate scheduled jobs to run periodically at fixed times, dates, or intervals.

crontab syntax

Cron syntax specifies the fixed time, date, or interval at which cron runs. Each line represents a job. 30 8 * * * represents “run at 8:30 AM every day” and 30 8 * * 0 represents “run at 8:30 AM every Sunday”.

For example:

┌───────── minute (0 - 59)
│ ┌─────────── hour (0 - 23)
│ │ ┌───────────── day of the month (1 - 31)
│ │ │ ┌────────────── month (1 - 12)
│ │ │ │ ┌─────────────── day of the week (0 - 6) (Sunday to Saturday)
│ │ │ │ │
│ │ │ │ │
│ │ │ │ │
* * * * * command to execute

Amperity validates the cron syntax and shows you the results. You may also use crontab guru to validate cron syntax.

Cross Country Computer

Cross Country Computer specializes in strategic solutions for companies that are seeking to evolve their marketing methods around acquiring, retaining, and developing valuable customers.

CSV, comma-separated values

A comma-separated values (CSV) file, defined by RFC 4180 , is a delimited text file that uses a comma to separate values. A CSV file stores tabular data–numbers and text–in plain text. Each line of the file is a data record. Each record consists of one or more fields, separated by commas. The use of the comma as a field separator is the source of the name for this file format.

currency

Currency represents the currency used to pay for an item. For example: dollar.

custom core table

A custom core table built using Spark SQL and may reference one or more core tables or domain tables. Use custom core tables to extend the normalized foundation to support use cases beyond what the set of standard core tables offers.

custom database table

A custom database table modifies the schema of a source table to apply semantic tags or change the shape of the data within the table. For example, splitting values in a single column into more than one column, and then applying semantic tags. Use custom domain tables as inputs to Stitch when they contain customer profile data.

custom domain table

A custom domain table is a domain table that defines its schema using Spark SQL.

customer

A customer is a known individual with a core set of associated personally identifiable properties, such as their given name, surname, birthdate, mailing address, email address, and phone number.

customer 360 database

A customer 360 database uses standard core tables generated by the Stitch process. These tables offer a unified view of your brand’s customer data, including customer profiles and interaction records, organized, merged, and linked together by the Amperity ID.

customer 360 profile

Amperity creates a unique profile for each unique customer profile, assigns an Amperity ID, and then stores these profiles in the Customer 360 table. Amperity calculates the number of unique profiles by counting the number of Amperity IDs in the Customer 360 table.

Customer 360 page

The Customer 360 page is the interface to view and manage all databases, including the customer 360 database, along with related customer profile and interaction data. Explore and interact with stitched tables. Use Spark SQL to build the customer 360 database, add tables, define predictions, and surface data points for use with queries and segments.

The Customer 360 page in Amperity.
C360, Customer 360 (data table)

The Customer 360 table is the unified view of the customer across all points of engagement, including attributes that cross systems. This table does not exist by default. Each row represents a complete record for a unique individual, including their Amperity ID, merged PII data, and summary attributes.

Customer Attributes

The Customer Attributes table has a series of columns that identify attributes about individuals. For example:

  • Is an individual contactable?

  • Is there a marketable email address?

  • Is the physical address known?

  • Is there a phone number?

  • Are they are an employee, reseller, or a test account?

  • What is the individual’s revenue relationship with the brand?

customer data table

A customer data table has customer profiles from an external source system.

customer key, ck

A customer key is a column in a data table with a unique identifier that represents a unique ID for data as it exists outside of Amperity.

customer lifecycle status

Customer lifecycle status is a probabilistic score–referred to as p(return) or “probability of return”–that identifies if a customer is active or if they are likely to churn.

A customer’s p(return) score finds the customer lifecycle status tier:

Status tier

p(return) score

Active

p(return) score is over 60%

Cooling down

p(return) score is between 50%-60%

At risk

p(return) score is between 35%-50%

Highly at risk

p(return) score is between 20%-35%

Lost

p(return) score is below 20%

customer lifetime value (CLV)

See CLV.

customer profile

A customer profile is a collection of attributes connected to a single unique individual in the customer 360 database. The total number of customer profiles is equal to the total number of rows in the Customer 360 table. This total correlates strongly, but not exactly, with the total number of Amperity IDs assigned to unique individuals in the same dataset.

customer record

A customer profile is a row in a customer data table that has information about the customer. Who they are, where they live, and how much they spend. For example, a email list table has names, email addresses, or phone numbers.

Custora

Custora was a cloud-based customer analytics solution that applies machine learning across retail, organization, and customer touchpoints.

D

data explorer

The Data Explorer offers a detailed way to navigate through data in Amperity. The information available from the Data Explorer varies:

  • The Stitch page opens the Data Explorer to three views: stitched connections, cluster graphs, and pairwise connections.

  • The Customer 360 page opens the Data Explorer to a view that shows all tables in the database, from which you may view each of the individual tables.

  • The Segments page opens the Data Explorer to a specific table, and then offers a view of its schema along with sample data.

The data explorer, as opened from within the Stitch page in Amperity.
data lineage

Data lineage is a graph visualization that shows the connections between data in your tenant.

data mapping

A data mapping template defines how columns in Amperity output map to fields or columns required by a destination, including the ordering and naming of columns. This is especially useful when a destination has case-sensitive column names or when it uses a REST API with specific requirements for naming patterns. A destination that does not have requirements for the shape of data should use NULL instead of defining a data mapping structure.

data source

A data source is a system in a customer’s environment that sends data to Amperity as structured or semi-structured data through cloud-based storage or a REST API.

data subject access request (DSAR)

A data subject access request (DSAR) is a written request made by an individual to ask for regulatory compliance. For example: California Consumer Privacy Act (CCPA), General Data Protection Regulation (GDPR), or Personal Information Protection and Electronic Documents Act (PIPEDA). A DSAR requires a response within a pre-defined time window, typically 30 days.

data table

A data table is a set of rows organized into named columns with types, typically as output from some database application in a SQL format.

data template

A data template defines how Amperity sends columns in data structures to downstream workflows. A data template is part of the configuration for sending query and segment results from Amperity to an external location.

data type

A data type defines the data that is in a column. Amperity supports the following types: integer, string, float, decimal, date, datetime, map, array, and Boolean. When defining a feed for ingesting customer data to Amperity, it is important to consistently apply the correct data type for incoming columns.

Databricks

Databricks offers a unified platform for data and AI that supports large-scale processing for batch and streaming workloads, standardized machine learning lifecycles, and accelerated data science workflows for large datasets.

Datalogix

Datalogix, part of Oracle Data Cloud, connects offline purchase signals to digital media, helping marketers in the United States create comprehensive consumer profiles across digital, mobile, offline, and TV.

See Oracle Data Cloud.

dataset

A dataset is a collection of data grouped together to support use cases required by a high-level goal. A dataset may contain smaller datasets, as if they are building blocks.

days since last order

Days since latest order measures the number of days that have elapsed since a customer has placed an order.

deduplication

Deduplication is an output of the Stitch process that identifies the total number of unique individuals within a customer dataset.

deduplication rate

The deduplication rate represents the total number of unique individuals within a customer dataset. This rate measures the difference between the total number of original identifiers in customer data and the total number of Amperity IDs assigned to unique individuals.

delete confirmation

A delete confirmation sent from Amperity to confirm completion of a request to delete a user record.

delete user record request

A delete user record request. This is a written request that asks to delete data related to a specific user record from Amperity data tables.

Delta Lake

Delta Lake is an open source format that involves a metadata layer on top of columnar files in cloud storage using the Apache Parquet file format.

Delta Sharing

Delta Sharing is an open protocol for secure sharing of live data between organizations. Delta Sharing generates temporary credentials that allow access to individual data files in cloud storage without copying data to another system regardless of computing platform.

Demandware

See Salesforce Commerce Cloud.

derived semantic

A derived semantic is inferred from existing data. For example, “given-name” and “surname” semantics are from “full-name”. “gender” is from “title”. Amperity creates derived semantics automatically to ensure that individual semantics are available to Stitch during identity resolution.

destination

A destination is a location that receives data from Amperity.

destination plugin

A destination is a template that defines how Amperity sends data to an external location. When you select a destination, the dialog box updates to show the list of fields for the selected destination.

Destinations tab

The Destinations page is the interface for defining destinations for campaigns and orchestrations, the destinations to which individual segments are sent, and automated workflows.

The Destinations page in Amperity.
Detailed Examples

The Detailed Examples table has detailed examples of Stitch results. Use these examples to help identify which features lead to scores with the biggest effect on Stitch results, including how they associate with various combinations of fields that contain PII data.

deterministic

Deterministic is identity resolution that uses rules-based matching to generate exact matches. The results rank predictability over accuracy. For example: processing that connects a person with their payments.

digital channel

The digital channel for a transaction. For example: Facebook, Google Ads, email, TikTok Ads, or Reddit.

direct mail

Direct mail is a marketing strategy that uses printed media to send offers and advertising to a customer’s physical address.

directly identifying information (DII)

Directly identifying information (DII) is information that directly identifies an individual, such as a name, an email address, or a phone number.

discount sensitivity

Discount sensitivity is a predictive model that categorizes customers into three behaviors: full price shopper, discount opportunist, and discount seeker.

display advertising

Display advertising on websites attracts new customers who are otherwise difficult to reach.

domain table

A source domain table exists for each data source loaded to Amperity. Apply semantic tags to fields in source domain tables for customer profiles, transactions, loyalty programs, and customer events.

See stitched domain table.

Domo

Domo is a cloud-based, self-service BI tool that helps you visualize data from a single dashboard.

downstream

Downstream refers to the series of steps in a workflow that occurs after the current step. Downstream steps often have dependencies on the current step. For example, if the current step updates a data table, then dependent segments are rerun to return an audience with updated data.

draft database

A draft database is inactive and not available for queries or segments. Use a draft database for testing or experimentation.

draft segment

A draft segment is any segment that is not an active segment. A draft segment:

  • Is the default type for any segment that added to Amperity

  • Exists in a draft state until activated

  • May exist indefinitely

  • Appends “–Draft” when it is a working copy of an existing active segment; only a single working copy of an active segment may exist at any time

  • Is auto-saved

  • Allows downloading results as a CSV file

  • Runs within the Segment Editor

  • Must activate before running it on the Segments page adding it to a campaign

DSAR

See data subject access request (DSAR).

DSAR response

A response to a DSAR sent from Amperity to confirm that PII data related to discovery of the subject of a DSAR.

duplicate record

A duplicate record matches to one or more other records within a cluster. Amperity uses duplicate records to calculate the duplication rate.

duplication rate

The duplication rate is the percentage of all ingested customer profiles that are duplicate records. Duplication rate applies to each customer table as well as the overlap between customer tables.

Dynamic Yield

Dynamic Yield helps companies build and test personalized, optimized, and synchronized digital customer experiences.

Dynamics 365 Marketing

Dynamics 365 Marketing helps you build personalized journeys for real-time and outbound marketing, along with event management capabilities.

E

early repeat purchaser

Early repeat purchasers represent the percentage of first-time buyers who returned to make a second purchase within 90 days.

early repeat rate, ERR

Early repeat rate is a measure of one-time to two-time buyer conversion based on second purchases made within the previous 90 days.

edge

An edge is the relationship between two vertices in a graph database. Each edge has a type and must start with one vertex and end with another.

email

The email address connected with a customer. A customer may have more than one email address.

-or-

The email address connected with a unique Amperity ID. A customer profile may have many email addresses.

email engagement

Email engagement data captures the history of email interactions between a customer and a brand, such as opens and clicks, including clicks by day and by month, unsubscribes, conversions, and bounces.

email events

Email events associate email summary statistics to brands, email addresses, regions, event types, event dates and times, and sender IDs.

email summary statistics

Email summary statistics offer fields that summarize customer engagement with your brand. Individual statistics include brand, email address, counts for opens and clicks by day and by month, engagement frequency, and engagement status.

Email Engagement Attributes

The Email Engagement Attributes table has many of the same fields as the Email Engagement Summary table, except for the addition of the Amperity ID field. Whereas the Email Engagement Summary table is unique by email and brand, the Email Engagement Attributes table is unique by the Amperity ID and email for each brand combination.

Note

In the Email Engagement Attributes table, each Amperity ID should only have one email address, per brand.

Tip

The Email Engagement Attributes table pulls the email engagement data, for each Amperity ID, from the Email Engagement Summary table using the email associated with it in the Merged Customers table.

Email Engagement Summary

The Email Engagement Summary table has a summary of email event statistics, such as counts for opens and clicks, the first open, and the most recent click, unique by email address.

Email Opt Status

The Email Opt Status table has a row for each unique combination of email address, brand, region, and email program.

Emarsys

Emarsys is a customer engagement platform that helps marketers deliver personalized engagement across channels.

environment

An environment represents the cloud platform in which an Amperity tenant runs: Amazon AWS or Microsoft Azure. All aspects of an Amperity tenant run within the same environment, including:

  • Processes that pull data from customer data sources

  • Processes that send data to destinations

  • SQL for databases, queries, and segments

  • Sandboxes

Epsilon

Epsilon offers a suite of loyalty marketing services that spans database marketing, direct mail, email marketing, web development, loyalty programs, analytics, data services, strategic consulting, and creative services.

Epsilon Abacus offers data and analytic expertise to help improve your marketing activities and deliver a greater return on your marketing investment.

Epsilon Conversant offers services to help marketers recognize and measure the value of personalization solutions.

Epsilon Targeting offers market intelligence solutions for understanding multi-channel buying patterns and habits, and then following-up with personalized messages for better marketing campaign results.

escape character

An escape character is specific to a data format and allows certain character sequences to interpret differently from characters that are not prefixed by that escape character.

event character

Event propensity is a predictive model that finds the likelihood that a customer performs a revenue-generating event within the next 30 days.

Evergage

See Salesforce Interaction Studio.

ExactTarget

See Salesforce Marketing Cloud.

exclusion list

An exclusion list identifies customers who should not receive communications related to a campaign.

Experian

Experian offers credit reporting services and scores to businesses and consumers to help protect them from fraud and identity theft.

F

Facebook Ads

Ads on Facebook appear in a variety of locations, including the news feed and within the right-side column on pages. Use Meta Ads Manager to manage ad placements on Facebook.

feed

A feed defines how to load data into a domain table, including specifying required columns and columns with semantic tags for customer profile (PII) or transactions data.

Feed Editor

The Feed Editor is the second step adds a new data source to Amperity. The Feed Editor is the interface in which you assign field types, apply semantic tags and primary keys, and then make the data source available to Stitch.

field

A field refers to a single piece of data within a record or a set of records. In relational databases, this corresponds to the data within a column. For instance, within an Amperity customer table, a field would describe an individual piece of data, such as zip code, or phone number.

filedrop

A filedrop is the most common way to transfer data to and from Amperity. A cloud-based filedrop location stores and retrieves any amount of data from anywhere without restriction on file format or file size. A filedrop location may be Amperity-managed or customer-managed.

filename template

A filename template defines the naming pattern for files that sent from Amperity. Specify the name of the file, and then use Jinja-style string formatting to append a date or timestamp to the filename.

fileset

A fileset is a group of files processed as a unit by a single courier. A fileset defines each file individually by name, datestamp, file format, and load operation. A courier expects all files in a fileset to be available for processing, unless a file is as optional.

first order date, first order datetime

First order date is the date on which a customer placed their first order.

first order ID

First order ID is the order ID for a customer’s first order.

first order revenue

First order revenue is the total revenue connected with a customer’s first order, ignoring returned and canceled items.

first order total items

First order total items represents the number of items purchased in a customer’s first order, ignoring returned and canceled items.

first-party data

Customers share first-party data with a company or a brand, often directly. For example: filling out a registration card, completing an online form, requesting an emailed receipt, using a mobile app, or responding to email and SMS messaging.

first-matching-sub-audience

A first matching sub-audience refers to a customer connected with many sub-audiences across many brands and geographic regions. Use the first matched sub-audience to prevent the Amperity ID from appearing too often in a marketing campaign.

first-to-latest order days

First-to-latest order days is the number of days that have elapsed between the date of the first order and the date of the latest order.

first-to-second order days

First-to-second order days is the number of days that have elapsed between the date of the first order and the date of the second order.

fiscal calendar

A fiscal calendar is a yearly accounting period that aligns the weeks and months in a calendar year with holidays and a brand calendar. Use a fiscal calendar to align the business for an entire calendar year. A common fiscal calendar brands use is the 4-5-4 fiscal calendar.

foreign key, fk

A foreign key is a column in a data table that acts as primary key and is often used for deterministic matching of records. A record pair assigns an exact match score–5.0–when foreign keys contain the same values during pairwise comparison.

frequency capping

Frequency capping is an approach to marketing that sets limits on the number of times individuals can see specific advertisements within a given time period. For example:

  • Limit the number of impressions shown to a user.

  • Limit the number of times a user clicks on an ad.

  • Limit ads by time period, such as hour, day, or week.

  • Limit some combination of impressions, clicks, and time periods.

full name

A combination of given name and surname–or first name and last name–for a customer. May include a middle name or initial.

-or-

A combination of given name and surname–or first name and last name–for a customer. May include a middle name or initial.

G

General Data Protection Regulation (GDPR)

The General Data Protection Regulation (GDPR) is a law that covers data protection and privacy in the European Union (EU) and the European Economic Area (EEA). It gives control to individuals over their personal data and addresses the transfer of personal data outside the EU and EEA areas. GDPR simplifies the regulatory environment for international business by unifying regulation within the EU.

gender

The gender connected with a customer.

-or-

The gender connected with a unique Amperity ID.

generational suffix

The suffix that identifies to which family generation a customer profile belongs. For example: Jr., Sr. II, and III.

-or-

The suffix that identifies to which family generation a unique Amperity ID belongs. For example: Jr., Sr. II, and III.

given name

The first name connected with a customer.

-or-

The first name connected with a unique Amperity ID.

Google Ads

Google Ads is search-based advertising that runs across the Google advertising network. Use search-based advertising to promote your brand, help sell products or services, raise awareness, and increase traffic to your website or stores.

Google Advertising ID, Google AAID

The Google advertising ID (Google AAID) is a device identifier for advertisers that anonymously tracks users on Android devices.

Google Big Query

Google BigQuery is a fully managed data warehouse that offers scalable, cost-effective, serverless software that can perform fast analysis over petabytes of data and querying using ANSI SQL.

Google Campaign Manager

Campaign Manager is a web-based ad management system for advertisers and agencies that helps you manage your digital campaigns across websites and mobile devices.

Google Chrome

Google Chrome is the recommended web browser for Amperity.

Google Cloud Storage

Google Cloud Storage is an online file storage web service for storing and accessing data on Google Cloud Platform infrastructure.

Google Customer Match

Google Customer Match uses first-party data that your customers have shared with you to enable advertising across Google–including search (Google Ads) and display advertising (Display & Video 360), the Shopping tab, Gmail, and YouTube–to your customers and to similar audiences.

Google Enhanced Conversions

Google Enhanced Conversions improves the accuracy of conversion measurement by sending hashed first-party conversion data to Google using one-way SHA-256 hashing of first-party customer data.

Google Pub/Sub

Google Pub/Sub is a low-latency messaging service that streams data–including real-time–to Google Cloud Storage.

graph database

A graph database stores relationships between data items. A graph database that has Amperity data stores relationships between customer profiles, transactions, and the Amperity ID graph.

gross sales

The total amount of sales without any deductions removed from the total.

gzip

gzip is a file format used for file compression and decompression.

H

hard conflict

A hard conflict occurs when profile values are different enough to ensure that two records should not group together. A hard conflict is most often associated with a birthdate or a generational suffix, but can associate with other combinations of profile data.

hashed email

A hashed email is an email address encrypted with a hexadecimal string of numbers and letters. This creates a unique digital signature that is always the same even when customers use that email address to log in to different browsers, website, apps, and devices.

Heap

Heap is a digital insights platform that helps you understand how and why customers engage with your product. Heap automatically collects all customer data from your site or app, then gives direction on the improvements that you can make.

hierarchical comparison

Hierarchical comparison is a step in the Stitch process that occurs after pairwise scoring. This step examines each group of records to identify edge cases. For example, married couples with overlapping profile (PII) data or children with the same name as a parent who live at the same address.

high cardinality profile attribute

A high cardinality profile attribute has a large number of distinct values. For example, phone numbers, email addresses, and postal codes. High-cardinality profile attributes with an exact or close match are useful during identity resolution.

holdout control group

See control group.

Household ID

A Household ID is a universally unique identifier (UUID) that uniquely identifies the combination of a standardized address and a last name.

householding

A household is a group of people who share a physical address and possibly other attributes. For example: a shared device or network, a shared last name, a shared phone number. Householding is a process that identifies a unique household in a dataset.

HubSpot

HubSpot is a CRM platform built around dedicated hubs for marketing, sales, services, content management, and operations.

I

Identifier for Advertisers (IDFA)

The Identifier for Advertisers (IDFA) is a random device identifier that tracks and identifies a user, but without revealing personally identifiable information (PII). Apple assigns IDFA to a user’s device for use with customized advertising.

identity provider, IdP, IDP

An identity provider (IDP) is system that creates, maintains, and manages identity information for users, and then provides claims and authentication. Common examples of IDPs include Auth0, Azure AD, Okta, and PingFederate.

identity resolution

Identity resolution is the process of connecting and matching different data points across many devices and channels to form a unified view of a single customer. This allows brands to connect the dots between fragmented data to form a complete picture of an actual person.

inactive customer

An inactive customer is one whose likelihood of returning to make another purchase is below 50%.

inbound connection

An inbound connection is a directional connection between Amperity Bridge and a lakehouse shared between production and all sandboxes.

Infutor

Infutor is a consumer identity management platform that helps brands understand their consumers and make informed marketing decisions. Infutor offers a secure, privacy compliant foundation that improves inbound engagements and outbound marketing reach and minimizes fraud and collections risk.

interaction record

An interaction record is a row in a customer data table that has information about customer behavior. For example:

  • Purchases, such as items bought, items returned, or costs of items

  • Preferences, such as brands, products, or cart adds

is cancellation?

A flag that identifies a canceled item.

is returned?

A flag that identifies a returned item.

item cost

Item cost is the cost to produce all units of an item.

item discount amount

Item discount amount is the discount amount applied to all units of a single item within a single transaction.

item discount percent

Item discount percent is the percentage discount applied to all units of a single item within a single transaction.

item list price

Item list price is the manufacturer’s suggested retail price (MSRP) for all units of this item.

item profit

Item profit represents the amount of profit earned after selling all units of an item.

item quantity

Item quantity is the total number of items in an order. For returned and canceled items, item quantity is the total number of returned or canceled items.

item revenue

The total revenue for all units of an item, after applying discounts. For returned and canceled items, the total revenue for all returned or canceled items.

item subtotal

An item subtotal is the amount for an item, before applying discounts.

item tax amount

An item tax amount is the total amount of taxes paid for purchases.

itemized transaction

An itemized transaction represents data that captures orders at the item level, with many rows per order and one order per item with identified returns and cancellations.

itemized transaction semantic

An itemized transactions semantic is a way to identify brands, channels, stores, orders, products, quantities, per-item costs, and total costs. Use itemized transactions semantics when a data source has one row per item.

J

JDBC, Java Database Connectivity

Java Database Connectivity (JDBC) uses the Java programming language to define how a client may access a database.

jitter

Jitter tracks changes to Amperity IDs across Stitch runs.

Joda-Time

Joda-Time is an open source date and time library used to establish consistency in filename patterns. The recommended pattern is Segment_Name_MM-dd-YYYY, where Segment_Name is the name of the segment and MM-dd-YYYY appends the current date.

journey

A journey is a milestone or series of milestones that a customer goes through as they experience a brand.

journey orchestration

Journey orchestration helps marketers use what they know about their customers–previous behaviors, preferences, and real-time activity–to personalize each customer’s journey and improve business outcomes.

JSON, JavaScript Object Notation

JavaScript Object Notation (JSON) is language-independent data format that is similar to JavaScript.

JWT, JSON Web Tokens

A JSON Web Token (JWT) is a compact, URL-safe means of representing claims transferred between two parties.

K

Kibo

Kibo offers solutions for customer segmentation and targeting, campaign testing and optimization, 1:1 personalization, audience insights, and customer recommendations.

Klaviyo

Klaviyo is an email platform for targeting, personalizing, measuring, and optimizing email and Facebook campaigns.

Koupon Media

Koupon Media provides a platform that enables brands and retailers to launch data-driven campaigns that combine loyalty programs and mobile offers for their most valuable customers.

Kustomer

Kustomer is a platform for customer experience, service, and support that provides a complete picture of your customer when they contact your company.

L

lakehouse

A lakehouse is an open architecture solution that combines the best elements of data lakes and data warehouses. A lakehouse uses low cost cloud storage and open formats for sharing data.

Lakehouse CDP

A Lakehouse CDP is a customer data platform that connects directly to a data lakehouse, which is an open, cross-platform solution that shares and activates data without replication. A Lakehouse CDP unifies and enriches customer data without having to copy data or build and manage connections for activation, analytics, or AI use cases.

lapse rate

Lapse rate is the percentage of customers who purchased in the previous time period, but did not purchase during the current time period.

large dataset

A large dataset is a file over 500 GB in size.

latest order date, latest order datetime

Latest order date is the date on which the customer placed their most recent order.

latest order ID

Latest order ID is the order ID for a customer’s most recent order.

latest order revenue

Latest order revenue is the total revenue connected with a customer’s most recent order, ignoring returned and canceled items.

latest order total items

Latest order total items is the number of items purchased in a customer’s most recent order, ignoring returned and canceled items.

lifetime customer value (LCV)

See customer lifetime value.

lifetime average item price

The average individual item price for all orders, ignoring returns and cancellations.

lifetime average number of items

The average number of items purchased for all orders, ignoring returns and cancellations.

lifetime average order value

The average lifetime revenue for all orders, ignoring returns and cancellations.

lifetime largest order value

Lifetime largest order value identifies the largest order connected with a customer, ignoring returns and cancellations, for a customer’s entire purchase history.

lifetime order frequency

A lifetime order frequency is the total number of orders that a customer has made during their entire relationship with your brand.

lifetime order revenue

The lifetime revenue for all items, ignoring returns and cancellations.

lifetime total items

The lifetime number of individual items associated with the transaction, ignoring returns and cancellations.

lifetime value (LTV)

See customer lifetime value.

linkage tables

A linkage table is a custom domain table that traces records back to their corresponding source table records before applying privacy compliance actions.

list price

A list price is the manufacturer’s suggested retail price (MSRP) for all units of an item, before shipping, tax, or applying discount amounts. A list price represents a single unit of the item purchased.

Listrak

Listrak enables personalized cross-channel interactions that help automate campaigns, build customer loyalty, and increase conversion rates.

LiveRamp

LiveRamp allows clients to combine customer data from various online and offline sources, centering around the use of web cookies that allow websites to remember visitors.

load operation

A load operation defines the fileset for a courier. Specify each file in the fileset by name, format, and feed, and then assign a load operation.

location

A location is a unique combination of address, address2, city, state, and postal details for a customer.

-or-

A location is a unique combination of address, address2, city, state, and postal details for a unique Amperity ID.

lookalike audience,

A lookalike audience is an audience with a similar profile to an existing audience, such as the same demographics, shared interests, or similar product preferences and purchase histories.

Looker,

Looker is an enterprise platform for business intelligence, data applications, and embedded analytics.

lookup table

A lookup table provides access to customer-specific lists or categories when those lists or categories are long and dynamic. For example: a complete list of airport names with their corresponding airport code.

loyalty ID

The identifier for a loyalty program connected with a customer.

-or-

The identifier for a loyalty program connected with a unique Amperity ID.

M

Magento, Adobe Magento

See Adobe Commerce.

matchback

A matchback is a process where a mail file–associated with a purchase credit–matches back to untracked sales.

Mailchimp

Mailchimp is a cloud-based marketing automation platform and an email marketing service with an API that integrates third-party systems and a web UI that manages email contacts, templates, and lists.

manual courier

A manual courier is an active courier that is not configured to run as part of a courier group. Run this courier manually from the Sources page.

manual segment

A manual segment is an active segment that is not configured to run as part of a scheduled workflow. Run a segment manually from the Segments page.

master data management (MDM)

Master data management (MDM) is an approach to data management that uses the Amperity ID to link all unique identifiers across all data sources to a common identifier.

match category

A match category is a classifier that applies to an individual record-pair within a cluster of record-pairs. The match category is the result of this classification.

Match Category

Description

Exact

Amperity has the highest confidence that these records represent the same person because all profile data exactly matches.

Excellent

Amperity has near perfect confidence that these records belong to the same person, despite select types of profile data not matching.

High

Using deductive reasoning, Amperity has high confidence that these records match, despite some profile data not matching.

Moderate

Amperity has moderate confidence that these records match, due to weak or fuzzy matches between unique customer attributes, such as email, phone, or address.

Weak

Amperity lacks confidence, but if asked to guess, Amperity would assert these records do belong to the same individual, because they match on non-unique customer attributes, such as name, state, zip code.

Non-match

Amperity has high confidence that these records do NOT match, because core profile data is in conflict.

match rate

Match rate is the number of records sent from Amperity that matched with records in a downstream marketing application, shown as a percentage. For example, an audience of 1000 email addresses that match with 765 email addresses has match rate of 76.5%.

match type

The score. Possible values: “scored”, “scored_transitive”, and “trivial_duplicate”.

MDM table

A master data management (MDM) table has a record of each unique identifier for every data source in a single table that associates all unique IDs to the Amperity ID.

Merged Customers

The Merged Customers table collects rows from the Unified Coalesced table, and then collapses rows into a single row per Amperity ID.

Merged Households

The Merged Households table applies address-based householding and address standardization to the output of the Merged Customers table and adds a column for household ID. Use the Merged Households table to improve campaigns that send offers to shared physical addresses, such as direct mail campaigns.

Meta Ads Manager

Meta Ads Manager is a unified ad creation tool that your brand can use to create and publish ads to Facebook, Messenger, Instagram and the Meta Audience Network.

metrics

Metrics are measurements that help you understand performance across two broad categories: customers and revenue. Customer metrics track how your customers change over time. Revenue metrics show how your customer spend money, including by category, by product, and by segment.

Metrics tab

The Metrics page shows individual metrics as a card. Each card provides an overview of the current data that is available for a single metric. The list of metrics cards is configurable. Amperity provides a set of out-of-the-box cards, including new purchaser trends, purchase activity summary, revenue by historical lifetime spend, revenue summary, transacting purchasers, and weekly revenue trends.

Microsoft Advertising

Microsoft Advertising is a pay-per-click advertising platform that displays ads based on keywords in a user’s search query.

Microsoft Dataverse

Microsoft Dataverse is secure, cloud-based table storage for business applications. For example: Microsoft 365, Microsoft Azure, Dynamics 365, and the Microsoft Power Platform: Power BI, Power Apps, Power Automate, Power Virtual Agents, and Power Pages.

Microsoft Dynamics

Microsoft Dynamics CRM is a customer relationship management (CRM) software application that supports sales, marketing, and services. It is part of the Microsoft Dynamics family of business applications.

Microsoft Power BI

Microsoft Power BI is a collection of software services, applications, and connectors that work together to turn unrelated sources of data into coherent, visually immersive, and interactive insights.

mobile interactions table

A mobile interactions table is represents the history a customer has with mobile applications associated with a brand, such as:

  • Opens

  • Page views

  • Items added to a cart

  • Items purchased

MoEngage

MoEngage is a cross-channel customer engagement platform built for marketers and product owners who value agility over complexity. Use real-time insights and personalized cross-channel communications and adapt to evolving customer expectations.

Monetate

See Kibo

Mosaic USA (Experian)

Mosaic USA is a consumer segmentation system from Experian that classifies United States households into 71 unique types and 19 groups .

MSRP, manufacturer’s suggested retail price

The manufacturer’s suggested retail price (MSRP) is the price before applying shipping costs, taxes, and discounts. MSRP is sometimes referred to as the base price.

multi-brand

A flag that indicates if a customer has interacted with more than one brand.

multichannel

Multi-purchase channel is a flag that indicates if a customer has purchased from more than one channel.

multichannel campaign

A multichannel marketing campaign blends marketing strategies across channels like email, SMS, and paid media to offer customers a personalized experience, but with consistent messages, offers, and products across all channels.

multichannel customer

A multichannel customer shows activity in at least two channels during the last measured time period.

multichannel marketing

A multichannel marketing strategy blends distribution channels–retail storefronts, websites, and catalogues–with promotional channels to make it easier for customers to purchase.

Multi-touch-attribution

A multi-touch attribution identifies each marketing channel that the customer has interacted while making a purchase.

N

National Change of Address, NCOA

National Change of Address (NCOA) is a secure dataset of about 160 million permanent change-of-address (COA) records. NCOA records consist of the names and addresses of individuals, families, and businesses who have filed a change-of-address with the United States Postal Service (USPS).

newline-delimited JSON, NDJSON

Newline-delimited JSON (NDJSON) is a data format for structured data that defines the structure of JSON data using lines as separators. Each line in a NDJSON file is a valid JSON value.

net order revenue

Net order revenue is the revenue for all items in an order, including returned and canceled items, but not including discounted amounts.

net sales

The total amount of the gross sales minus deductions, such as returns, discounts, or allowances, from the total.

Neustar

Neustar–a TransUnion company–provides real-time information and analytics for the Internet, risk, digital performance, and defense, telecommunications, entertainment, and marketing industries. Neustar also provides clearinghouse and directory services to the global communications and Internet industries.

nicknames

A nickname is a variation of a customer’s given name that should not affect merging of customer profiles during identity resolution. For example: Mike is a nickname for Michael, Sue is a nickname for Susan, Tom and Tommy are nicknames for Thomas, Vicki and Vickie are nicknames for Victoria.

notifications

See recent activity.

O

OAuth, Open Authorization

OAuth is an open standard for access delegation, commonly used to grant websites or applications access to information on other websites.

ODBC, Open Database Connectivity

Open Database Connectivity (ODBC) uses a driver-based system to define how any client may access any database.

offline conversions

An offline conversion is an offline event or activity where a customer takes a desired action on an add platform, mobile app, or website.

offline conversions dataset

An offline conversion dataset is an offline event or activity that occurs on an add platform, website, or in-store. Brands can upload this data to a marketing campaign to measure customer offline activity.

offline event

An offline event occurs when a customer takes actions on a brand’s website, mobile app, or in an ad platform.

omni-channel marketing

Omnichannel marketing blends online and offline marketing strategies to offer a consistent and personalized brand experience across channels, such as website, social media, mobile devices, and in store.

one-time campaign

A one-time campaign represents a specific campaign message sent only once.

one and done

One and done is a flag that indicates if a customer has made only one purchase.

Oracle Business Analytics

Oracle Business Analytics is a desktop BI tool for data exploration and visualization.

Oracle Data Cloud

Oracle Data Cloud is a set of solutions that enable agencies, brands, media platforms, and publishers to use data to drive better business outcomes.

Oracle Data Management Platform (DMP)

Oracle Data Management Platform (DMP), formerly known as BlueKai, is a cloud-based big data platform for personalizing online, offline, and mobile marketing campaigns with more useful information about targeted audiences.

Oracle OPERA

Oracle OPERA is a property management solution that consolidates reservations, loyalty, sales and catering systems, and guest services into a single platform.

ORC, Optimized Row Columnar (ORC)

Optimized Row Columnar (ORC) is columnar-storage file format similar to Parquet. It is the file format that is the Stitch process outputs to the customer 360 database.

orchestration

An orchestration defines the relationship between query results and a destination, including the destination and the frequency at which an orchestration runs.

orchestration group

An orchestration group is one or more orchestrations scheduled using a cron expression string to define schedule frequency. For example, schedule an orchestration group to run at 8:30 AM every day of the week: 30 8 * * *.

order canceled quantity

The total number of cancelled items in the original transaction.

order canceled revenue

The total amount of revenue for all canceled items in the transaction.

order cost

Order cost represents the total cost of goods sold (COGS) for a single transaction, minus returns, cancellations, and discounts.

order date, order datetime

Order datetime is the date and time on which a customer places an order.

order discount amount

Order discount amount is the total discount amount applied to the entire order.

order discount percent

Order discount percent is the percentage discount applied to the order value for the entire transaction, along with any item-specific or unit-specific discount percentages.

order ID

An order ID is the unique identifier for the order. It links together all items in the same transaction. For returns and cancellations, the order ID is the unique identifier for the original order, including returned or canceled items.

order list price

Order list price is the total value for a transaction, as defined by the manufacturer’s suggested retail price (MSRP) for all units of this item.

order profit

Order profit is the amount of profit earned from a single transaction.

order quantity

Order quantity is the total number of individual items in the transaction.

order returned quantity

Order returned quantity is the total number of returned items in the original transaction.

order returned revenue

Order returned revenue is the total amount of revenue for all returned items.

order revenue

The total amount of revenue for all items in a transaction after discounts and ignoring returns or cancellations.

order shipping amount

The order shipping amount is the total cost of shipping all items in the same transaction.

order subtotal

An order subtotal is the amount for an order, before applying discounts.

order tax amount

An order tax amount is the total amount of taxes applied to the entire order.

OTT, Over-the-top

Over-the-top (OTT) refers to physical devices on which consumers stream video and music to their Smart TVs, mobile phones, tablets, and desktop computers using an app or a website. Examples of OTT devices include Netflix, Disney+, Paramount+, Apple TV, Hulu, and more.

Outbrain

Outbrain is a native advertising platform using targeted advertising to recommend articles, slideshows, blog posts, photos or videos to a reader.

outbound connection

An outbound connection is a directional connection between Amperity Bridge and a lakehouse shared between production and all sandboxes.

overcluster, overclustering

An overcluster–a false positive–occurs when distinct records are incorrectly added to a cluster of records. Each overcluster affects the precision of identity resolution. Investigate to understand why they occur.

P

paid channels

A paid channel allows a brand to target advertising using demographics, keywords, and intent. A paid marketing campaign has the benefit of already knowing who the customers are and leads to better campaigns and better results.

pairwise comparison

A pairwise comparison is a process that compares, and then scores all possible connections between all records in a group of records.

pairwise connection

A pairwise connection is a pair of matching records within a block that have an initial score above threshold. All pairwise connections that score above threshold represent a single, unique individual.

The data explorer, showing pairwise connections.

See cluster graph and stitched records.

pairwise connection score

Every pairwise connection has a score with two parts separated by a period.

The first part–the record pair score–correlates to the match category, which is a machine learning classifier applied by Amperity to individual record pairs. The record pair score corresponds to the classification:

  • 5 for exact matches

  • 4 for excellent matches

  • 3 for high matches

  • 2 for moderate matches

  • 1 for weak matches

  • 0 for non-matches

Stitch uses the second part–the record pair strength–to show the quality of the record pair score. This value appears in the Stitch report as a two decimal number. A record pair strength by itself is not a direct indicator of the quality of a pairwise connection score.

Panda Printing

Panda Printing creates high-quality print and promotional products for businesses of all sizes. If you can think of it, they can print it.

passthrough 360 table

A passthrough table adds a domain table to your database. Passthrough domain tables often have a field for the Amperity ID, but otherwise are the same as the fields in the source domain table.

pCLV

Predicted customer lifetime value predicts the total value of all orders a customer is likely to make if they return to make another purchase during the next 365 days.

pCLV index score

A predicted customer lifetime value (predicted CLV) index score is the comparison between customers associated with a given segment to customers associated with everything else.

For example, an index score of 5x for orders indicates that the average customer in the segment places 5x orders when compared to customers who are not in the segment.

PebblePost

PebblePost offers a full-stack solution that leverages scaled proprietary data, algorithmic optimization, advanced measurement and analytics, along with a print and mail logistics network.

people-based measurement

People-based measurement is an approach to marketing that measures incremental changes in customer behavior. For example, did a campaign increase customer lifetime value, how much value did this campaign add when part of a coordinated campaign, or did a campaign meet revenue goals and expectations?

Persado

Persado is an AI-driven content generation platform that uses segment data to personalize words, concepts, and ideas. Use Persado to increase purchases, loyalty registrations, automatic payment sign-ups, and self-service enrollments.

personally identifiable information (PII)

Personally identifiable information (PII) is any data that could identify a specific individual. PII data includes details like names, addresses, email addresses, and other profile attributes. PII data sometimes includes attributes like a loyalty number, customer relationship management (CRM) system identifiers, or foreign keys in customer data.

PGP, Pretty Good Privacy

Pretty Good Privacy (PGP) is an encryption program that provides cryptographic privacy and authentication for data communication by signing, encrypting, and decrypting data files and formats. Amperity supports PGP encryption.

phone

The phone number connected with a customer. A customer may have more than one phone number.

-or-

The phone number connected with a unique Amperity ID. A customer profile may have more than one phone number.

PII semantic

A PII semantic assigns consistency to customer data to ensure that PII data is more discoverable across datasets.

PII table

A PII table collects fields from the Unified Coalesced table that contain PII data and standardizes PII values. Build queries against any combination of PII data to build audiences using the visual Segment Editor.

Pinterest

Pinterest is a visual discovery engine on which your brand can engage with your customers. Send a list of products that are in your product catalog to Pinterest to enable pins, and then send a list of customers with whom your brand wants to engage.

PIPEDA, Personal Information Protection and Electronic Documents Act

The Personal Information Protection and Electronic Documents Act (PIPEDA) is a Canadian federal privacy law for how private sector organizations collect, use, and share personal information.

policy

A policy represents a set of actions that are available to a user when assigned to that policy.

postal

The zip code or postal code connected with the location of a customer.

-or-

The zip code or postal code connected with the location of a unique Amperity ID.

Postman

Postman is a collaboration platform for API development that enables support for sending data to Amperity using the Streaming Ingest REST API.

PowerReviews

PowerReviews is a user-generated content vendor that helps brands and retailers grow their business. Better quality customer product ratings lead to improved product quality and customer experiences.

precision

Precision is the relation between true positives and the total number of true positives and false positives. A decrease in overclustering increases precision.

predicted average order revenue

Predicted average order revenue predicts the average value of each order a customer is likely to make if they return to make another purchase during the next 365 days.

predicted customer lifetime value (pCLV)

See pCLV.

predicted order frequency

Predicted order frequency predicts the number of orders a customer is likely to make if they return to make another purchase during the next 365 days.

predicted probability of transaction

Predicted probability of transaction represents the likelihood that a customer returns to make another purchase during the next 365 days.

See p(return).

predictive attribute

Each predictive model outputs a collection of predictive attributes. Use predictive attributes to define segments that filter customers based on their predicted behaviors.

predictive models

Predictive models are a feature of Amperity that predicts customer behavior, such as customer lifetime value, churn propensity, product affinity, and lifecycle events.

Predicted CLV Attributes

The Predicted CLV Attributes table has information, for each individual Amperity ID, about predicted future spend and the probability of churn.

Presto SQL

See Trino.

p(return)

p(return) is a probabilistic score that predicts churn likelihood and represents how likely is it for an individual customer to purchase in the next year.

Amperity calculates this score using various factors, such as the customer’s historical purchase behavior and engagement levels.

primary key, fk

A primary key is a column in a data table that uniquely identifies each row in a data source or data table.

product affinity

Product affinity is a predictive model that identifies which customers are likely to purchase by using a combination of historical purchase data and lookalike audiences. The predicted affinity model outputs a ranked list of customers with three recommended audience sizes.

product ID

The unique identifier for a product.

product base ID

The unique identifier for a group of products that share a set of baseline attributes. This identifier does not include any SKU details.

product catalog

A product catalog represents data that describes purchased items, including unique identifiers, stock keeping units, or SKUs, along with other product attributes.

product recommendation

A product recommendation predicts which products a given customer is most likely to purchase.

Profile API

The Profile API is a RESTful API that supports building custom endpoints against unified customer profiles to enable any downstream workflow.

PSV, pipe-separated values

A pipe-separated values (PSV) file is a delimited text file that uses a pipe to separate values. A PSV file stores tabular data in plain text. Each line of the file is a data record. Each record consists of one or more fields, separated by pipes. The use of the pipe as a field separator is the source of the name for this file format.

purchase brand

The brand purchased by customer in a transaction.

purchase channel

A purchase channel is the channel in which a customer makes a transaction. For example: in-store or online.

purchase curve

A purchase curve represents the relationship between purchases and audience size. A purchase curve adds customers, but flattens as audience sizes grow larger.

The purchase curve and audience sizes.
purchases table

A purchases table is an interaction record that has a history of purchases made by customers, including details about goods and services, such as:

  • Order quantity

  • Itemized order value

  • Date of purchase

  • Purchase location

Q

QA database table

A QA database table is output by Stitch for use with the Stitch QA process.

QA tables include:

  • Detailed Examples

  • Stitch Blocking Keys

  • Stitch Scores

  • Unified Changes Clusters

  • Unified Changes PKS

  • Unified Coalesced

  • Unified Preprocessed Raw

  • Unified Scores

Quad

Quad is a marketing solutions provider that combines a strong print foundation with an integrated marketing platform to help marketers improve spend efficiency across offline and online media channels.

Qlik Sense

Qlik Sense is a cloud-based, self-service BI tool. Use it to explore data in any direction, extracting data by making selections, and viewing filtered data and associated items.

qualified trivial duplicate

A qualified trivial duplicate is a set of records with enough matching PII to score 3.0 or greater and group together.

See trivial duplicate.

Queries (page)

The Queries page provides the interface for users to build, define, and manage a list of queries.

query

A query is SQL that runs from the Queries page against database tables in the Customer 360 page. A query returns a refined and filtered subset of useful customer data.

query folder

A folder helps you organize the list of segments in the Queries page. You may configure up to three levels of folders.

R

raw data

Amperity loads data in raw form preserving the original schema as well as identifiers from all systems. Examples of identifiers include loyalty numbers, ecommerce IDs, cookies, or any other identifier mapped to a customer.

raw UID2 values

Raw UID2 values are unencrypted alphanumeric identifiers that are created from a user’s email address. Raw UID2 values are encrypted to create a UID2 token. Raw UID2 values, and their associated UID2 tokens, are case sensitive and are unique.

RCFile, Record Columnar File

Record Columnar File (RCFile) is a data placement structure that determines how to store relational tables on computer clusters using the MapReduce framework.

Real-time Profile API

See Profile API.

recall

Recall is the relation between true positives to the total number of true positives and false negatives. A false negative is an incorrect split. An increase in underclustering decreases recall.

recent activity

The recent activity panel appears on the right side of the Sources, Stitch, Customer 360, Queries, Segments, Campaigns, and Destinations pages and updates every time a workflow runs within your tenant.

recipient group

See treatment group.

record count

Record count is the total number of records, or rows, within a given dataset. This could include a source feed or Amperity table, or, it could describe the number of records that result from a probabilistic or deterministic connection between two Amperity tables.

record pair

The Stitch process takes customer tables, extracts the customer profiles and compares pairs of records together using over 40 different machine learning models. Each record pair score represents the strength of the match.

record pair score

The record pair score correlates to the match category, which is a classifier applied by Amperity to individual record pairs. The record pair score corresponds to the classification:

  • 5 for exact matches

  • 4 for excellent matches

  • 3 for high matches

  • 2 for moderate matches

  • 1 for weak matches

  • 0 for no matches

record pair strength

The record pair strength represents the strength of the record pair score. It is a two digit number. For example: .31 is a lower strength and .93 is a higher strength.

record type

A record is a row of data. A record is one of the following record types: customer profile or interaction record.

records

Records are individual rows of data, synonymous to individual rows in relational databases. Each record is a collection of columns that store values like first name, email address, or last purchase. In Amperity terms, a record is a row of data that refers to a single customer, a visitor, a transaction, an interaction, or behavior.

records ingested

Records ingested is a static value that tracks the total number of non-duplicate records added to Amperity during the previous week. Each record is an individual row of data from any customer data source, including both customer profiles and interaction records.

records sent

Records sent is a statistic that tracks the total number of records sent by Amperity to all destinations during the previous week. Each record is an individual row of data returned by a segment.

rectangular data

Rectangular data is a structured dataset with defined rows and columns, such as a SQL table output to an Amazon S3 bucket or Azure storage.

recurring campaign

A recurring campaign automatically sends an updated or refreshed audience with a predefined campaign message and cadence for a list of recipients. For example, a state change, an accepted return, a change to loyalty status, or an alert based on credit card status.

Reddit, Reddit Ads

Reddit is home to unique communities, engaged conversations, and the best memes. Reddit Ads allows brands to find their community on Reddit, and then engage with your customers within the 100K+ active communities on Reddit using targeted ads and promoted posts.

request datasource

An identifier that represents the first grouping of records done by Stitch. This grouping relies on semantic patterns.

request strategy

The compliance request strategy used for matching exact email data, semantic tags, and Amperity IDs. Possible values: exact or connected_pii.

request type

The compliance request. Possible values: delete, delete_pii, or dsar.

resource group

A resource group represents one or more databases in the Customer 360 tab. Users with access to a resource group can build queries and segments against that database and can send data from that database to downstream workflows.

retention rate

Retention rate is the percentage of customers who continue to purchase your products and services over a given time period.

RFM (customer value)

RFM is a method used to analyze customer value that represents three dimensions:

  • Recency. How recently did the customer purchase?

  • Frequency. How often does the customer purchase?

  • Monetary. How much does the customer spend?

Individual scores for recency, frequency, and monetary have a scale of 1-10. Combined RFM scores exist on a scale. “111” represents the lowest possible RFM score. “101010” represents the highest possible RFM score, or “three ‘10’ scores”. Higher scores represent preferred behaviors.

RR Donnelly

RR Donnelly provides marketing and business communications, commercial printing, and related services.

RSA (cryptography)

RSA is a cryptographic system for generating public and private key pairs. RSA secures data transmissions between Amperity and SFTP. Use the public key to encrypt data. Use the private key to decrypt data.

S

S3

See Amazon S3.

Sailthru by Marigold

Sailthru by Marigold is a cross-channel marketing platform built to deliver personalized experiences for consumers across email, web, and mobile channels.

Salesforce Commerce Cloud

Salesforce Commerce Cloud​ is a multi-tenant, cloud-based commerce platform that enables brands to create intelligent, unified buying experiences across all channels.

Salesforce Interaction Studio

Salesforce Interaction Studio is a real-time personalization and interaction management solution.

Salesforce Marketing Cloud

Salesforce Marketing Cloud is a provider of digital marketing automation and analytics software and services.

Salesforce Marketing Cloud Personalization

Salesforce Marketing Cloud Personalization is a real-time interaction management tool that helps your brand visualize, track, and manage customer experiences.

Salesforce Pardot

Pardot is a marketing automation solution that helps your brand engage buyers, grow relationships, and close deals.

Salesforce Sales Cloud

SalesForce Sales Cloud brings customer information together into an integrated platform, and then provides access to thousands of applications through the AppExchange.

SAML

Security Assertion Markup Language (SAML) is a protocol used to share authentication and authorization information between an identity provider (IDP) and security provider (SP).

sandbox

A sandbox is a copy of your production tenant in which you can make configuration changes, verify the results, and then promote those changes to production.

SAP Business Objects

SAP Business Objects is a centralized BI tool for data reporting, visualization, and sharing and is the on-premise BI layer for SAP’s Business Technology Platform.

SAS Cloud Analytic Services

SAS Cloud Analytic Services is a cloud-based runtime environment for data management and analytics with SAS.

saved query

A saved query is a SQL statement that runs against source data before loading the results to a domain table. A saved query uses Spark SQL syntax.

second order date, second order datetime

Second order date is the date on which the customer placed their second order.

second order ID

Second order ID is the order ID for a customer’s second order.

second order revenue

Second order revenue is the total revenue connected with a customer’s second order, ignoring returned and canceled items.

second order total items

Second order total items is the number of items purchased in a customer’s second order, ignoring returned and canceled items.

second-party data

Second-party data is first-party data shared by the company that owns it and has the right to share this data with another brand or company. Second-party data can be a good approach to audience expansion, especially when both datasets fit a similar customer profile. Governance, legal concerns, and privacy laws are more likely to affect how your brand can share second-party data.

Security Assertion Markup Language

See SAML.

segment

A segment defines a specific attribute profile for use with a marketing campaign. The list of matching customers is the audience.

segment folder

A folder helps you organize the list of segments in the Segments page. Add up to three levels of folders.

Segments (page)

The Segments page provides the interface for building, defining, and managing a list of segments.

The Segments page in Amperity.
semantic

A semantic is a way to apply a common understanding to individual points of data across data sources, even when data sources have different schemas, naming conventions, and levels of data quality.

semantic tag

A semantic tag standardizes profile (PII), transaction, and other important customer details across all columns in all data tables.

semantic type

A semantic group is directly associated with data values that appear in customer data tables. Semantic groups exist for columns that contain values like first names, email addresses, home addresses, cities, and phone numbers. Amperity has many built-in semantic groups, including for personally identifiable information (PII), transactions, itemized transactions, and other consumer behaviors.

semi-structured data

Semi-structured data is not shaped as rows and columns, but still has elements that separates the data into a clear hierarchy. Amperity supports many types of semi-structured data formats, including CSV, TSV, Parquet, ORC, and JSON.

separation key, sk

A separation key, or “sk”, is for deterministic unmatching of records.

Unmatched unmatched Unmatching unmatching

service provider, SP

A service provider (SP) is a system that accepts authentication responses from an identity provider (IDP), and then provides access to a service. Amperity is the service provider in this context.

SFTP

The Secure File Transfer Protocol (SFTP) is a network protocol that provides file access, file transfer, and file management over any reliable data stream.

shared access signature (SAS)

A shared access signature (SAS) is a URI that grants restricted access rights to Azure Storage resources.

shared dataset

A shared dataset represents all databases and database tables configured for outbound sharing with another organization.

Shopify

Shopify is an ecommerce platform for online stores and retail point-of-sale systems that consolidates shopping, payments, marketing, shipping, and customer engagement tools into a single unified experience.

single sign-on

See SSO.

single-channel customer

A single-channel customer shows activity in a single channel during the last measured time period.

site interactions table

A site interactions table represents the history a customer has with a brand through websites, such as:

  • Site visits

  • Page views

  • Items added to a cart

  • Items purchased

  • Conversions

SKU, stock keeping unit

A stock keeping unit (SKU) is an identifier that captures unique details for individual products, including specific attributes to differentiate by color, size, material, and other product details.

slice

Do not use the term “slice” when referring to data sent from Amperity to external locations. Use segment or database export instead.

See segment.

SMG, Service Management Group

Service Management Group (SMG) helps brands boost loyalty and improve business performance by combining technology and differentiated, strategic services to collect, analyze, and share feedback and behavioral data.

SMS Opt Status

The SMS Opt Status table has a row for each unique combination of phone number, brand, region, and SMS program.

Snapchat

Snapchat is a social media platform for accessing pictures and messages for a short time. Snapchat offers a self-serve platform from which brands can manage ads and advertising campaigns.

SnapPass

SnapPass allows sharing secrets in a secure, ephemeral way. Input a single or multi-line secret, define how long someone has to view the secret, and then generate a single-use URL. Share the URL to share the secret.

snapshot email

A snapshot email sent on a weekly basis from Amperity to give an overview of your customer data.

Snappy

Snappy is a compression library aims for high speeds and reasonable compression. Files compressed with snappy tend to be larger, but the process to compress and decompress them is faster.

Snowflake

Snowflake is an analytic data warehouse that is fast and flexible. Snowflake uses a SQL database engine designed for the cloud.

SoundCommerce

SoundCommerce is a data platform for consumer brands and retailers that spans marketing, merchandising, supply, fulfillment, delivery, and customer service.

source ID

Source IDs are unique identifiers that exist within each of your upstream data sources, including customer databases, transactional databases, or behavioral data assets. Amperity measures Stitch quality by comparing the number of source IDs to the final number of Amperity IDs. This shows the degree to with Amperity matched and reconciled data when building customer profiles.

Sources tab

The Sources page provides the interface for defining how customer data sources send data to Amperity, and then how Amperity prepares this data for processing.

The Sources page in Amperity.
Spark SQL

Spark SQL is a high performance SQL query engine used by Amperity to ingest data, create domain tables, and extend the outcome of the Stitch process within your customer profile database.

sparse customer record

A sparse customer profile is an incomplete set of personally identifiable information (PII) that can still identify a unique individual when compared against rich sets of PII data. For example, a dataset with sparse customer profiles may only contain last names, the last 4 digits of credit cards, and store zip codes associated with the transaction.

Square

Square is a platform for managing online stores, payments, and transactions.

SQL 360 table

Add a SQL table to a database using Spark SQL and a SELECT statement that defines the table’s schema.

SQL Query Editor

The SQL Query Editor is the user interface for a full SQL query engine based on Presto SQL that interacts with customer database tables in Amperity. The SQL Query Editor relies primarily on using the SELECT statement, along with common table expressions, joins, functions, and other parts of Presto SQL to build and design advanced queries.

The SQL Queries Editor located within the Segments page in Amperity.
SQL Segment Editor

The SQL Segment Editor is an optional interface for building attribute profiles using SQL. Start with a SELECT statement that returns the Amperity ID, and then apply a series of WHERE statements to define one or more attribute groups that match specific conditions and values.

SQL Workbench

SQL Workbench is a cross-platform SQL query tool that runs on any operating system.

SSO

Single sign-on allows a user to use a single ID and password to access many applications.

SSO domain

An SSO domain is a defined customer domain that maps to the identity provider used for single sign-on authentication to Amperity. For example: socktown.com for all users of the Socktown tenant.

standard core table

Standard core tables contain the results of the identity resolution process for your tenant.

standard database table

A standard database table forms the foundation of your databases. Standard tables are output by Stitch and offer unified data structures for all data sources.

Standard tables include:

  • Campaign Recipients

  • Customer 360

  • Customer Attributes

  • Email Engagement Attributes

  • Email Engagement Summary

  • Lifecycle Events

  • Merged Customers

  • Transaction Attributes Extended

  • Unified Customer

  • Unified Coalesced

  • Unified Email Events

  • Unified Itemized Transactions

  • Unified Product Catalog

  • Unified Scores

  • Unified Transactions

along with predictive modeling tables, such as Predicted CLV Attributes.

state

The state or province connected with the location of a customer.

-or-

The state or province connected with the location of a unique Amperity ID.

static sources table

Use a static sources table for a large volume of data that is not updated on a regular basis or remains unchanged over longer time periods.

Stitch

Stitch uses patented algorithms to process massive volumes of data and discover the hidden connections in your customer profiles that identify unique individuals. Stitch outputs a unified collection of data that assigns a unique identifier to each unique customer profile.

Stitch_Blocking_Keys

The Stitch Blocking Keys table has all blocking keys used during the Stitch process.

Stitch_BlocklistValues

The Stitch BlocklistValues table is a passthrough table for values identified by the bad-values blocklist, and then excluded from the Stitch process.

Stitch_Scores

The Stitch Scores table has all scores generated by Stitch, including scores that are not associated with an Amperity ID. Use this table to help understand why records are not associated with an Amperity ID.

Stitch tab

The Stitch page shows detailed results of the Stitch process, which takes customer data, extracts customer profiles, and then compares record pairs using over 40 different machine learning models. Each score represents the strength of the match. Amperity creates clusters of records based on the connection between pairs, and then gives each cluster a unique Amperity ID.

The Stitch page in Amperity.
Stitch QA

Stitch QA is a process that monitors the quality of Stitch results. Stitch QA has two parts: a database and a set of queries. Analyze the results to identify values for labeling, blocklisting, or to discover ways to tune the Stitch process to better match your tenant’s dataset.

Stitch QA tables

Stitch generates a series of Stitch QA. Use these tables in a dedicated database to review the quality of the identity resolution process and understand how the Amperity ID gets assigned to each customer profile.

See stitched domain tables and unified tables.

stitch-labels

Stitch labels identify when

  • A single customer profile is incorrectly merged together, or overclustered.

  • Two customer profiles are incorrectly split apart, or underclustered.

stitched domain table

A stitched domain table exists for each domain table with:

  • Applied semantic tags

  • An applied foreign key that allows Amperity to link records in the domain table to an Amperity ID that exists in a standard core table

See Stitch QA tables and unified tables.

stitched records

A stitched record is a unique output of the Stitch process that associates the Amperity ID to a customer.

The data explorer, showing stitched records.

See cluster graph and pairwise connection.

stitched table

A stitched table is a database table that is output by the Stitch process. The list of stitched tables created when PII semantics are present include Unified Coalesced, Unified Scores, Unified Changes Clusters, and Unified Changes PKs.

Streaming Ingest API

The Streaming Ingest API is a tenant-specific API that supports real-time streaming of data from external systems to Amperity.

streaming JSON

Streaming JSON is a way to send increments of data using NDJSON formatting within each increment. Each line in a NDJSON file is a valid JSON value.

structured data

Structured data adheres to a data model or schema. Amperity requires structured data to be in a table format with columns and rows. Amperity supports many types of structured data formats, including database tables in various formats and other forms of rectangular data.

sub-audience

A sub-audience is a list of customers who receive a specific communication related to a marketing campaign.

sum of discount amounts

The sum of discount amounts is the total of all discount amounts applied to each item within a transaction.

sum of itemized revenue

The sum of itemized revenue for the original order, not including returns or cancellations.

supersized cluster

A supersized cluster is a cluster of records with more than 64 matching records. A supersized cluster does not typically represent a unique individual and is not worthy of further analysis.

suppression table

A suppression table associates incoming requests to

  • Delete records

  • Respond to data subject access requests (DSAR)

with user records in the customer 360 database. User records in a suppression table are not usable in downstream workflows until after Amperity deletes all matching PII data.

surname

The last name connected with a customer.

-or-

The last name connected with a unique Amperity ID.

T

table

See data table.

Tableau

Tableau is a visual analytics platform that enables people and organizations to make the most of their data. Tableau connects to a data source, and then queries that data directly.

taxonomy

A taxonomy defines a formal structure for classes of objects. For example, a product taxonomy defines product categories, and then organizes individual product types within each category or subcategory.

temporary table

A temporary table is a table that exists within the customer 360 database that stores intermediate results, after which results are available to other tables in the customer 360 database.

tenant

A tenant represents everything within the Amperity platform as provided to a single customer, including:

  • Processes that pull data from customer data sources

  • Processes that send data to destinations

  • Sandboxes

A tenant runs fully within an environment located in Amazon AWS or Azure.

third-party data

Third-party data is a broad category that includes, but is not limited to, data collected by an entity, and then sold or distributed to companies that want to leverage that data. The usefulness of third-party data is often restricted by a combination of lower data quality, privacy laws, and general commercial interest.

Important

Amperity does not sell third-party data or mark it up in any way.

Throtle

Throtle provides brands and marketers a complete view of their customers and accurate targeting across all devices and channels. Target customers using connected TVs, cookieless identities, and mobile advertising IDs (MAIDs), such as the Identifier for Advertising (IDFA) from Apple and the Google Advertising ID (GAID) from Google.

TikTok Ads

TikTok is the world’s leading destination for short-form mobile videos. TikTok’s mission is to capture and present the world’s creativity, knowledge, and moments that matter in everyday life.

title

The title that precedes a full name connected with a customer, such as “Mr”, “Mrs”, and “Dr”.

-or-

The title that precedes a full name connected with a unique Amperity ID. For example: Mr, Mrs, and Dr.

token

A token is a time-limited bundle of authentication and authorization information used to access Amperity.

total discount amount

The total discount amount for the entire transaction, including item-specific discounts. A total discount amount should equal the order-level discount amount plus the sum of all item-level discount amounts for the same order ID.

trade-area append (TAA)

Trade-area append (TAA) is a process that uses names, zip codes, and credit card hashes to identify groups of users for online and offline marketing campaigns.

transaction

A transaction represents data that captures orders at the order level, with a summary row per order.

transaction semantic

A transactions semantic is a way to identify brands, channels, stores, orders, products, quantities, per-item costs, and total costs. Use transactions semantics when a data source has one row per order.

Transaction Attributes Extended

The Transaction Attributes Extended table has attributes for customer flags, customer orders, data differences, time period roll-ups, and RFM scores. Many extended attributes have duration, order position, frequency, and revenue.

transitive connection

A transitive connection exists between individual records when any two records share a strong match to an intermediate record, but do not have a strong match to each other. For example: record 1 matches record 2, record 3 matches record 2, neither records 1 or 3 match to each other, but they have a transitive connection because both match record 2.

treatament group

A treatment group is the percentage of an audience who receives communications related to a marketing campaign. A campaign may have more than one treatment group.

Trino

Trino is a distributed SQL query engine designed to efficiently query large amounts of data using distributed queries. Formerly known as Presto SQL.

trivial duplicate

A trivial duplicate is a set of records that share enough matching PII to identify a single unique individual. Stitch identifies trivial duplicates early in the identity resolution process. Stitch uses only one of the records identified as trivial duplicates.

See qualified trivial duplicate.

TSV, tab-separated value

A tab-separated values (TSV) file is a delimited text file that uses a tab to separate values and stores tabular data in plain text. Each line in the file is a data record. Each record consists of one or more fields, separated by tabs.

Tulip

Tulip is a store experience platform that helps your brand connect with your customers by using personalized retail experiences, access to online and in-store products, and seamless omnichannel checkout and fulfillment.

U

UID2 tokens

A Unified ID 2.0 (UID2) token, also called an advertising token, is an encrypted form of a raw UID2 value generated from a user’s email address. UID2 tokens are refreshed periodically, are case sensitive, and are unique.

undercluster

An undercluster–a false negative–occurs when distinct records are incorrectly split from a cluster of records. Each undercluster affects the precision of identity resolution. Investigate to understand why they occur.

Unified ID 2.0

Unified ID 2.0 is an open source identity framework that benefits consumers by allowing them to control over data sharing with publishers and advertisers.

Unified Changes Clusters

The Unified Changes Clusters table has a history of changes to cluster graphs, relative to the previous Stitch run.

Unified Changes PKS

The Unified Changes PKS table has a history of changes to primary keys, relative to the previous Stitch run.

Unified Coalesced

The Unified Coalesced table has all PII data processed through Stitch. Each semantic tag is a column header. All data is coalesced into a single table. A unique Amperity ID may appear in more than one row.

Unified Compliance

The Unified Compliance table supports privacy rights workflows and has the search results for data subject access requests (DSAR) and customer delete requests. Each matching record has a row in the Unified Compliance table.

Unified Customer

The Unified Customer table has every row of every stitched table with all semantics coalesced into a single column. Stitch removes bad values and supersized clusters. A unique Amperity ID may appear in more than one row.

Unified Email Events

The Unified Email Events table has individual email event information, such as sends, opens, clicks, opt-in and opt-out preferences, bounces, and conversions. Apply email event semantic tags to data sources to have Stitch generate this table.

Unified Itemized Transactions

The Unified Itemized Transactions table has rows of transactional data summarized to the item level, and then coalesced into a single column for each unique combination of order ID and product ID. The order ID is unique by Amperity ID.

Unified Loyalty

The Unified Loyalty table has a row for every customer who belongs to your loyalty program, unique by combination of Amperity ID and loyalty ID.

Unified Loyalty Events

The Unified Loyalty Events table has a row for loyalty program events, unique by Amperity ID and event datetime.

Unified Paid Media

The Unified Paid Media table builds customer profiles that contain all known PII–including email addresses, phone numbers, names, and physical addresses–for each Amperity ID. This is different from the Merged Customers table, which has only the best PII for each Amperity ID. Use the Unified Paid Media to help increase the total number of matched customers with your paid media campaigns.

Unified Preprocessed Raw

The Unified Preprocessed Raw table is an output of Stitch. Use this table during Stitch QA to view data exactly as Stitch uses it after preprocessing and blocklisting. This table has one row for each trivial duplicate. Semantic values represent pre-processing values. Stitch removes bad values and replaces them with NULL.

Unified Product Catalog

The Unified Product Catalog table has a row for every item in your product catalog. Descriptive attributes include name, brand, size, and color. Grouping attributes include category, sub-category, class, and sub-class. Other attributes include MSRP, gender, and description.

Unified Scores

The Unified Scores table records all pairwise comparison scores and match categories for all groups of records. For each group of records, records all pairwise scores that are present between records within that group.

unified tables

Stitch generates a series of tables that unify your brand’s customer profiles, transactions, and interactions. Rows of records in unified tables are unique by Amperity ID.

See stitched domain tables and Stitch QA tables.

Unified Transactions

The Unified Transactions table has one row for each unique transaction record, with each order ID associated to an Amperity ID.

uniqueness

Uniqueness divides the number of unique values–cardinality–by the number of rows in a table.

unit cost

Unit cost is the cost to produce a single unit of one item.

unit discount amount

Unit discount amount is the discount amount applied to a single unit of one item.

unit list price

Unit list price is the manufacturer’s suggested retail price (MSRP) for a single unit of an item.

unit profit

Unit profit represents the amount of profit earned when selling a single unit of an item.

unit revenue

The total revenue for a single unit of an item or the total revenue for a single unit of a returned or cancelled item.

unit subtotal

A unit subtotal is the amount for a single unit of one item, before applying discounts.

unit tax amount

A unit tax amount is the total amount of taxes for a single unit.

units per transaction

See UPT.

unqualified trivial duplicate

An unqualified trivial duplicate is a set of records that do not have enough matching PII to score 3.0 or greater and cannot group together.

See trivial duplicate.

unstructured data

Unstructured data is data that does not adhere to a particular data model or definition, such as text or binary data.

Note

Amperity does not support unstructured data as a data source because is difficult to map to other data sources and is less useful for identifying common patterns and trends across datasets.

UPC code

A Universal Product Code (UPC or UPC code) is a barcode that is widely used to track items in stores.

UpSet Plot

An UpSet plot highlights the connectivity between data sources. Similar to a Venn diagram, an UpSet plot helps you to visualize the intersections of different data sources. Different from a Venn diagram, an UpSet plot has visualizations shown by data source.

Each bar in the UpSet plot shows the number of Amperity IDs that share records from the included sources and represents a mutually exclusive set of Amperity IDs.

Add up all the Amperity IDs for each bar to see the total number of Amperity IDs for the selected data source.

An UpSet plot chart, located within the Stitch page in Amperity.
UPT, units per transaction

The average number of items purchased in any given transaction.

upstream

Upstream refers to the part of a workflow that occurs before the current step, often with a dependency. For example, a data table is only updated when there is new customer data.

Urban Airship

See Airship.

UUID, universally unique identifier

A universally unique identifier (UUID) is a 128-bit number that uniquely identifies information in a computer system. A UUID is 32 digits displayed in five groups separated by hyphens in the form of 8-4-4-4-12. For example: 123e4567-e89b-12v3-a456-426614174000.

V

vertex, vertices

A vertex is a data item in a graph database. A vertex is sometimes referred to as a node.

Vibes

Vibes offers a variety of mobile marketing products and services, such as text message marketing (SMS or MMS), mobile wallet marketing, push notifications, and mobile web experiences.

VinoVisit

VinoVisit.com is a winery reservation and guest management software platform that helps customers book real-time winery reservations and multi-day itineraries.

vinSuite

vinSuite is wine software designed to manage direct-to-consumer wine business, website content management, e-commerce, wine club, and point-of-sale.

Visual Query Editor

The visual Query Editor allows creating queries using Presto SQL without having to write SQL. It has a series of dropdown menus, pick-lists, and combinations of WHERE and AND clauses. Use it to build SQL queries that run against tables in the customer profile database.

The Visual Query Editor located within the Segments page in Amperity.
Visual Segment Editor

The Segment Editor is the user interface for building segments. It uses a series of dropdowns, picklists, conditions, and values to define an attribute profile. Refresh segment insights to see how many customers match the profile.

The Visual Segments Editor located within the Segments page in Amperity.

W

Walmart Connect

Walmart Connect is an advertising platform that allows brands to reach shoppers across Walmart’s digital and physical stores. Send audiences to the Walmart demand-side platform to reach Walmart customers online and in-store.

win-back campaign

A win-back campaign uses a series of personalized email messages sent to customers who have stopped engaging with your brand to try to get them to re-engage with your brand.

win-back percentage

A win-back percentage represents the percentage of inactive customers who have returned to make a purchase within the measurement window.

workflow

A workflow is the series of steps that your data follows as it moves through Amperity. Information about a workflow, including details and individual tasks, is available from the Workflows page.

Wyng

Wyng is a marketing-software platform that enables advertising agencies and large brands to build and run consumer-engagement and loyalty campaigns, often featuring user-generated content shared through social media.

X

XML, eXtensible Markup Language

eXtensible Markup Language (XML) is a supported data format for customer data sources.

Y

YouTube

YouTube is an online video platform on which your brand can run in-stream, bumper, video, and discovery ads to build interest, brand awareness, and inspire your customers to take action.

Z

Zendesk

Zendesk is a service-first customer relationship management (CRM) application. Companies use Zendesk to improve customer relationships by providing support, chat, and community services that span channels.

Numbers

4-5-4 calendar

A 4-5-4 calendar divides years into months using a 4 weeks–5 weeks–4 weeks pattern. Each week starts on a Sunday and ends on a Saturday. Each quarter has the same number of days. A 4-5-4 calendar can be useful for comparing like days for sales reporting purposes.