Glossary

Welcome to the Amperity glossary.

A

Active Campaign

Active Campaign is an automation platform for ecommerce, business-to-consumer, and business-to-business that helps you use email and SMS marketing to reach, nurture, convert, and grow your customer base.

active customer

An active customer is one whose likelihood of returning to make another purchase is above 50%.

active database

An active database is fully configured to use stitched output tables, is ready to be run, is ready for queries and segmentation, and may be added to an SLA workflow.

active segment

An active segment can be run from the Segments tab or be run automatically (using an orchestration). An active segment:

  • Must have a unique name

  • Remains active until deleted

  • May only be edited as a draft segment, which is created automatically; a draft segment is assigned the same name as the active segment, but with “–Draft” appended to it

  • Remains active until a draft segment is activated

  • May be assigned to an automatic or manual workflow

Acxiom

Acxiom collects, analyzes, and sells information about customers and businesses for use with targeted advertising campaigns.

address

The address that is associated with the location of an individual customer record. For example: 123 Main Street.

-or-

The address that is associated with the location of a unique Amperity ID. For example: 123 Main Street.

address-based householding

Address-based householding is a process that standardizes addresses, pairs them with a last name, and then assigns that pair a Household ID.

address2

Additional address information, such as an apartment number or a post office box, that is associated with the location of an individual customer record. For example: Apt #9.

-or-

Additional address information, such as an apartment number or a post office box, that is associated with the location of a unique Amperity ID. For example: Apt #9.

Adobe Analytics

Adobe Analytics provides actionable intelligence about user activity on Web sites and mobile devices. Marketers can use these analytics to optimize navigation within sites and applications and to understand what users are doing in real-time.

Adobe Campaign

Adobe Campaign is a cloud-based service that manages direct marketing campaigns, leads, resources, customer data, and analytics.

Adobe Commerce

Adobe Commerce is a digital commerce and order management platform that allows customers to build multi-channel ecommerce experiences on a single platform. Adobe Commerce uses MySQL for database management. Previously known as Magento.

Adobe Customer Attributes

Adobe Customer Attributes is a component of Adobe Experience Cloud that enables users to upload visitor information (customer attributes), and then make those details available to Adobe Target and Adobe Analytics.

Adobe Experience Platform

Adobe Experience Platform is a component of Adobe Experience Cloud that transforms data into robust customer profiles that update in real-time and help marketers understand how to optimize customer experiences across every channel.

Adobe Marketo

Adobe Marketo is marketing automation software that streamlines, automates, and measures marketing tasks and workflows.

Affinity table

An Affinity table associates individual customers to the products they are most likely to purchase. Use an Affinity table to help deliver personalized experiences to your customers.

Airship

Airship is a customer engagement platform that helps marketers connect with their customers across every stage of the customer lifecycle.

Amazon Kinesis Data Firehose

Amazon Kinesis Data Firehose is a fully managed service for delivering real-time streaming data to Amazon S3.

Amazon Pinpoint

Amazon Pinpoint is a flexible and scalable communications service for inbound and outbound marketing. Use Amazon Pinpoint to connect with your customers over a variety of channels, such as email, SMS, push, voice, in-app messaging.

Amazon QuickSight

Amazon QuickSight is a cloud-based, self-service BI tool for creating and publishing interactive dashboards.

Amazon Redshift

Amazon RedShift is a data warehouse located within Amazon Web Services that can handle massive sets of column-oriented data.

Amazon S3, Amazon Simple Storage Service

Amazon Simple Storage Service (Amazon S3) stores customer data files of any size in many file formats.

Amperity

Amperity is a cloud application that ingests all customer data in its native format from any source, and then resolves customer identities across records even when data is incomplete, inconsistent, or lacks linking keys. All behavioral, contextual, and transactional data is merged to form actionable customer profile views with out-of-the-box attributes, affinities, and insights. Use queries and segments to identify specific groups of customers, and and then send those results to any downstream workflow.

Amperity Data Warehouse (ADW)

See Business Intelligence Connect.

Amperity ID

An Amperity ID is a patented unique identifier that is assigned to clusters of customer records. A single Amperity ID represents a single individual. Unlike other systems, the Amperity ID is reassessed every day for the most comprehensive view of your customers.

Note

The Amperity ID is a universally unique identifier (UUID) that is represented by 36 characters spread across five groups separated by hyphens: 8-4-4-4-12.

For example:

123e4567-e89b-12d3-a456-426614174000
Amperity.js

Amperity.js is a JavaScript-based software development kit that integrates directly with any website or web application to support real-time streaming of data from that website to Amperity.

Amp360

Amp360 merges behavioral, contextual, and transactional data into actionable profiles, even when your data sources are incomplete, inconsistent, or lack linking keys.

AmpID

AmpID resolves customer identities across all of your customer records by applying proprietary machine learning algorithms.

AmpIQ

AmpIQ enables customer-centric marketing campaigns. Use segment insights to build high-value segments. Use those segments to add audiences to campaigns. Build campaigns that send those audiences to any combination of downstream marketing workflows.

anonymous visitor

An anonymous visitor is an individual who visits a website, but does not sign up to receive information and does not purchase anything from the website. Anonymous visitors are interesting from a marketing perspective because they have shown some form of intent to engage with a brand. Anonymous visitors are a significant percentage–often more than 95%!–of website traffic.

Apache Avro

Apache Avro is a row-oriented remote procedure call and data serialization framework developed within the Apache Hadoop ecosystem. Avro uses JSON to define data types and protocols, and serializes data in a compact binary format.

Apache Parquet

Apache Parquet is a free and open-source column-oriented data storage format developed within the Apache Hadoop ecosystem. It is similar to RCFile and ORC, but provides more efficient data compression and encoding schemes with enhanced performance and can better handle large amounts of complex bulk data.

Attentive Mobile

Attentive Mobile is a personalized mobile messaging platform that improves brand engagement with mobile consumers.

attributes explorer

The attributes explorer is an expandable list of tables and column names that may be used for reference while building queries and segments. The attributes explorer is located on the right-side of the Segments tab and shows, for each table that is available to be queried, the list of available attributes.

attributes table

An attributes table consolidates a set of fields for use with a specific downstream workflow, and then assigns names to each field that correctly map to the naming patterns that are required by that workflow.

audience

An audience is a list of customers who match the attributes, filters, and conditions that are applied to a segment.

audience sizes

Audience sizes represent the total number of customers that are necessary for a percentage of purchases to be made within a given time window.

  • A large audience represents the percentage of the total audience that was required to capture 90% of total purchases 30 days ago.

  • A medium audience represents the percentage of the total audience that was required to capture 70% of total purchases 30 days ago.

  • A small audience represents the percentage of the total audience that was required to capture 50% of total purchases 30 days ago.

The purchase curve and audience sizes.
AUR, average unit retail

The average selling price for a retail item.

authentication

Authentication is the process of establishing that a user is who they say they are. For example, by asking a user to provide a username and password.

authorization

Authorization is the process of establishing that a user is allowed to perform a given action. Authorization is mapped to a role in Amperity.

automatic courier

An automatic courier is an active courier that is configured to be run as part of a courier group on a scheduled basis. An automatic courier is configured from the Sources tab and is a prerequisite for certain downstream processes.

automatic query

An automatic query is an active query that is configured to be run whenever upstream data changes. An automatic query is configured from the Queries tab.

automatic segment

An automatic segment is an active segment that is configured to be run whenever upstream data changes. An automatic segment is configured from the Segments tab.

average order value

Average order value is the average order value for a unique customer’s transactions.

average unit retail

See AUR.

AWS Lambda

AWS Lambda runs code for any type of application or backend service that can be configured to run automatically from within Amazon Web Services to support any downstream workflow.

Azure Blob Storage

Azure Blob Storage is an object storage solution for the cloud that is optimized for storing massive amounts of unstructured data.

Azure Data Factory

Azure Data Factory is a service in Azure that can convert any data format into another data format, such as converting Apache Parquet to CSV.

Azure Data Lake Storage

Azure Data Lake Storage is a no-limits data lake that enables access to files as if they were in a directory structure.

Azure Data Lake Storage Gen1 is a hyper-scale repository that is optimized for big data analytics workloads.

Azure Data Lake Storage Gen2 combines an Apache Hadoop compatible file system that uses an integrated hierarchical namespace along with the scale of Azure Blob Storage.

Azure Data Share

Azure Data Share is a simple and safe service for sharing data in any format and any size with Amperity. Azure Data Share requires no infrastructure setup or management and uses underlying Azure security measures as they are applied to both Azure accounts. Snapshot-based sharing of data can be automated and does not require a special access key.

Azure Synapse Analytics

Azure Synapse Analytics is a limitless analytics service that brings together enterprise data warehousing and analytics. Azure Synapse Analytics has four components: SQL analytics, Apache Spark, hybrid data integration, and a unified user experience.

B

bad-values blocklist

A bad-values blocklist contains known values that appear frequently in data and should be excluded from the Stitch process.

Bazaarvoice

Bazaarvoice allows retailers to manage user-generated content on their website.

birthdate

The date of birth that is associated with an individual customer record.

-or-

The date of birth that is associated with a unique Amperity ID.

block

A block is a group of records that match the characteristics defined by the blocking strategy.

blocking

Blocking is a process that uses simple rules to divide massive sets of data records into small blocks that are rapidly processed and offer higher probabilities of discovering matching records.

blocking key, bk-[semantic]

A blocking key defines a specific combination of characters to be used as a blocking strategy. For example, the first three characters in given-name, the first character in surname, and birthdate represent a blocking key.

blocking strategy

A blocking strategy acts like a filter against a very large data set. Each blocking strategy applies its filter and all records that match are grouped together into a block. Each record that matches a blocking strategy is a blocking key.

Bluecore

Bluecore is a marketing technology company that intelligently connects casual shoppers to products with the goal of transforming those casual shoppers into lifetime customers.

Bluekai

See Oracle DMP.

brand

The brand associated with a customer interaction.

Braze

Braze is a leading marketing automation platform that allows users to create custom experiences based on sophisticated customer attributes and segments, then map those experiences to campaigns.

Brightloom

Brightloom is a customer growth platform (CGP) company for restaurants, retailers, and consumer brands that leverages customer transaction history and marketing campaign data to predict individual customer buying preferences.

Business Intelligence Connect

Business Intelligence Connect is an Amperity-managed cloud data warehouse that provides an easy-to-access location from which you can use any BI tool to access all of your Amperity data.

byte order mark

The byte order mark (BOM) is an optional usage of the special Unicode character “U+FEFF BYTE ORDER MARK”. This character is used as a magic number at the start of a file to indicate byte order and character encoding in the file.

C

California Consumer Privacy Act (CCPA)

The California Consumer Privacy Act (CCPA) is law that covers data protection and privacy in the state of California. It gives control to individuals over their personal data and addresses the transfer of personal data, including providing for the ability to request removal of data.

Camelot SMM

Camelot SMM provides marketing strategies and media services to top brands of all sizes, leveraging data, technology, and industry-leading experience.

campaign

A campaign is a message or offer that is sent to a specific group of customers or recipients.

Campaign Monitor

Campaign Monitor is an email marketing platform that tracks details related to email campaigns (opens, clicks, bounces, unsubscribes, spam complaints, and recipients) and email subscriber lists (active, unconfirmed, bounced, and deleted subscribers), and other details.

campaign type

A campaign may be one of the following types:

  • One-time

  • Recurring

Campaign_Recipients

The Campaign_Recipients table contains a list of Amperity IDs associated with any campaign (i.e., sub-audiences) that were sent from Amperity. Each campaign is associated with the segment used to generate the list of recipients, the control and recipient groups (including sub-audiences) used for the campaign, its launch date, and all destinations to which the campaign was sent.

Campaigns tab

The Campaigns tab enables effective use of customer data to activate a variety of use cases across your marketing stack.

cardinality

Cardinality is a measure of how many unique values are present in data. A higher cardinality indicates a larger percentage of unique values, whereas a lower cardinality indicates a higher percentage of repeat values.

CASS, coding accuracy support system

Coding accuracy support system (CASS) is an address standardization concept that helps clean address to make them more effective for direct mail campaigns.

channel

A channel is the purchase method used by a customer. For example: online, point-of-sale, loyalty, in-store, and so on.

Cheetah Digital

Cheetah Digital is a customer engagement platform that helps marketers deliver personalized experiences, create effective cross-channel messaging, and increase customer loyalty.

churn

Churn represents the likelihood of a customer not making a purchase during the next year.

churn prevention

Churn prevention is a process of identifying, and then marketing to users who are likely to churn.

churn propensity

Churn propensity is a predictive model that determines the likelihood that a customer will be active at any point in time, based on their purchase history. The churn propensity model outputs a score between 0 and 1 that represents a customer’s probability of return.

city

The city that is associated with the location of an individual customer record.

-or-

The city that is associated with the location of a unique Amperity ID.

claim

A claim is a set of information that is provided by an identity provider (IDP) to a service provider (Amperity). Each individual claim key specifies a single claim, such as a user’s email address, name, or the role to which they are assigned in Amperity.

clienteling

Clienteling is a technique used by retail sales associates to establish long-term relationships with key customers based on data about their preferences, behaviors and purchases.

cluster graph

A cluster graph is one of the outcomes of the Stitch process. It is a visual representation of every pairwise connection in a cluster of records. Each pair can be explored in more detail.

The data explorer, showing the cluster graph.

See pairwise connection and stitched records.

cluster transition

A cluster transition occurs when records move from one cluster to another during the Stitch process.

clustering

Clustering is the process of deciding which records are included in a customer profile. A matching threshold defines the minimum threshold at which two records can be matched, and then included in a cluster. Lower quality matches may be included, but only as a transitive connection. Distinct customer profiles emerge as a cluster of record pairs.

CLV

Customer lifetime value (CLV) measures how valuable a customer has been to your company or brand.

COGS, cost of goods sold

Cost of goods sold (COGS) are the direct costs of producing goods that are sold by a brand, including the costs of materials and labor to produce the item, but excluding indirect expenses like distribution or sales.

combined requests table

The Combined_Requests table consists of all the data subject access request (DSAR) data pulled from single or multiple tables with compliance semantics assigned to them.

common table expression (CTE)

A common table expression (CTE) is a named subquery defined by the WITH clause. A CTE defines a set of variables that act as a temporary view, an optional list of column names, and a query expression. The results of the query expression behaves like a table. Each column of that table is defined by the optional list of column names.

company

The company, typically an employer or small business, that is associated with an individual customer record.

-or-

The company, typically an employer or small business, that is associated with a unique Amperity ID.

completed query

A completed query is an active query that has run successfully through Amperity as part of an orchestration within the past 24 hours.

completed segment

A completed segment is an active segment that has run successfully through Amperity as part of an orchestration within the past 24 hours.

completion

Completion is a simple percentage of data rows that contain some value within a column.

compliance detail report table

The Compliance_Detail_Report passthrough table contains one row for each found record.

compliance overview report table

The Compliance_Overview_Report passthrough table contains a row for each request.

component ID

An identifier that represents a set of records that are transitively connected with a score above threshold as an outcome of blocking and initial scoring. Records that share a component ID, but have different Amperity IDs, were split during hierarchical comparison.

Concise Binary Object Representation (CBOR)

CBOR (Concise Binary Object Representation) is a binary data serialization format loosely based on JSON. Like JSON it allows the transmission of data objects that contain name–value pairs, but in a more concise manner. This increases processing and transfer speeds at the cost of human-readability.

consumer packaged goods (CPG)

Consumer packaged goods are items that are used on a daily basis by consumers that require routine replacement or replenishment, such as food, beverages, clothes, personal items, and household products.

control group

A control group is the percentage of an audience who will not receive communications related to a marketing campaign. Use control groups to establish a baseline against which you can measure the success of a campaign over time.

Cordial

Cordial is a cross-channel marketing and data platform that collects all of your customer data in one platform for use with audience segments, trends, and automated customer experiences.

cost

Cost represents the total cost of all goods sold (COGS) for a product.

country

The country that is associated with the location of an individual customer record.

-or-

The country that is associated with the location of a unique Amperity ID.

courier

A courier brings data from external system to Amperity. A courier relies on a feed to know which fileset to bring to Amperity for processing.

courier fileset

See fileset.

courier group

A courier group is a list of one (or more) couriers that are run as a group, either ad hoc or as part of an automated schedule. A courier group can be configured to act as a constraint on downstream workflows.

courier group schedule

A schedule defines the frequency at which a courier group runs. All couriers in the same courier group run as a unit and all tasks must complete before a downstream process can be started. The schedule is defined using cron.

courier plugin

A courier plugin is a connector that is built into Amperity for the purpose of collecting data from commonly used data sources.

Criteo

Criteo is a commerce media platform that helps marketers and media owners manage and scale campaigns.

cron

Cron is a time-based job scheduler that uses cron syntax to automate scheduled jobs to run periodically at fixed times, dates, or intervals.

crontab syntax

Cron syntax specifies the fixed time, date, or interval at which cron will run. Each line represents a job, and is defined like this:

┌───────── minute (0 - 59)
│ ┌─────────── hour (0 - 23)
│ │ ┌───────────── day of the month (1 - 31)
│ │ │ ┌────────────── month (1 - 12)
│ │ │ │ ┌─────────────── day of the week (0 - 6) (Sunday to Saturday)
│ │ │ │ │
│ │ │ │ │
│ │ │ │ │
* * * * * command to execute

For example, 30 8 * * * represents “run at 8:30 AM every day” and 30 8 * * 0 represents “run at 8:30 AM every Sunday”. Amperity validates your cron syntax and shows you the results. You may also use crontab guru to validate cron syntax.

Cross Country Computer

Cross Country Computer specializes strategic solutions for companies that are seeking to evolve their marketing methods around acquiring, retaining, and developing valuable customers.

CSV, comma-separated values

A comma-separated values (CSV) file, defined by RFC 4180 , is a delimited text file that uses a comma to separate values. A CSV file stores tabular data (numbers and text) in plain text. Each line of the file is a data record. Each record consists of one or more fields, separated by commas. The use of the comma as a field separator is the source of the name for this file format.

currency

Currency represents the type of currency that was used to pay for an item. For example: dollar.

custom database table

A custom database table exists when domain SQL is used to build a domain table. These tables are used as inputs for standard database tables when semantic tags are applied and may be passed through to any database.

custom domain table

A custom domain table is a domain table that defines its schema using Spark SQL.

customer

A customer is a known individual with a core set of associated personally identifiable properties, such as their given name, surname, birthdate, mailing address, email address, and phone number.

customer 360 database

A customer 360 database is built using standard outputs of the Stitch process that provide a unified view of customer data, including customer profile and interaction records that are linked together by the Amperity ID, organized, merged, and ready for use in segmentation.

customer 360 profile

Amperity creates a unique profile for each unique customer record, and then stores these profiles in the Customer_360 table. Each unique profile is assigned an Amperity ID. Amperity calculates the number of unique profiles by counting the number of Amperity IDs in the Customer_360 table.

Customer 360 tab

The Customer 360 tab provides the interface to view and manage all databases, including the customer 360 database, along with related customer profile and interaction data. Explore and interact with stitched tables. Use Spark SQL to build the customer 360 database, add tables, define predictions, and surface data points that can be used with queries and segments.

The Customer 360 tab in Amperity.
C360, Customer 360 (data table)

The Customer_360 table is the unified view of the customer across all points of engagement, including attributes that cross systems. This table does not exist by default and must be created within the customer 360 database. Each row represent a complete record for a unique individual, including their Amperity ID, merged PII data, and summary attributes.

Customer_Attributes

The Customer_Attributes table contains a series of flags that identify attributes about individuals, such as if that individual can be contacted, if a marketable email address, physical address, or phone number is available, if they are an employee, reseller, or if the individual represents a test account, along with identifying each individuals’s revenue relationship with a brand.

customer data table

A customer data table contains customer records from an external source system.

customer key, ck

A customer key is a column in a data table that contains a unique identifier that represents a unique ID for that data as it exists in the customer’s data outside of Amperity.

customer lifecycle status

Customer lifecycle status is represented by a probabilistic score–referred to as p(return) or “probability of return”–that identifies if a customer is active or if they are likely to churn.

A customer’s p(return) score determines the customer lifecycle status tier to which they are assigned:

Status tier

p(return) score

Active

p(return) score is over 60%

Cooling down

p(return) score is between 50%-60%

At risk

p(return) score is between 35%-50%

Highly at risk

p(return) score is between 20%-35%

Lost

p(return) score is below 20%

customer lifetime value (CLV)

See CLV.

customer profile

A customer profile is a collection of attributes that are associated with a single unique individual in the customer 360 database. The total number of customer profiles is equal to the total number of rows in the Customer 360 data table. This total correlates strongly, but not exactly, to the total number of Amperity IDs assigned to unique individuals in the same data set.

customer record

A customer record is a row in a customer data table that contains information (columns) about the customer. Who they are, where they live, and how much they spend. For example, a email list table contains names, email addresses, phone numbers, and so on. All domain tables that contain customer records are stitched together to create Amperity IDs.

Custora

Custora is cloud-based customer analytics solution that applies machine learning across retail, organization, and customer touchpoints.

D

Dashboard tab

The Dashboard tab provides a visual overview of the state of Amperity and the actions it has taken.

data explorer

The Data Explorer provides a detailed way to navigate through data in Amperity. The information available from the Data Explorer depends on the location in Amperity from which it was accessed:

  • The Stitch tab opens the Data Explorer to three views: stitched connections, cluster graphs, and pairwise connections.

  • The Customer 360 tab opens the Data Explorer to a view that shows all tables in the database, from which you may view each of the individual tables.

  • The Segments tab opens the Data Explorer to a specific table, and then provides a view of its schema along with sample data.

The data explorer, as opened from within the Stitch tab in Amperity.
data mapping

A data mapping template defines how columns in Amperity output are mapped to fields or columns required by a destination, including the ordering and naming of columns. This is especially useful when a destination has case-sensitive column names or when it uses a REST API with specific requirements for naming patterns. A destination that does not have requirements for the shape of data should use null instead of defining a data mapping structure.

data source

A data source is a system in a customer’s environment that sends data to Amperity as structured or semi-structured data via a filedrop or REST API.

data subject access request (DSAR)

A data subject access request (DSAR) is a written request made by an individual to ask for their data to be handled according to General Data Protection Regulation (GDPR) and California Consumer Privacy Act (CCPA)regulations. A Data Subject Access Request (DSAR) requires a response within a pre-defined time window, typically 30 days.

data table

A data table is a set of rows that are organized into named columns with types, typically provided as output from some type of database application in a SQL format.

data template

A data template defines how columns in Amperity data structures are sent to downstream workflows. A data template is part of the configuration for sending query and segment results from Amperity to an external location.

data type

A data type defines the type of data that is in a column. Amperity supports the following types: integer, string, float, decimal, date, datetime, map, array, and Boolean. When defining a feed for ingesting customer data to Amperity, it is important to consistently apply the correct data type for incoming columns.

Databricks

Databricks provides a unified platform for data and AI that supports large-scale processing for batch and streaming workloads, standardized machine learning lifecycles, and accelerated data science workflows for large datasets.

DataGrid

DataGrid pulls all your customer data from any source-–online, offline, historical, streaming–without requiring schema planning or ETL. DataGrid sends results of any size and type to any of your downstream applications and workflows.

Datalogix

Datalogix, now a component of Oracle Data Cloud, connects offline purchase signals to digital media, helping marketers in the United States create comprehensive consumer profiles across digital, mobile, offline, and TV.

See Oracle Data Cloud.

days since last order

Days since latest order is the number of days that have elapsed since a customer has placed an order.

deduplication

Deduplication is an output of the Stitch process that identifies the total number of unique individuals within a customer data set.

deduplication rate

The deduplication rate represents the total number of unique individuals within a customer data set. This rate measures the difference between the total number of original identifiers in customer data and the total number of Amperity IDs that were assigned to unique individuals.

delete confirmation

A delete confirmation is sent from Amperity to confirm that a request to delete a user record was completed.

delete user record request

A delete user record request. This is a written request that asks for data related to a specific user record to be deleted from Amperity.

Demandware

See Salesforce Commerce Cloud.

derived semantic

A derived semantic is a semantic that can be inferred from existing data. For example, “given-name” and “surname” semantics can be inferred from “full-name” and “gender” can be inferred from “title”. Amperity creates derived semantics automatically to ensure the greatest number of individual semantics are available to Stitch during identity resolution.

destination

A destination is a location that receives data from Amperity.

destination plugin

A destination plugin is a template that defines how Amperity sends data to an external location. When you select a plugin, the dialog boxes are updated for the correct fields required by the selected plugin.

Destinations tab

The Destinations tab provides the interface for defining the destinations that may receive data from Amperity, the destinations to which individual segments are sent, and if this workflow is automated.

The Destinations tab in Amperity.
Detailed_Examples

The Detailed_Examples table contains detailed examples of Stitch results. Use these examples to help identify which features lead to scores with the biggest effect on overall Stitch results, including how they are associated with various combinations of fields that contain PII data.

deterministic

Deterministic (rules-based) identity resolution generates exact matches — with simple and minimal rules — representing values. The results prioritize predictability over accuracy. For example, this option is very useful for operational use cases associating a person with their payments.

digital channel

The digital channel by which a transaction was made. For example: Facebook, Google Ads, email, etc.

direct mail

Direct mail is a type of marketing campaign that marketers employ to deliver printed advertisement materials to customers through the postal mail.

display advertising

Display advertising is advertising that is placed on websites for the purpose of attracting new customers that are otherwise difficult to reach.

domain table

A domain table is created for each data feed that has been configured to ingest a data source into Amperity. The data in a domain table is processed to include semantic tagging alongside the original fields from the data feed. Domain tables that contain customer records are made available to the Stitch process to identify unique individuals and assign them Amperity IDs. Domain tables that contain interaction records are used to create attributes that are associated with the unique individuals who have been assigned Amperity IDs.

Domo

Domo is a cloud-based, self-service BI tool that helps you visualize data from a single dashboard.

downstream

Downstream refers to the part of a workflow that occurs after the current step to which the downstream process often has some type of dependency. For example, if this data table updates, then the following segments will run automatically: segment A and segment B.

draft database

A draft database is in an inactive state that is not ready for segmentation. A draft database may used only for testing or experimentation. It may also be progressing toward being put into an active state.

draft segment

A draft segment is any segment that has not been made an active segment. A draft segment:

  • Is the default type for any segment that is added to Amperity

  • Exists in a draft state until activated

  • May exist indefinitely

  • Is appended with “–Draft” when it’s a working copy of an existing active segment; only a single working copy of an active segment may exist at any time

  • Is auto-saved by Amperity

  • Allows results of queries to be downloaded as a CSV file

  • May be run via the segments editors

  • Must be activated before it can be run from the Segments tab or be added to an orchestration

DSAR

See data subject access request (DSAR).

DSAR response

A response to a DSAR is sent from Amperity to confirm that PII data related to the subject of a DSAR was discovered.

duplicate record

A duplicate record is a record that is matched to one or more other customer records as part of an Amperity cluster. Amperity uses duplicate records to calculate the duplication rate.

duplication rate

The duplication rate is reported by Amperity as the percentage of all ingested customer records that are duplicate records. Duplication rate is calculated for each customer table as well as for the overlap between customer tables.

Dynamic Yield

Dynamic Yield helps companies quickly build and test personalized, optimized, and synchronized digital customer experiences.

E

early repeat purchaser

Early repeat purchaser is a flag that indicates if a customer made a repeat purchase within the previous 90 days.

early repeat rate, ERR

Early repeat rate is a measure of one-time to two-time buyer conversion based on second purchases that were made within the previous 90 days.

edge

An edge is the relationship between two vertices in a graph database. Each edge has a type and must start with one vertex and end with another.

email

The email address that is associated with an individual customer record. A customer record may be associated with multiple email addresses.

-or-

The email address that is associated with a unique Amperity ID. A customer record may be associated with multiple email addresses.

email events

Email events associate email summary statistics to brands, email addresses, regions, event types, event dates and times, and sender IDs.

email summary statistics

Email summary statistics provide fields that summarize customer engagement with your brand. Individual statistics include brand, email address, counts for opens and clicks by day (1, 3, 5, 7, and 14) and by month (3, 6, 9, and 12), engagement frequency, and engagement status.

Email_Engagement_Attributes

The Email_Engagement_Attributes table contains all fields from the Email_Engagement_Summary table, unique by Amperity ID and by email address. This table may be extended for related email engagement and transaction attributes.

Email_Engagement_Summary

The Email_Engagement_Summary table contains a summary of email event statistics, such as counts for opens and clicks, the first open, and the most recent click, unique by email address.

encoding method

Warning

TODO.

environment

An environment represents the cloud platform in which an Amperity tenant runs: Amazon AWS or Microsoft Azure. All aspects of an Amperity tenant run within the same environment, including:

  • Processes that pull data from customer data sources

  • Processes that send data to destinations

  • SQL for databases, queries, and segments

  • Sandboxes

Epsilon

Epsilon provides a suite of loyalty marketing services that spans database marketing, direct mail, email marketing, web development, loyalty programs, analytics, data services, strategic consulting, and creative services.

Epsilon Abacus is a cooperative managed on behalf of all its members to provide data and analytic expertise to help improve your marketing effectiveness and deliver a greater return on your marketing investment.

Epsilon Conversant provides services to help marketers recognize, implement and measure the value of personalization solutions.

Epsilon Targeting offers market intelligence solutions to better understand multi-channel buying patterns and habits, and then to follow-up with highly personalized messages for better marketing campaign results.

escape character

An escape character is specific to a data format and allows certain character sequences to be interpreted differently from characters that are not prefixed by that escape character.

Evergage

See Salesforce Interaction Studio.

ExactTarget

See Salesforce Marketing Cloud.

exclusion list

An exclusion list identifies a list of customers who should not receive communications related to a marketing campaign.

Experian

Experian provides credit reporting services and scores to businesses and consumers to help protect them from fraud and identity theft.

F

Facebook Ads

Facebook Ads appear in the news feed and in the right-side column on Facebook. Facebook Ads provides an API for integrating with third-party systems.

feed

A feed defines how data should be loaded into a domain table, including specifying which columns are required and which columns should be associated with a semantic tag that indicates that column contains customer profile (PII) and transactions data.

Feed Editor

The Feed Editor is the second step in the process of adding a new data source to Amperity. The Feed Editor is the interface in which you assign field types, apply semantic tags and primary keys, and indicate if the data associated with this feed should be made available to Stitch.

field

A field refers to a single piece of data within a record or a set of records. In relational databases, this corresponds to the data within a column. For instance, within an Amperity customer table, a field would refer to an individual piece of data, such as zip code, or phone number.

filedrop

A filedrop is the most common way to transfer data to and from Amperity. A cloud-based filedrop location is built to store and retrieve any amount of data from anywhere without restriction on file format or file size. A filedrop location may be Amperity-managed or customer-managed.

filename template

A filename template defines the naming patterns for files that are sent by Amperity to a location in which files are dropped. A filename template specifies the name of the file and then uses Jinja-style string formats to append a date to the filename to ensure that any downstream process can identify which file is the one to be picked up.

fileset

A fileset is a group of files that are processed as a unit by a single courier. A fileset defines each file individually by name, datestamp, file format, and load operation. A courier expects all files in a fileset to be available for processing, unless a file is specified as optional.

first order date, first order datetime

First order date is the date on which the first order was placed by a customer.

first order ID

First order ID is the order ID for the first order that was made by a customer.

first order revenue

First order revenue is the total revenue that is associated with a customer’s first order, ignoring returned items and/or cancelled items.

first order total items

First order total items represents the number of items that were purchased as part of a customer’s first order, ignoring returned items and/or cancelled items.

first-party data

First-party data is provided to a company or a brand from the customer, often directly. For example: filling out a registration card, completing an online form, requesting an emailed receipt, signing up for a benefit or rebate, browsing an ecommerce site, and purchasing an item in a store using a credit card.

first-to-latest order days

First-to-latest order days is the number of days that have elapsed between the date of the first order and the date of the latest order.

first-to-second order days

First-to-second order days is the number of days that have elapsed between the date of the first order and the date of the second order.

foreign key, fk

A foreign key is a column in a data table that acts as primary key and can be used for deterministic matching of records. A record pair is assigned an exact match score (5.0) when foreign keys contain identical values during pairwise comparison.

Forget Users

Forget Users is a tool that is available to users of Amperity who have Full Admin privileges. This tool is used to delete user records from domain tables to support certain CCPA and GDPR workflows that require the deletion of user records based on requests from customers.

full name

A combination of given name (first name) and surname (last name) that is associated with an individual customer record and is stored as a combined value in a single field within customer data. A full name may include a middle name or initial.

-or-

The combination of given name (first name) and surname (last name) that is associated with a unique Amperity ID. A full name may also include the middle name or initial.

G

General Data Protection Regulation (GDPR)

The General Data Protection Regulation (GDPR) is law that covers data protection and privacy in the European Union (EU) and the European Economic Area (EEA). It gives control to individuals over their personal data and addresses the transfer of personal data outside the EU and EEA areas. GDPR simplifies the regulatory environment for international business by unifying regulation within the EU.

gender

The gender that is associated with an individual customer record.

-or-

The gender that is associated with a unique Amperity ID.

generational suffix

The suffix that identifies to which family generation a customer record belongs. For example: Jr., Sr. II, and III.

-or-

The suffix that identifies to which family generation a unique Amperity ID belongs. For example: Jr., Sr. II, and III.

given name

The first name that is associated with an individual customer record.

-or-

The first name that is associated with a unique Amperity ID.

Google Ads

Google Ads appear within the Google advertising network to web users. Advertisers pay to display brief advertisements, service offerings, product listings, and so on.

Google Advertising ID, Google AAID

The Google advertising ID (Google AAID) is a device identifier for advertisers that anonymously tracks users on Android devices.

Google Audience Partner API

Google Audience Partner API uses first-party data that your customers have shared with you to enable advertising across Google – including search and display advertising, the Shopping tab, Gmail, and YouTube – to your customers and to similar audiences.

Google Big Query

Google BigQuery is a fully-managed data warehouse that provides scalable, cost-effective, serverless software that can perform fast analysis over petabytes of data and querying using ANSI SQL.

Google Chrome

Google Chrome is the recommended web browser for Amperity.

Google Cloud Storage

Google Cloud Storage is an online file storage web service for storing and accessing data on Google Cloud Platform infrastructure.

Google Pub/Sub

Google Pub/Sub is a low-latency messaging service that can be configured within Google Cloud to stream data (including real-time) to Google Cloud Storage.

graph database

A graph database stores relationships between data items. A graph database that contains Amperity data stores relationships between customer profiles, transactions, and the Amperity ID graph.

gzip

gzip is a file format used for file compression and decompression.

H

hard conflict

A hard conflict occurs when profile values are different enough to ensure that two records should not be grouped together. A hard conflict is most often associated with a birthdate or a generational suffix, but can be associated with other combinations of profile data.

hierarchical comparison

A hierarchical comparison is a step in the Stitch process that occurs after pairwise scoring to closely examine each group of records to identify edge cases, such as married couples with overlapping profile (PII) data or children with the same name as a parent who live at the same address.

high cardinality profile attribute

A high cardinality profile attribute contains a very large number of distinct values. For example, phone numbers, email addresses, and postal codes. High-cardinality profile attributes with an exact or approximate match are useful during identity resolution.

holdout control group

See control group.

Household ID

A Household ID is a universally unique identifier (UUID) that uniquely identifies the combination of a standardized address and a last name.

householding

A household is a group of people who share a physical address and possibly other attributes. For example: a shared device or network, a shared last name, a shared phone number. Householding is a process that identifies a unique household in a data set.

HubSpot

HubSpot is a CRM platform built around dedicated hubs for marketing, sales, services, content management, and operations.

I

Identifier for Advertisers (IDFA)

The Identifier for Advertisers (IDFA) is a random device identifier that tracks and identifies a user, but without revealing personally identifiable information (PII). IDFA is assigned by Apple to a user’s device and is used for customized advertising.

identity provider, IdP, IDP

An identity provider (IDP) is system that creates, maintains, and manages identity information for users, and then provides claims and authentication. Common examples of IDPs include Auth0, Azure AD, Okta, and PingFederate.

identity resolution

Identity (ID) resolution is the process of connecting and matching different data points across multiple devices and channels to form a unified view of a single customer, allowing brands to connect the dots between fragmented data to form a complete picture of an actual person.

inactive customer

An inactive customer is one whose likelihood of returning to make another purchase is below 50%.

Infutor

Infutor is a consumer identity management platform that helps brands understand their consumers and make informed marketing and risk decisions using a secure, privacy compliant foundation to improve inbound engagements and outbound marketing reach, and to minimize fraud and collections risk.

interaction record

An interaction record is a row in a customer data table that contains information about customer behavior, such as purchases (items bought, items returned, costs of items, etc.) and preferences (brands, products, cart adds, etc.).

is cancellation?

A flag that indicates if the item was cancelled.

is returned?

A flag that indicates if the item was returned.

item cost

Item cost is the cost to produce all units of an item.

item discount amount

Item discount amount is the discount amount that is applied to all units that are associated with a single item within a single transaction.

item discount percent

Item discount percent is the percentage discount that is applied to all units that are associated with a single item within a single transaction.

item list price

The manufacturer’s suggested retail price (MSRP) for all units of this item.

item profit

Item profit represents the amount of profit that is earned when all units of an item are sold.

item quantity

Item quantity is the total number of items in an order. When an item has been returned or an order has been cancelled, item quantity is the total number of items that were returned and/or cancelled.

item revenue

The total revenue for all units of an item, after discounts are applied. When an item has been returned or the order has been cancelled, the total revenue for all items that were returned and/or cancelled.

item subtotal

An item subtotal is the amount for an item, before discounts are applied.

item tax amount

An item tax amount is the total amount of taxes that are associated with the purchase of an item.

itemized transaction

An itemized transaction represents data that captures orders at the item level, with multiple rows per order and one order per item with clearly identified returns and cancellations.

itemized transaction semantic

An itemized transactions semantic is a way to identify brands, channels, stores, orders, products, quantities, per-item costs, total costs, and so on. Use itemized transactions semantics when a data source contains one row per item.

J

JDBC, Java Database Connectivity

Java Database Connectivity (JDBC) uses the Java programming language to define how a client may access a database.

jitter

Jitter tracks the rate of change with Amperity IDs between Stitch runs.

Joda-Time

Joda-Time is an open-source date and time library that is used by Amperity to establish consistency in filename patterns. The recommended pattern is “Segment_Name_MM-dd-YYYY”, where “Segment_Name” is the name of the segment and “MM-dd-YYYY” will append the current date.

journey

A journey is a milestone (or series of milestones) that a customer goes through as they experience a brand.

journey orchestration

Journey orchestration is a process that enables businesses to use what they know about a customer–previous behavior, preferences, real-time activity, and so on–to personalize each customer’s journey and improve business outcomes.

JSON, JavaScript Object Notation

JavaScript Object Notation (JSON) is language-independent data format that is derived from (and structured similar to) JavaScript.

JWT, JSON Web Tokens

JSON Web Tokens are an open, industry standard RFC 7519 method for representing claims securely between two parties.

K

Klaviyo

Klaviyo is an email platform for targeting, personalizing, measuring, and optimizing email and Facebook campaigns.

Koupon Media

Koupon Media provides a platform that enables brands and retailers to launch data-driven campaigns that combine loyalty programs and mobile offers for their most valuable customers.

Kustomer

Kustomer is a platform for customer experience, service, and support that provides a complete picture of your customer when they contact your company.

L

landing area

The location in Amperity to which data is placed after it was collected from a source location by a courier.

lapse rate

Lapse rate is the percentage of customers who purchased in the previous time period, but did not purchase during the current time period.

large dataset

A large dataset is a file over 500GB in size.

latest order date, latest order datetime

Latest order date is the date (and time) on which the most recent order was placed by a customer.

latest order ID

Latest order ID is the order ID for the most recent order that was made by a customer.

latest order revenue

Latest order revenue is the total revenue that is associated with a customer’s most recent order, ignoring returned items and/or cancelled items.

latest order total items

Latest order total items is the number of items that were purchased as part of a customer’s most recent order, ignoring returned items and/or cancelled items.

lifetime customer value (LCV)

See customer lifetime value.

lifetime average item price

The average individual item price for all orders, ignoring returns and/or cancellations.

lifetime average number of items

The average number of items purchased for all orders, ignoring returns and/or cancellations.

lifetime average order value

The average lifetime revenue for all orders, ignoring returns and/or cancellations.

lifetime largest order value

Lifetime largest order value identifies the largest order that is associated with a customer, ignoring returns and/or cancellations, for a customer’s entire purchase history.

lifetime order frequency

A lifetime order frequency is the total number of orders that that a customer has made during their entire relationship with your brand.

lifetime order revenue

The lifetime revenue for all items, ignoring returns and/or cancellations.

lifetime total items

The lifetime number of individual items associated with the transaction, ignoring returns and/or cancellations.

lifetime value (LTV)

See customer lifetime value.

linking tables

A linking table is used to connect custom domain table row data with the records associated with each data row.

list price

A list price is the manufacturer’s suggested retail price (MSRP) for all units of this item, before shipping, tax, or discount amounts are applied. A list price represents a single unit of the item purchased.

Listrak

Listrak enables personalized cross-channel interactions that help automate campaigns, build customer loyalty, and increase conversion rates.

LiveRamp

LiveRamp allows clients to combine customer data from various online and offline sources, centering around the use of web cookies that allow websites to remember visitors.

load operation

A load operation defines the fileset to be processed by a courier. Each file in the fileset is specified by name, format, and feed, after which it is assigned a load operation.

location

A location is a unique combination of address, address2, city, state, and postal details for an individual customer record.

-or-

A location is a unique combination of address, address2, city, state, and postal details for a unique Amperity ID.

lookalike audience,

A lookalike audience is an audience with a similar profile to an existing audience, such as similar demographics, interests, and historical behaviors.

Looker,

Looker is an enterprise platform for business intelligence, data applications, and embedded analytics.

lookup table

A lookup table provides access to customer-specific lists or categories when those lists and/or categories are long and dynamic. For example: a complete list of airport names with their corresponding airport code.

loyalty ID

The identifier for a loyalty program that is associated with an individual customer record.

-or-

The identifier for a loyalty program that is associated with a unique Amperity ID.

M

Magento, Adobe Magento

See Adobe Commerce.

matchback

A matchback is a process where a mail file–associated with a purchase credit–is matched back to untracked sales.

Mailchimp

Mailchimp is a cloud-based marketing automation platform and an email marketing service that provides an API for integrating with third-party systems and a web UI for managing email contacts, templates, and lists.

manual courier

A manual courier is an active courier that is not configured to run as part of a courier group. A manual courier must be run directly from the Sources tab and does not have dependencies on downstream processing outcomes.

manual segment

A manual segment is an active segment that is not configured to run as part of an automated or orchestrated workflow. A manual segment is run directly from the Segments tab.

master data management (MDM)

Master data management (MDM) is an approach to data management that uses the Amperity ID to link all unique identifiers across all data sources to a common identifier to create a trusted single version of the truth for customer 360 data.

match category

A match category is a classifier that is applied by Amperity to an individual record-pair within a cluster of record-pairs. The match category is the result of this classification.

Match Category

Description

Exact

Amperity has the highest confidence that these records represent the same person because all profile data exactly matches.

Excellent

Amperity has near perfect confidence that these records belong to the same person, despite select types of profile data not matching.

High

Using deductive reasoning, Amperity has very high confidence that these records match, despite some profile data not matching.

Moderate

Amperity has moderate confidence that these records match, due to weak or fuzzy matches between highly unique customer attributes (email, phone, address).

Weak

Amperity lacks confidence, but if asked to guess, Amperity would assert these records do belong to the same individual, because they match on non-unique customer attributes (name, state, zip code).

No conflict

Amperity has high confidence that these records do NOT match, because core profile data is in conflict.

match type

The type of score being applied. Possible values are as follows: “scored”, “scored_transitive”, and “trivial_duplicate”.

MDM table

A master data management (MDM) table contains a record of each unique identifier every data source added to a single table that associates all unique IDs to the Amperity ID, after which it can be sent to downstream external systems as an export.

Merged_Customers

The Merged_Customers table is a data table that must be added to a customer 360 database. The purpose of the Merged_Customers table is to collect rows from the Unified_Coalesced table, and then collapse those into a single row per Amperity ID. Columns in the Merged_Customers table are made available to segmentation via the Customer_360 table.

Merged_Households

The Merged_Households table is a data table that must be added to a customer 360 database to support address-based householding. The purpose of the Merged_Households table is to apply address-based householding and address standardization to the output of the Merged_Customers table. The contents of this table contain the Household ID and can be used to improve the results of certain campaign types, such as direct mail.

metrics

Metrics are measurements that help you understand performance across two broad categories: customers and revenue. Customer metrics track how your customers change over time. Revenue metrics show how your customer spend money, including by category, by product, and by segment.

Metrics tab

The Metrics tab shows individual metrics as a card. Each card provides an overview of the current data that is available for a single metric. The list of metrics cards is configurable. Amperity provides a set of out-of-the-box cards, including new purchaser trends, purchase activity summary, revenue by historical lifetime spend, revenue summary, transacting purchasers, and weekly revenue trends.

Microsoft Advertising

Microsoft Advertising is a pay-per-click advertising platform that displays ads based on keywords in a user’s search query.

Microsoft Dataverse

Microsoft Dataverse is secure, cloud-based table storage for business applications, such as those belonging to the Microsoft Power Platform (Power BI, Power Apps, Power Automate, Power Virtual Agents, and Power Pages), Microsoft 365, Microsoft Azure, Dynamics 365, and other standalone applications.

Microsoft Dynamics

Microsoft Dynamics CRM is a customer relationship management (CRM) software application that is focused on the supporting sales, marketing, and services. It is part of the Microsoft Dynamics family of business applications.

Microsoft Power BI

Microsoft Power BI is a collection of software services, applications, and connectors that work together to turn unrelated sources of data into coherent, visually immersive, and interactive insights.

mobile interactions table

A mobile interactions table is represents the history a customer has with mobile applications that are associated with a brand, such as:

  • Opens

  • Page views

  • Items added to a cart

  • Items purchased

  • and so on

Monetate

Monetate offers solutions for customer segmentation and targeting, campaign testing and optimization, 1:1 personalization, audience insights, and customer recommendations.

Mosaic USA (Experian)

Mosaic USA is a consumer segmentation system from Experian that classifies US households into 71 unique types and 19 groups , providing a 360-degree view of consumer choices, lifestyles, preferences, and habits.

MSRP, manufacturer’s suggested retail price

The manufacturer’s suggested retail price (MSRP) is the price before shipping costs, taxes, and/or discounts have been applied. MSRP is sometimes referred to as the base price.

multi-brand

A flag that indicates if a customer has interacted with more than one brand.

multi-channel

Multi-purchase channel is a flag that indicates if a customer has purchased from more than one channel.

multi-channel campaign

A multi-channel marketing campaign blends marketing strategies across apps, websites, and devices to provide customers a personalized experience, but with consistent messages, offers, and products across all channels.

multi-channel customer

A multi-channel customer shows activity in at least two channels during the last measured time period.

multi-channel marketing

A multichannel marketing strategy blends distribution channels (retail storefronts, websites, and catalogues) with promotional channels for the purpose of making it easy for customers to purchase.

Multi-touch-attribution

A multi-touch attribution identifies each marketing channel that the customer has interacted while making a purchase.

N

National Change of Address, NCOA

National Change of Address (NCOA) is a secure dataset of approximately 160 million permanent change-of-address (COA) records consisting of the names and addresses of individuals, families, and businesses who have filed a change-of-address with the United States Postal Service (USPS).

newline-delimited JSON, NDJSON

Newline-delimited JSON (NDJSON) is a data format for structured data that defines the structure of JSON data using lines as separators. Each line in a NDJSON file is a valid JSON value.

Neustar

Neustar provides real-time information and analytics for the Internet, risk, digital performance, and defense, telecommunications, entertainment, and marketing industries, and also provides clearinghouse and directory services to the global communications and Internet industries.

notifications

A notification provides status updates within Amperity. Notifications are available for the Sources, Stitch, Customer 360, Queries, Segments, Campaigns, and Destinations tabs.

O

OAuth, Open Authorization

OAuth is an open standard for access delegation, commonly used to grant websites or applications access to information on other websites.

ODBC, Open Database Connectivity

Open Database Connectivity (ODBC) uses a driver-based system to define how any client may access any database.

omni-channel marketing

An omnichannel marketing strategy blends a multichannel marketing strategy with social media and mobile devices to provide customers a personalized experience, but with consistent messages, offers, and products across channels.

one-time campaign

A one-time campaign represents a specific campaign message that is configured to be sent only once.

one and done

One and done is a flag that indicates if a customer has made only one purchase.

Oracle Bronto (Deprecated)

Oracle Bronto was a cloud-based commerce marketing automation platform that focused on personalization.

Oracle Business Analytics

Oracle Business Analytics is a desktop BI tool for data exploration and visualization.

Oracle Data Cloud

Oracle Data Cloud is a set of solutions that enable agencies, brands, media platforms, and publishers to use data more effectively and drive better business outcomes.

Oracle Data Management Platform (DMP)

Oracle Data Management Platform (DMP), formerly known as BlueKai, is a cloud-based big data platform that enables marketing organizations to personalize online, offline, and mobile marketing campaigns with richer and more-actionable information about targeted audiences.

ORC, Optimized Row Columnar (ORC)

Optimized Row Columnar (ORC) is columnar-storage file format similar to Parquet. It is the file format that is the Stitch process outputs to the customer 360 database.

orchestration

An orchestration defines the relationship between query results and a destination, including the location to which those query results will be sent and the frequency at which the orchestration will be run.

orchestration group

An orchestration group is one (or more) orchestrations that are scheduled using a crontab file to define the schedule’s frequency. For example, an orchestration group can be scheduled to run at 8:30 AM every day of the week: 30 8 * * *.

order cancelled quantity

The total number of items in the original transaction that were cancelled.

order cancelled revenue

The total amount of revenue for all cancelled items in the transaction.

order cost

Order cost represents the total cost of goods sold (COGS) for a single transaction.

order date, order datetime

Order datetime is the date (and time) on which an order was placed.

order discount amount

Order discount amount is the total discount amount that is applied to the entire order.

order discount percent

Order discount percent is the percentage discount that is applied to the order value for the entire transaction, in addition to any item or unit-specific discount percentages.

order ID

An order ID is the unique identifier for the order and links together all of the items that were part of the same transaction. When an item has been returned or when an order has been cancelled, the order ID is the unique identifier for the original order, including the returned or cancelled items.

order list price

The total value for a transaction, as defined by the manufacturer’s suggested retail price (MSRP) for all units of this item.

order profit

Order profit is the amount of profit that is earned from a single transaction.

order quantity

Order quantity is the total number of individual items associated with the transaction.

order returned quantity

Order returned quantity is the total number of items in the original transaction that were returned.

order returned revenue

Order returned revenue total amount of revenue for all returned items in a transaction.

order revenue

The total amount of revenue for all items in a transaction after discounts are applied, ignoring returns and/or cancellations.

order shipping amount

The order shipping amount is the total cost of shipping all items in the same transaction.

order subtotal

An order subtotal is the amount for an order, before discounts are applied.

order tax amount

An order tax amount is the total amount of taxes that are associated with an entire order.

overcluster, overclustering

An overcluster, or a false positive, occurs when distinct records are incorrectly added to a cluster of records. Each overcluster affects the precision of identity resolution and should be investigated to understand why it occurred.

P

paid channels

A paid channel allows a brand to target advertising based on demographics, keywords, intent, and so on. A paid marketing campaign has the benefit of already knowing who the customers are and leads to better campaigns and better results.

pairwise comparison

A pairwise comparison is a process that compares, and then scores all of the possible connections between all records in a group of records.

pairwise connection

A pairwise connection is a pair of matching records within a block that have an initial score above threshold. Each pairwise connection within a block is scored, after which all pairwise connections that scored above threshold represent a single, unique individual.

The data explorer, showing pairwise connections.

See cluster graph and stitched records.

pairwise connection score

A score is assigned to every pairwise connection. The score is measured in two parts, separated by a period.

The first part–the record pair score–correlates to the match category, which is a machine learning classifier that is applied by Amperity to individual record pairs. The record pair score corresponds to the classification: 5 for exact matches, 4 for excellent matches, 3 for high matches, 2 for moderate matches, 1 for weak matches, and 0 for no conflicts.

The second part–the record pair strength–is used by Stitch to help determine the quality of the record pair score. This value appears in the Stitch report as a two decimal number. A record pair strength by itself is not a direct indicator of the quality of a pairwise connection score.

Panda Printing

Panda Printing creates high-quality print and promotional products for businesses of all sizes. If you can think of it, they can print it.

passthrough 360 table

A passthrough table adds a table to the customer 360 database using an existing table without making any changes to its schema.

pCLV

Predicted customer lifetime value represents the total value of all orders a customer is predicted to make if they return to make another purchase during the next 365 days.

pCLV index score

A predicted customer lifetime value (predicted CLV) index score is the comparison between customers associated with a given segment to customers associated with everything else.

For example, an index score of 5x for orders indicates that the average customer in a segment is expected to place 5x orders as compared to customers who are not in the segment.

PebblePost

PebblePost offers a full-stack solution that leverages scaled proprietary data, algorithmic optimization, advanced measurement and analytics, along with a print and mail logistics network.

people-based measurement

People-based measurement is an approach to marketing that measures incremental changes in customer behavior. For example, did a campaign increase customer lifetime value, how much value did this campaign add when part of a coordinated campaign, or did a campaign meet revenue goals and expectations?

Persado

Persado is an AI-driven content generation and decisioning platform that uses segment data to personalize words, concepts, and ideas to help your business grow purchases, loyalty registrations, automatic payment sign-ups, and self-service enrollments.

personally identifiable information (PII)

Personally identifiable information (PII) is any data that could potentially identify a specific individual. PII data includes details like names, addresses, email addresses, and other profile attributes, but can also include attributes like a loyalty number, customer relationship management (CRM) system identifiers, and foreign keys in customer data.

PGP, Pretty Good Privacy

Pretty Good Privacy (PGP) is an encryption program that provides cryptographic privacy and authentication for data communication by signing, encrypting, and decrypting data files and formats. Amperity supports PGP encryption.

phone

The phone number that is associated with an individual customer record. A customer record may be associated with multiple phone numbers.

-or-

The phone number that is associated with a unique Amperity ID. A customer record may be associated with multiple phone numbers.

PII semantic

A PII semantic assigns consistency to customer data to ensure that PII data is more easily discovered across many sets of data.

PII table

A PII table collects all of the fields from the Unified_Coalesced table that contain PII data, standardizes PII values (as necessary), and then allows querying against any combination of PII data while building segments in the Visual Segment Editor.

policy

A policy represents a set of actions that are available to a user when that policy is assigned to them.

postal

The zip code or postal code that is associated with the location of an individual customer record.

-or-

The zip code or postal code that is associated with the location of a unique Amperity ID.

Postman

Postman is a collaboration platform for API development that enables support for sending data to Amperity using the Streaming Ingest REST API.

PowerReviews

PowerReviews is an ecosystem of consumers, advocates and influencers who create and curate authentic content to accelerate sales.

precision

Precision is the relation between true positives, i.e. correct matches, and the total number of true positives and false positives, i.e. incorrect matches. A decrease in overclustering will increase precision.

predicted average order revenue

Predicted average order revenue represents the average value of each order a customer is predicted to make if they return to make another purchase during the next 365 days.

predicted customer lifetime value (pCLV)

See pCLV.

predicted order frequency

Predicted order frequency represents the number of orders a customer is predicted to make if they return to make another purchase during the next 365 days.

predicted probability of transaction

Predicted probability of transaction represents the likelihood that a customer will return to make another purchase during the next 365 days.

See p(return).

predictive attribute

Each predictive model outputs a collection of predictive attributes. A predictive attribute is represented by a column in a table that can be used during segmentation. Use these attributes to define segments that filter customers based on predicted behavior.

predictive models

A predictive model is a feature of AmpIQ that predict customer behavior, such as predicted customer lifetime value (predicted CLV), churn propensity, product affinity, and lifecycle events.

Predicted_CLV_Attributes

The Predicted_CLV_Attributes table contains information, for each individual Amperity ID, about predicted future spend and the probability of churn.

Presto SQL

See Trino.

p(return)

p(return) is a probabilistic score that predicts churn likelihood and represents how likely is it for an individual customer to purchase in the next year.

primary key, fk

A primary key is a column in a data table that uniquely identifies each row in a data source or data table.

product affinity

Product affinity is a predictive model that identifies which customers are likely to purchase by using a combination of historical purchase data and lookalike audiences. The predicted affinity model outputs a ranked list of customers with three recommended audience sizes.

product ID

The unique identifier for a product.

product base ID

The unique identifier for a group of products that share a set of baseline attributes. This identifier does not include any SKU details.

product catalog

A product catalog represents data that describes purchased items, including unique identifiers, stock keeping units (SKUs), and other product attributes.

product recommendation

A product recommendation predicts which products a given customer is most likely to purchase.

Profile Accelerator for Adobe

Profile Accelerator for Adobe provides comprehensive, accessible, and accurate customer profiles that are pre-configured for Adobe Marketing Cloud.

PSV, pipe-separated values

A pipe-separated values (PSV) file is a delimited text file that uses a pipe to separate values. A PSV file stores tabular data (numbers and text) in plain text. Each line of the file is a data record. Each record consists of one or more fields, separated by pipes. The use of the pipe as a field separator is the source of the name for this file format.

purchase brand

The brand for which a transaction was made.

purchase channel

A purchase channel is the channel from which a transaction was made. For example: in-store or online.

purchase curve

A purchase curve represents the relationship between purchases and audience size. A purchase curve quickly adds customers, but flattens as audience sizes grow larger.

The purchase curve and audience sizes.
purchases table

A purchases table is a type of interaction record that contains a history of purchases made by customers, including details about goods and/or services that were ordered, such as:

  • Quantity

  • Itemized order value

  • Date of purchase

  • Purchase location

Q

QA database table

A QA database table is output by Stitch for use with the Stitch QA process.

QA tables include:

  • Detailed_Examples

  • Stitch_Blocking_Keys

  • Stitch_Scores

  • Unified_Changes_Clusters

  • Unified_Changes_PKS

  • Unified_Coalesced

  • Unified_Preprocessed_Raw

  • Unified_Scores

Quad

Quad is a marketing solutions provider that combines a strong print foundation with an integrated marketing platform to help marketers improve spend efficiency across offline and online media channels.

Qlik Sense

Qlik Sense is a cloud-based, self-service BI tool that allows you to explore data in any direction, extract data by making selections, and immediately viewing filtered data and associated items.

Queries (tab)

The Queries tab provides the interface that allows users to build, define, and manage a list of queries.

query

A query is SQL that is run from the Queries tab against database tables in the Customer 360 tab. A query returns a refined and filtered subset of useful customer data.

query folder

A folder helps you organize the list of segments in the Queries tab. Up to three levels may be added.

R

raw data

Amperity loads data in raw form preserving the original schema as well as identifiers (or keys) from all systems. Examples of keys include loyalty numbers, ecommerce IDs, cookies, or any other identifier mapped to a customer.

RCFile, Record Columnar File

Record Columnar File (RCFile) is a data placement structure that determines how to store relational tables on computer clusters using the MapReduce framework.

recall

Recall is the relation between true positives to the total number of true positives and false negatives, i.e. incorrect splits. An increase in underclustering will decrease recall.

recipient group

A recipient group is the percentage of an audience who will receive communications related to a marketing campaign. A campaign may have more than one recipient group.

record count

Record count is the total number of records (rows) within a given data set. This could include a source feed or Amperity table, or, it could describe the number of records that result from a probabilistic or deterministic connection between two Amperity tables.

record pair

The Stitch process takes customer tables, extracts the customer records and compares pairs of records together using over 40 different machine learning models. Each record pair is given a score, which represents the strength of the match.

record pair score

The record pair score correlates to the match category, which is a classifier that is applied by Amperity to individual record pairs. The record pair score corresponds to the classification: 5 for exact matches, 4 for excellent matches, 3 for high matches, 2 for moderate matches, 1 for weak matches, and 0 for no matches.

record pair strength

The record pair strength represents the strength of the record pair score. It is a two digit number. For example: .31 is a lower strength and .93 is a very high strength.

record type

A record is a row of data. A record is associated with one of the following record types: customer record or interaction record.

records

Records are individual rows of data, synonymous to individual rows in relational databases. Each record is made up of a collection of columns that store values like first name, email address, last purchase, and so on. In Amperity terms, a record is a row of data that refers to a single customer, a visitor, a transaction, an interaction, a behavior, and so on.

records ingested

Records ingested is a static value that tracks the total number of non-duplicate records that were added to Amperity during the previous week. Each record is an individual row of data from any customer data source, including both customer records and interaction records.

records sent

Records sent is a statistic that tracks the total number of records sent by Amperity to all destinations during the previous week. Each record is an individual row of data that was returned by a segment.

rectangular data

Rectangular data is a structured dataset with clearly defined rows and columns, such as a SQL table output to an Amazon S3 bucket or Azure storage.

recurring campaign

A recurring campaign is sent automatically based on a state change, such as an accepted return, a change to a loyalty program, or an alert based on credit card status, with a predefined campaign message and cadence to a list of recipients.

request datasource

An identifier that represents the first grouping of records done by Stitch. This grouping is based on identical semantic patterns.

request email

The email address for the customer. This is used to find their records in Amperity.

request id

The tracking identifier for the customer’s compliance workflow. This ID should be provided by the customer and must be unique.

request strategy

The compliance request strategy used for matching exact email data, semantic tags, and Amperity IDs.

request type

The type of compliance request. Possible values: delete or data subject access request (DSAR).

resource group

A resource group represents one or more databases in the Customer 360 tab. Users with access to a resource group can build queries and segments against that database and can send data from that database to downstream workflows.

retention rate

Retention rate is the percentage of customers who continue to purchase your products and services over a given time period.

RFM (customer value)

RFM is a method used to analyze customer value that represents three dimensions:

  • Recency. How recently did the customer purchase?

  • Frequency. How often does the customer purchase?

  • Monetary. How much does the customer spend?

A score for recency, frequency, and monetary is assigned on a scale of 0-9, and is then aggregated into a combined RFM score that is assigned on a scale of 0-999. Maximum scores represent preferred behaviors.

RR Donnelly

RR Donnelly provides marketing and business communications, commercial printing, and related services.

RSA (cryptography)

RSA is a cryptographic system that may be used to generate public and private key pairs for the purpose of securing data transmission to and from Amperity via SFTP. The public key is used to encrypt data. The private key is based on a very large prime number and is used to decrypt data.

S

S3

See Amazon S3.

Sailthru

Sailthru is a cross-channel marketing platform built to deliver personalized experiences for consumers across email, web, and mobile channels.

Salesforce Commerce Cloud

Salesforce Commerce Cloud​ is a multi-tenant, cloud-based commerce platform that empowers brands to create intelligent, unified buying experiences across all channels.

Salesforce Interaction Studio

Salesforce Interaction Studio is a real-time personalization and interaction management solution.

Salesforce Marketing Cloud

Salesforce Marketing Cloud is a provider of digital marketing automation and analytics software and services.

Salesforce Pardot

Pardot is a marketing automation solution that is focused on helping your company engage buyers, grow relationships, and close deals.

Salesforce Sales Cloud

SalesForce Sales Cloud brings customer information together into an integrated platform, and then provides access to thousands of applications through the AppExchange.

SAML

Security Assertion Markup Language (SAML) is a protocol used to share authentication and authorization information between an identity provider (IDP) and security provider (SP).

sandbox

A sandbox is a full copy of your tenant from which a user who is assigned to the DataGrid Administrator role can make changes. For example, adding a data source and feed, adding a destination, and managing workflows with SLA status. After these changes are validated, they can be safely promoted to your production tenant.

SAP Business Objects

SAP Business Objects is a centralized BI tool for data reporting, visualization, and sharing and is the on-premise BI layer for SAP’s Business Technology Platform.

SAS Cloud Analytic Services

SAS Cloud Analytic Services is a cloud-based run-time environment for data management and analytics with SAS.

saved query

A saved query is a SQL statement that may be applied to data prior to loading it to a domain table. A saved query is defined using Spark SQL syntax.

second order date, second order datetime

Second order date is the date on which the second order was placed by a customer.

second order ID

Second order ID is the order ID for the second order that was made by a customer.

second order revenue

Second order revenue is the total revenue that is associated with a customer’s second order, ignoring returned items and/or cancelled items.

second order total items

Second order total items is the number of items that were purchased as part of a customer’s second order, ignoring returned items and/or cancelled items.

second-party data

Second-party data is first-party data that is shared by the company that owns the first-party data (and has been given the right to share this data) with another brand or company. Second-party data can be a good approach to audience expansion, especially when both sets of data closely fit a similar customer profile. Governance, legal concerns, and privacy laws are more likely to affect how second-party data may be shared.

Security Assertion Markup Language

See SAML.

segment

A segment defines a specific attribute profile that can be used to initiate marketing campaigns using the list of customers that match that profile.

segment folder

A folder helps you organize the list of segments in the Segments tab. Up to three levels may be added.

Segments (tab)

The Segments tab provides the interface that allows users to build, define, and manage a list of segments.

The Segments tab in Amperity.
semantic

A semantic is a way to apply a common understanding to individual points of data across multiple data sources, even when data sources have different schemas, naming conventions, and levels of data quality. Assigning a semantic tag to individual columns in customer data is an important prerequisite to the Stitch process.

semantic tag

A semantic tag standardizes profile (PII), transaction, and other important customer details across all columns in all data tables.

semantic type

A semantic type is directly associated with data values that appear in customer data tables. Semantic types exist for columns that contain values like first names, email addresses, home addresses, cities, phone numbers, and so on. Amperity has many built-in semantic types, including groupings for personally identifiable information (PII), transactions, itemized transactions, and other consumer behaviors.

semi-structured data

Semi-structured data does not reside in rows and columns, but still contains elements that separates the data into a clear hierarchy. Amperity supports many types of semi-structured data formats, including CSV, TSV, Parquet, ORC, and JSON.

separation key, sk

A separation key (sk) is used for deterministic unmatching of records.

service provider, SP

A service provider (SP) is a system that accepts authentication responses from an identity provider (IDP), and then provides access to a service. Amperity is the service provider in this context.

SFTP

The SSH File Transfer Protocol (also Secure File Transfer Protocol, or SFTP) is a network protocol that provides file access, file transfer, and file management over any reliable data stream.

shared access signature (SAS)

A shared access signature (SAS) is a URI that grants restricted access rights to Azure Storage resources.

Shopify

Shopify is an ecommerce platform for online stores and retail point-of-sale systems that consolidates shopping, payments, marketing, shipping, and customer engagement tools into a single unified experience.

single sign-on

See SSO.

single-channel customer

A single-channel customer shows activity in a single channel during the last measured time period.

site interactions table

A site interactions table represents the history a customer has with a brand via websites, such as:

  • Site visits

  • Page views

  • Items added to a cart

  • Items purchased

  • Conversions

  • and so on

SKU, stock keeping unit

A stock keeping unit (SKU) is an identifier that captures all of the unique details of any individual product, including specific attributes that differentiate by color, size, material, and so on.

SLA, Service Level Agreement

A service level agreement (SLA) is condition in Amperity that guarantees that a process will run successfully. In the rare case where a process does not run successfully it is treated with the highest level of urgency by Amperity on-call systems and support engineers.

slice

Do not use the term “slice” when referring to data sent from Amperity to external locations. Use segment or database export instead.

See segment.

SMG, Service Management Group

Service Management Group (SMG) helps brands boost loyalty and improve business performance by combining technology and differentiated, strategic services to collect, analyze, and share feedback and behavioral data.

Snapchat

Snapchat is a social media platform that allows users to access pictures and messages for a short time. Snapchat provides a self-serve platform from which brands can manage ads and advertising campaigns.

SnapPass

SnapPass allows secrets to be shared in a secure, ephemeral way. Input a single or multi-line secret, along with an expiration time, and then generate a one-time use URL that may be shared with anyone. Amperity uses SnapPass for sharing credentials to systems with customers.

snapshot email

A snapshot email is sent on a weekly basis from Amperity to provide a quick overview of your customer data.

Snappy

Snappy is a compression library aims for high speeds and reasonable compression instead of maximum compression. Files that are compressed with snappy tend to be larger, but the process to compress (and decompress them) is significantly faster.

Snowflake

Snowflake is an analytic data warehouse that is fast, easy to use, and flexible. Snowflake uses a SQL database engine that is designed for the cloud. Snowflake can provide tables as a data source to Amperity.

SoundCommerce

SoundCommerce is a data platform for consumer brands and retailers that spans marketing, merchandising, supply, fulfillment, delivery, and customer service.

source database table

A source database table is a domain table that is created by a feed in the Sources tab. These tables are used as inputs for standard database tables when semantic tags are applied and may be passed through to any database.

source ID

Source IDs are unique identifiers that exist within each of your upstream data sources, including customer databases, transactional databases, or behavioral data assets. One way that Amperity measures Stitch quality is by comparing the number of Source IDs to the final number of Amperity IDs, showing the degree to with we matched and reconciled your data, forming comprehensive customer views.

Sources tab

The Sources tab provides the interface for defining how customer data sources send data to Amperity, and then how Amperity will prepare this data for processing.

The Sources tab in Amperity.
Spark SQL

Spark SQL is a high performance SQL query engine that is used by Amperity to ingest data, create domain tables, and extend the outcome of the Stitch process in your customer 360 database.

sparse customer record

A sparse customer record is an incomplete set of personally identifiable information (PII) that can still be used to identify a unique individual when compared against rich sets of PII data. For example, a data set with sparse customer records may only contain last names, the last 4 digits of credit cards, and store zip codes associated with the transaction.

Square

Square is a platform for managing online stores, payments, and transactions.

SQL 360 table

A SQL table adds a table to the customer 360 database using Spark SQL and a SELECT statement to define the table’s schema.

SQL Query Editor

The SQL Query Editor is the user interface for a full SQL query engine based on Presto SQL that interacts with customer database tables in Amperity. The SQL Query Editor relies primarily on using the SELECT statement, along with common table expressions, joins, functions, and other components of Presto SQL to build and design advanced queries.

The SQL Queries Editor located within the Segments tab in Amperity.
SQL Segment Editor

The SQL Segment Editor is an optional interface that allows you to build an attribute profile with SQL. Start with a SELECT statement that returns the Amperity ID, and then apply a series of WHERE statements to define one (or more) attribute groups that match specific conditions and values.

SQL Workbench

SQL Workbench is a cross-platform SQL query tool that runs on any operating system.

SSO

Single sign-on allows a user to use a single ID and password to access multiple applications.

SSO domain

An SSO domain is a defined customer domain that maps to the identity provider used for single sign-on authentication to Amperity. For example: acme.com for all users of the ACME tenant.

standard database table

A standard database table forms the foundation of your databases. Standard tables are output by Stitch and provide unified data structures for all data sources to which semantic tags were applied.

Standard tables include:

  • Campaign_Recipients

  • Customer_360

  • Customer_Attributes

  • Email_Engagement_Attributes

  • Email_Engagement_Summary

  • Lifecycle_Events

  • Merged_Customers

  • Transaction_Attributes

  • Transaction_Attributes_Extended

  • Unified_Customer

  • Unified_Coalesced

  • Unified_Email_Events

  • Unified_Itemized_Transactions

  • Unified_Product_Catalog

  • Unified_Scores

  • Unified_Transactions

along with tables that are associated with predictive modeling and AmpIQ, such as Predicted_CLV_Attributes.

state

The state or province that is associated with the location of an individual customer record.

-or-

The state or province that is associated with the location of a unique Amperity ID.

static sources table

A static sources table is used when a large volume of legacy data cannot be made available to Amperity on a regular basis and is expected to remain unchanged over time. For example, a database that contains older data stored in raw tables. Data in a static source is made available to Amperity in its raw state.

Stitch

Stitch uses patented algorithms to evaluate massive volumes of data to discover the hidden connections in your customer records that identify unique individuals. Stitch outputs a unified collection of data that assigns a unique identifier to each unique individual that is discovered within your customer records.

Stitch_Blocking_Keys

The Stitch_Blocking_Keys table contains all of the blocking keys that are associated with the Stitch process.

Stitch_BlocklistValues

The Stitch_BlocklistValues table is added to the Stitch QA database as a passthrough table for values identified by the bad-values blocklist, which were excluded from the Stitch process.

Stitch_Scores

The Stitch_Scores table contains all scores generated by Stitch, including scores that are not associated with an Amperity ID. Use this table to help understand why records were not associated with an Amperity ID.

Stitch tab

The Stitch tab shows detailed results of the Stitch process, which takes customer data, exctracts customer records, and then compares record pairs using over 40 different machine learning models. Each record pair is given a score, which represents the strength of the match. Amperity creates clusters of records based on the connection between pairs, and then gives each cluster a unique Amperity ID.

The Stitch tab in Amperity.
Stitch QA

Stitch QA is a process that monitors the quality of Stitch results. Stitch QA has two components: a database and a set of queries. The results of these queries are analyzed to help identify values that should be labeled or blocklisted and discover situations where the results of the Stitch process require tuning to match your tenant’s data set.

stitch-labels

Stitch labels identify when a single customer record was incorrectly merged together (overclustered) or when two customer records were incorrectly split apart (underclustered).

stitched records

A stitched record is a unique output of the Stitch process that associates the Amperity ID to an individual customer record.

The data explorer, showing stitched records.

See cluster graph and pairwise connection.

stitched table

A stitched table is a database table that is output by the Stitch process. The list of stitched tables created when PII semantics are present include Unified_Coalesced, Unified_Scores, and Merged_Customers. The list of stitched tables created when transaction semantics are present include Unified_Transactions and Transaction_Attributes.

StreamRT

StreamRT is the realtime event routing layer for DataGrid. StreamRT enriches your customer profiles with realtime data, and then enables realtime connections with the systems that interact directly with your customers.

Streaming Ingest REST API

The Streaming Ingest REST API is a tenant-specific API that supports real-time streaming of data from external systems to Amperity.

streaming JSON

Streaming JSON is a way to send increments of data using NDJSON formatting within each increment. Each line in a NDJSON file is a valid JSON value.

structured data

Structured data adheres to a data model or schema. Amperity requires structured data to be in a table format with columns and rows. Amperity supports many types of structured data formats, including database tables in various formats, Snowflake tables via REST API, and other forms of rectangular data.

sub-audience

A sub-audience is a list of customers who will receive a specific type of communication related to a marketing campaign.

sum of discount amounts

The sum of discount amounts is the total of all discount amounts that were applied to each item within a transaction.

sum of itemized revenue

The sum of itemized revenue for the original order, not including returns and/or cancellations.

supersized cluster

A supersized cluster is a cluster of records that is discovered during the Stitch process that has more than ~100 matching records. When a cluster has more than ~100 records, this is more often an indicator for abandonment of continued analysis than one of an indicator of interest for further analysis.

suppression table

A suppression table associates incoming requests to

  • Delete records

  • Respond to data subject access requests (DSAR)

with user records in the customer 360 database. User records in a suppression table are prevented from being usable in downstream workflows until all matching PII data is deleted from Amperity.

surname

The last name that is associated with an individual customer record.

-or-

The last name that is associated with a unique Amperity ID.

T

table

See data table.

Tableau

Tableau is a visual analytics platform that empowers people and organizations to make the most of their data. Tableau connects to a data source, and then queries that data directly.

taxonomy

A taxonomy defines a formal structure for classes of objects. For example, a product taxonomy defines product categories, and then organizes individual product types within each category (or subcategory).

temporary table

A temporary table is a table that exists within the customer 360 database for the purpose of storing intermediate results, and then making those results available to other tables in the customer 360 database.

tenant

A tenant represents everything within the Amperity platform as provided to a single customer, including:

  • Processes that pull data from customer data sources

  • Processes that send data to destinations

  • Sandboxes

A tenant runs fully within an environment located in Amazon AWS or Azure.

third-party data

Third-party data is a broad category that includes, but is not limited to, data that is collected by an entity, and is then sold or distributed to companies that want to leverage that data. The usefulness of third-party data is often restricted by a combination of lower data quality, privacy laws, and general commercial interest.

Important

Amperity does not sell third-party data or mark it up in any way.

Throtle

Throtle provides brands and marketers a complete view of their customers and enables accurate targeting across all devices and channels, including IP addresses, connected TVs, cookieless identities, and mobile advertising IDs (MAIDs), such as the Identifier for Advertising (IDFA) from Apple and the Google Advertising ID (GAID) from Google.

TikTok Ads

TikTok Ads is an advertising platform that allows marketers to use a combination of first- and third-party data to build custom audiences, and then advertise to those audiences on TikTok.

title

The title that precedes a full name that is associated with an individual customer record. For example: Mr., Mrs, and Dr.

-or-

The title that precedes a full name that is associated with a unique Amperity ID. For example: Mr., Mrs, and Dr.

token

A token is a time-limited bundle of authentication and authorization information that is used when accessing Amperity.

total discount amount

The total discount amount for the entire transaction, including item-specific discounts. A total discount amount should equal the order-level discount amount plus the sum of all item-level discount amounts for the same order ID.

trade-area append (TAA)

Trade-area append (TAA) is a process that uses names, zipcodes, and credit card hashes to identify groups of users for online and offline marketing campaigns.

transaction

A transaction represents data that captures orders at the order level, with a summary row per order.

transaction semantic

A transactions semantic is a way to identify brands, channels, stores, orders, products, quantities, per-item costs, total costs, and so on. Use transactions semantics when a data source contains one row per order.

Transaction_Attributes

The Transaction_Attributes table describes the behavior and history of customer transactions, such as purchase history and frequency, brand preferences, order history, time between purchases, and so on. This table is added automatically when semantics related to transactions are applied to interaction records.

Transaction_Attributes_Extended

The Transaction_Attributes_Extended table contains additional attributes for customer flags, customer orders, data differences, time period rollups, and RFM scores, many of which are represented by duration (days, months, years, lifetime), order position (first, second, last), frequency, and revenue.

transitive connection

A transitive connection exists between individual records when any two records share a strong match to an intermediate record, but do not have a strong match to each other. For example: record 1 matches record 2, record 3 matches record 2, neither records 1 or 3 match to each other, but they have a transitive connection because both match record 2.

Trino

Trino is a distributed SQL query engine that is designed to efficiently query vast amounts of data using distributed queries. Trino is used by the Amperity SQL segment editors to define segments, which are SQL queries that return data from stitched data tables. Formerly known as Presto SQL.

trivial duplicate

A trivial duplicate is a set of nearly-identical records that are identified by Stitch early in the identity resolution process. Only one of the nearly-identical records is passed to downstream Stitch processes.

TSV, tab-separated value

A tab-separated values (TSV) file is a delimited text file that uses a tab to separate values. A TSV file stores tabular data (numbers and text) in plain text. Each line of the file is a data record. Each record consists of one or more fields, separated by tabs. The use of the tab as a field separator is the source of the name for this file format.

U

undercluster

An undercluster, or a false negative, occurs when distinct records are incorrectly split from a cluster of records. Each undercluster affects the precision of identity resolution and should be investigated to understand why it occurred.

Unified ID 2.0

Unified ID 2.0 is an open source identity framework that benefits consumers by allowing them to control how their data is shared with publishers and advertisers.

Unified_Changes_Clusters

The Unified_Changes_Clusters table contains a history of changes to cluster graphs, relative to the previous Stitch run.

Unified_Changes_PKS

The Unified_Changes_PKS table contains a history of changes to primary keys, relative to the previous Stitch run.

Unified_Coalesced

The Unified_Coalesced table contains every row from every stitched table that is associated with an Amperity ID, with defined semantics coalesced into a single column. A unique Amperity ID may appear in more than one row. Columns are added to this table when semantic values match values in the bad-values blocklist.

Unified Compliance

The Unified_Compliance table consists of all the tables tagged with the email, request-id, request-strategy, and request-type semantic tags, which are used to suppress data per customer deletion requests or a data subject access request (DSAR).

Unified_Customer

The Unified_Customer table contains every row of every stitched table with all semantics coalesced into a single column, with two exceptions that are removed from the table: values from a bad-values blocklist and values associated with supersized clusters. A unique Amperity ID may appear in more than one row.

Unified_Email_Events

The Unified_Email_Events table contains individual email event information, such as sends, opens, clicks, opt-in and opt-out preferences, bounces, and conversions. This table is added as Stitch output when email event semantic tags are applied to data sources.

Unified_Itemized_Transactions

The Unified_Itemized_Transactions table contains every row of every stitched table with all transactional data summarized to the item level, and then coalesced into a single column for each unique combination of order ID and product ID. The order ID is associated with an Amperity ID.

Unified_Preprocessed_Raw

The Unified_Preprocessed_Raw table is an output of Stitch. Use this table as part of Stitch QA to view data exactly how values were used by Stitch after preprocessing and blocklisting. This table contains one row for each trivial duplicate. Only the semantics used by Stitch are shown. Semantic values represent pre-processing values and/or bad values that were removed (and replaced by NULL).

Unified_Product_Catalog

The Unified_Product_Catalog table contains a row for every product SKUs, broken down by product ID and base ID, along with list prices and costs.

Unified_Scores

The Unified_Scores table records all of the pairwise comparison scores and match categories for all groups of records, and then for each group of records all of the pairwise scores that are present between records within that group.

Unified_Transactions

The Unified_Transactions table contains one row for each unique transaction record, with each order ID associated to an Amperity ID.

uniqueness

Uniqueness divides the number of unique values–cardinality–by the number of rows in a table.

unit cost

Unit cost is the cost to produce a single unit of one item.

unit discount amount

Unit discount amount is the discount amount that is applied to a single unit of one item.

unit list price

The manufacturer’s suggested retail price (MSRP) for a single unit of an item.

unit profit

Unit profit represents the amount of profit that is earned when a single unit of an item is sold.

unit revenue

The total revenue for a single unit of an item. When an item has been returned or the order has been cancelled, the total revenue for a single unit of an item that was returned and/or cancelled.

unit subtotal

A unit subtotal is the amount for a single unit of one item, before discounts have been applied.

unit tax amount

A unit tax amount is the total amount of taxes that are associated with a single unit.

units per transaction

See UPT.

unstructured data

Unstructured data is data that does not adhere to a particular data model or definition, such as text or binary data.

Note

Amperity (generally) does not support unstructured data as a data source because it cannot be easily mapped to other data sources. As such, it is less useful for identifying common patterns and trends across many data sets.

UpSet Plot

An UpSet plot highlights the connectivity between data sources. Similar to a Venn diagram, an UpSet Plot helps you to visualize the intersections of different data sources. Different from a Venn diagram, an UpSet Plot contains additional visualizations that are broken down by data source. Each bar in the UpSet Plot shows the number of Amperity IDs that share records from the included sources. Therefore, each bar represents a mutually exclusive set of Amperity IDs. If you were to add up all the Amperity IDs for each bar, you’d see the total number of Amperity IDs for the selected data source.

An UpSet plot chart, located within the Stitch tab in Amperity.
UPT, units per transaction

The average number of items purchased in any given transaction.

upstream

Upstream refers to the part of a workflow that occurs before the current step, often with some type of dependency. For example, a data table is only updated when new customer data is provided.

Urban Airship

See Airship.

UUID, universally unique identifier

A universally unique identifier (UUID) is a 128-bit number that uniquely identifies information in a computer system. It is represented by 32 digits and four characters (hyphens) displayed in five groups separated by hyphens in the form of 8-4-4-4-12. For example: 123e4567-e89b-12d3-a456-426614174000.

V

vertex, vertices

A vertex is a data item in a graph database. A vertex is sometimes referred to as a node.

Vibes

Vibes is a mobile marketing company that provides a variety of mobile marketing products and services, such as text message marketing (SMS/MMS), mobile wallet marketing, push notifications, and mobile web experiences.

VinoVisit

VinoVisit.com is a winery reservation and guest management software platform that helps customers book real-time winery reservations and multi-day itineraries.

vinSuite

vinSuite is wine software designed to manage direct-to-consumer wine business, website content management, e-commerce, wine club, and point-of-sale.

Visual Query Editor

The Visual Query Editor is a user interface that allows creating queries using Presto SQL without having to write SQL. The Visual Query Editor is combination of filters for WHERE and AND clauses organized as a series of configurable drop-down menus and pick-lists that build SQL queries that run against tables in the customer 360 database.

The Visual Query Editor located within the Segments tab in Amperity.
Visual Segment Editor

The Visual Segment Editor is the user interface for building segments in AmpIQ. This editor uses a series of drop-downs, picklists, conditions, and values to define an attribute profile. Refresh segment insights to see how many customers match the profile, and then active it as a segment.

The Visual Segments Editor located within the Segments tab in Amperity.

W

win-back percentage

A win-back percentage represents the percentage of customers who were inactive, but returned and made a purchase within the measurement window.

Wyng

Wyng is a marketing-software platform that enables advertising agencies and large brands to build and run consumer-engagement and loyalty campaigns, often featuring user-generated content shared through social media.

X

XML, eXtensible Markup Language

eXtensible Markup Language (XML) is supported data format for customer data sources.

Y

Z

Zendesk

Zendesk is a service-first CRM company that builds software designed to improve customer relationships by providing support, chat, and community services that span channels.

Numbers

4-5-4 calendar

A 4-5-4 calendar divides years into months using a 4 weeks - 5 weeks - 4 weeks pattern. Each week starts on a Sunday and ends on a Saturday. Each quarter has the same number of days. A 4-5-4 calendar can be useful for comparing like days for sales reporting purposes.