File Requirements for Matched Lists or Lookalike Generation
This document defines file formats that are supported for generating matched lists or lookalike audiences. For the same, files will need to be formatted as .csv (optionally gzipped) with headers designating each column.
Column headers are limited to a maximum of 256 characters
Column values are limited to a maximum of 1024 characters
Column headers are case-insensitive
Max file size: 1GB (drag and drop) and 10GB (SFTP)
For users who are generating matched lists with CSV files and also using Excel to view/edit the CSV files, please follow the instructions linked here.
Single quote is not allowed in the filename.
Supported Column Details
Column | Description | Examples |
---|---|---|
email* email1 email2 email3 email4 | Zeta accepts email addresses in US and international formats. Zeta accepts up to 5 email addresses as separate attributes. Up to 5 clear text email addresses may be considered in a single input record, in the matching waterfall order of: email, email1, email2, email3, email4. E.g., If there is no match for “email”, the next field considered for matching is “email1”, and so on. | |
email_md5 | Zeta accepts email addresses in US and international formats in md5 hash format. |
|
email_sha256 | Zeta accepts email addresses in US and international formats in sha256 hash format. |
|
zync_cookie | Cookie ID for a visitor from Zeta integrated site. |
|
sizmek_cookie | Cookie ID surfaced in the bitstream of our Demand Side Platform (DSP). |
|
first_name* | Zeta accepts first name and first name initials, with or without accents. Initials can be provided with or without a period. |
|
last_name* | Zeta accepts full last names with or without accents. |
|
address_line_1* | First line of the postal address |
|
address_line_2* | Second line of the postal address |
|
zip* | Zeta accepts US and international zip and postal codes. US zip codes may include a 4-digit extension as long as they are separated by a hyphen. The extension is not required and will not further improve the match rate. |
|
city* | Zeta accepts full city names as they normally appear. |
|
state* | Zeta accepts full names of US and international states and provinces, as well as the abbreviated versions of US states. |
|
maid | Mobile Advertising Identifier (MAID) is used to target Mobile phones and tablets using Android as GAID (Google Advertising ID) and IOS as IDFA (Identifier for Advertising). |
|
phone phone1 phone2 phone3 phone4 | Zeta accepts 10-digit phone numbers. Phone numbers must include a US country code to be used for matching; 1 must precede a phone number in the United States. The Matched List won’t accept any non-numerical characters such as spaces, periods, etc. Zeta accepts up to 5 phones as separate attributes. Up to 5 phones may be considered in a single input record, in the matching waterfall order of: phone, phone1, phone2, phone3, phone4. E.g., If there is no match for “phone”, the next field considered for matching is “phone1”, and so on. |
|
When uploading a file for Lookalike, it is necessary to have the first header of the file as "Email". However, it is not mandatory to have any values in the field.
The name and postal match can only be performed if all the required/mandatory fields (marked with a *) have values inserted:
email
first_name
last_name
address_line_1
address_line_2
and at least one city, state, or zip
If the aforementioned requirements are not met in the input file, the (name + postal) stages match process will yield 0 results.
The address_line_2 field is mandatory but nullable. If you do not have a value, you can leave it empty.
If the matched list is created using email_md5, the mandatory fields mentioned above are not necessary.
Unless the fields matched upon are (name + postal), only one field needs to be in the header of a matched list seed file. However, a seed file’s header may contain multiple fields. Even so, only one populated field per record is required to yield results, with the exception of the record only containing (name+postal).
For file-based uploads containing hashed email data intended to generate a matched list, the data is not stored in Snowflake. Instead, it is securely uploaded to an S3 location, where it is processed by our servers. The physical location of the S3 storage aligns with our secure data infrastructure protocols.
Matching Logic of Matched Lists
2,000 records are required as a minimum to generate a matched list.
postal_fullname_address
phone
email_md5 | email*
email_sha256
ifa_ifa
maid_maid
cookie_disqus
cookie_zync
cookie_sizmek
cookie_appnexus
bsin
where postal_fullname_address
refers to the following fields:
postal_fullname_address = [
"postal_firstname",
"postal_lastname",
"postal_addressline1",
"postal_addressline2",
"postal_city",
"postal_state",
"postal_zip",
]
Example File