What is this? A data table that classifies email domains into Personal/Business/Freemail/Junkmail.
Why was this created? The product org recognized that a user's email domain affects their probability of conversion. Segmenting new namespaces and users by email domain is a crucial way to study and identify high-value customers.
Marketing also segments by email domain for communications and scores new leads with a business email differently from those with personal or generic email domains.
Previously, email domain classifications lived in a few Sisense snippets and sheets, but this approach was not performant and did not act as an SSOT (Single Source of Truth) for classifying email domains. The Email Domain Classification table solves both of these issues.
The email domain classification table can be accessed in Sisense by querying the table PROD.WORKSPACE_MARKETING.EMAIL_DOMAIN_CLASSIFICATION.
To make it easier for team members to use, the email domain classification has also been surfaced in PROD.COMMON.DIM_USER, the Dimensional Model table that contains information about GitLab.com users. The corresponding fields are:
An occurrence of a NULL value in the above fields indicates a business email domain.
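The NULL rule above can be illustrated with a minimal Python sketch. The function name `effective_classification`, the sample rows, and the lowercasing of the domain are assumptions made for illustration; they are not the actual DIM_USER field names or the contents of the table.

```python
from typing import Dict, Optional

# Illustrative rows only (hypothetical domains); the real table holds the
# maintained list of non-business domains.
SAMPLE_CLASSIFICATIONS: Dict[str, str] = {
    "freemail.example": "personal - freemail",
    "junkmail.example": "personal - junkmail",
}


def effective_classification(email: str, table: Dict[str, str]) -> str:
    """Return the classification for an email address.

    A domain that is missing from the table (NULL after a lookup)
    is treated as a business domain.
    """
    # Take everything after the last "@" and normalize case (assumption).
    domain = email.rsplit("@", 1)[-1].strip().lower()
    classification: Optional[str] = table.get(domain)
    return classification if classification is not None else "business"
```

For example, under these sample rows `effective_classification("jane@company.example", SAMPLE_CLASSIFICATIONS)` falls through to `"business"` because the domain has no entry in the table.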
New email domains emerge every day, so a way to update this information is crucial. To upload the Email Domain Classification data table to our data warehouse, we rely on the GCP Driveload process. The following steps can be used to update the Email Domain Classification data table:
Replace the old email_domain_classification.csv with the newest updated list. Make sure:
a. The new CSV file only has two columns: DOMAIN and CLASSIFICATION. Keep the column names uppercase.
b. There are no duplicate email domains.
c. The CLASSIFICATION column is not empty for any email domain. Preferably, the value of CLASSIFICATION should be one of the following: personal, personal - junkmail, or personal - freemail.
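Checks a to c can be run locally before replacing the file. The following is a minimal sketch and not part of the Driveload process itself; the function name `validate_classification_csv`, the case-insensitive duplicate comparison, and the choice to report non-preferred values as warnings are all assumptions.

```python
import csv
import io
from typing import List, TextIO

# Preferred CLASSIFICATION values, as listed in check c.
PREFERRED = {"personal", "personal - junkmail", "personal - freemail"}


def validate_classification_csv(handle: TextIO) -> List[str]:
    """Return a list of problems found in the CSV; an empty list means it passed."""
    problems: List[str] = []
    reader = csv.DictReader(handle)

    # Check a: exactly two columns, named DOMAIN and CLASSIFICATION (uppercase).
    if sorted(reader.fieldnames or []) != ["CLASSIFICATION", "DOMAIN"]:
        problems.append(f"error: unexpected header {reader.fieldnames}")
        return problems

    seen = set()
    for row in reader:
        line = reader.line_num
        domain = (row["DOMAIN"] or "").strip().lower()
        classification = (row["CLASSIFICATION"] or "").strip()

        # Check b: no duplicate domains (compared case-insensitively, an assumption).
        if domain in seen:
            problems.append(f"error: line {line}: duplicate domain {domain!r}")
        seen.add(domain)

        # Check c: CLASSIFICATION must not be empty; other values are only flagged.
        if not classification:
            problems.append(f"error: line {line}: empty CLASSIFICATION for {domain!r}")
        elif classification not in PREFERRED:
            problems.append(f"warning: line {line}: non-preferred value {classification!r}")

    return problems


if __name__ == "__main__":
    # Hypothetical sample data; replace with open("email_domain_classification.csv", newline="").
    sample = io.StringIO("DOMAIN,CLASSIFICATION\nfreemail.example,personal - freemail\n")
    for problem in validate_classification_csv(sample):
        print(problem)
```

Returning a list of messages rather than raising on the first failure lets the person updating the file see every issue in one pass.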