Semarchy Email enricher
The Semarchy Email enricher standardizes and improves the quality of email addresses.
Description
This enricher processes an input email address and splits it into a local-part (username) and a domain name. Any syntax errors in both parts are automatically corrected. Additionally, the domain name validity is verified using MX records lookup. To expedite checks and automated fixes on domain names, the plugin utilizes a domain name cache.
This plugin is thread-safe and supports parallel execution. |
Domain name cache
The plugin uses several mechanisms for faster checks and automated fixes on domain names:
-
Domain names already checked as valid (based on MX records lookup) are persisted in a domain name cache stored in a datasource. This avoids repeating MX lookups.
-
A list of known domains (e.g.,
hotmail.com
,gmail.com
, etc.) is automatically seeded in the hostname validation cache. -
Common domain mistakes are fixed using a seeded replace list. For example
gmai.com
is automatically fixed togmail.com
using the cache. -
Invalid domains are automatically fixed to similar valid domains already present in the cache. For example,
semarcyh.com
is fixed tosemarchy.com
assemarchy.com
was previously checked as a valid domain name.
For more information about the domain name cache, see Domain name cache.
Plugin parameters
The following table lists the plugin parameters.
Parameter name | Mandatory | Type | Description |
---|---|---|---|
Datasource |
No |
String |
Name of the datasource used to store the hostname validation cache. This datasource must be configured in the platform. |
Lowercase User Name |
No |
String |
Set to |
Offline Mode |
No |
String |
Set to |
Processing Mode |
No |
String |
|
Plugin inputs
The following table lists the plugin inputs.
Input name | Mandatory | Type | Description |
---|---|---|---|
Input Email Address |
Yes |
String |
Input email address to cleanse. |
Plugin outputs
The following table lists the plugin outputs.
Output name | Type | Description |
---|---|---|
Cleansed Email Address |
String |
Cleansed email address returned by the enricher. This address may be valid or not. The syntactic validity or domain name validity of the email address is indicated in the other plugin outputs. |
Valid Domain |
String |
Flag (0 or 1) indicating whether the domain name is valid or not (based on syntax and MX records lookup) in the cleansed email address. In Offline mode, this parameter returns 1 or 0 if the domain name appears in the local domain cache as valid or invalid. It returns |
Valid Domain Syntax |
String |
Flag (0 or 1) indicating whether the domain name syntax is valid or not in the cleansed email address. |
Valid Email Syntax |
String |
Flag (0 or 1) indicating whether the cleansed email address is syntactically valid or not. |
Valid Username Syntax |
String |
Flag (0 or 1) indicating whether the local-part (user name) syntax is valid or not in the cleansed email address. |
Valid Input Domain |
String |
Flag (0 or 1) indicating whether the domain name is valid or not (based on syntax and MX records lookup) in the input email address. In Offline mode, this parameter returns 1 or 0 if the domain name appears in the local domain cache as valid or invalid.
It returns |
Valid Input Domain Syntax |
String |
Flag (0 or 1) indicating whether the domain name syntax is valid or not in the input email address. |
Valid Input Email Syntax |
String |
Flag (0 or 1) indicating whether the input email address is syntactically valid or not. |
Valid Input Username Syntax |
String |
Flag (0 or 1) indicating whether the local-part (user name) syntax is valid or not in the input email address. |