Filter assets
Semarchy xDG harvesters support filtering the assets to harvest using patterns.
You can filter databases, schemas, tables, views, and other assets using regular expressions on the names of the assets that you want to allow or deny in the harvesting process.
This feature only applies to specific sources. The page for each source indicates whether it is supported.
Sample recipe
The following sample recipe configures harvesting a subset of the assets using regular expression patterns.
source:
  type: postgres # A source that supports asset filtering.
  config:
    # Connection parameters for the source
    # ...
    # Harvest all databases but temp
    database_pattern: "{'allow': ['.*'],
      'deny': ['temp'],
      'ignoreCase': True}"
    # Harvest only the customerB2C schema (case sensitive)
    schema_pattern: "{'allow': ['customerB2C'],
      'deny': ['.*'],
      'ignoreCase': False}"
    # Harvest all tables but those prefixed with MTA_ (case sensitive)
    table_pattern: "{'allow': ['.*'],
      'deny': ['semarchy.public.customerB2C.MTA_.*'],
      'ignoreCase': False}"
sink:
  # sink configuration
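The pattern values in the recipe above are strings that read like Python dict literals (note the capitalized True and False). As a minimal sketch, and assuming that reading is correct, a pattern string can be sanity-checked before a harvest by parsing it and compiling each regular expression. The parse_pattern helper is hypothetical and is not part of Semarchy xDG; how the harvester itself parses these values is not stated here.

```python
import ast
import re


def parse_pattern(value: str) -> dict:
    """Parse a pattern string and check that every regular expression compiles.

    Hypothetical helper for illustration only; it assumes the value is a
    Python-style dict literal with optional 'allow' and 'deny' lists.
    """
    spec = ast.literal_eval(value)
    if not isinstance(spec, dict):
        raise ValueError("pattern must be a dict literal")
    for key in ("allow", "deny"):
        for regex in spec.get(key, []):
            re.compile(regex)  # raises re.error on an invalid expression
    return spec


# The database_pattern value from the sample recipe, written on one line.
database_pattern = "{'allow': ['.*'], 'deny': ['temp'], 'ignoreCase': True}"
spec = parse_pattern(database_pattern)
print(spec["allow"], spec["deny"], spec["ignoreCase"])
```

A malformed expression such as an unbalanced parenthesis would surface here as a re.error instead of during the harvest.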
Configure assets filters
The following source parameters configure assets filtering using patterns:
Lists of regular expression patterns that define the databases to include (allow) or exclude (deny).
Lists of regular expression patterns that define the schemas to include (allow) or exclude (deny).
Lists of regular expression patterns that define the tables to include (allow) or exclude (deny).
Lists of regular expression patterns that define the views to include (allow) or exclude (deny).
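To illustrate how allow and deny lists can interact, here is a minimal sketch of a filter evaluation. It rests on assumptions that this section does not confirm: that a deny match takes precedence over an allow match, and that patterns are matched from the start of the asset name. The is_harvested function is hypothetical and the harvester's actual behavior may differ, so treat this as a way to reason about patterns, not as a description of the implementation.

```python
import re


def is_harvested(name, allow=(".*",), deny=(), ignore_case=True):
    """Return True if the asset name passes the filter.

    Assumptions (not confirmed by the documentation): deny wins over allow,
    and patterns are anchored at the start of the name (re.match).
    """
    flags = re.IGNORECASE if ignore_case else 0
    if any(re.match(pattern, name, flags) for pattern in deny):
        return False
    return any(re.match(pattern, name, flags) for pattern in allow)


# Database filter from the sample recipe: everything but temp, ignoring case.
print(is_harvested("sales", allow=[".*"], deny=["temp"], ignore_case=True))
print(is_harvested("TEMP", allow=[".*"], deny=["temp"], ignore_case=True))

# Table filter from the sample recipe: deny tables prefixed with MTA_ (case sensitive).
table_deny = ["semarchy.public.customerB2C.MTA_.*"]
print(is_harvested("semarchy.public.customerB2C.MTA_LOG", deny=table_deny, ignore_case=False))
print(is_harvested("semarchy.public.customerB2C.customer", deny=table_deny, ignore_case=False))
```

Under these assumptions the four calls print True, False, False, and True. Because the precedence rule is an assumption, check the page for your source before combining a narrow allow list with a broad deny list.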