SAP HANA

This source extracts metadata from a SAP HANA database.

Overview

Harvesting with this source extracts the following:

  • Schemas, views, and tables metadata

  • Columns and column types

This source supports:

Sample recipe

Example 1. File source sample recipe
source:
  type: hana
  config:
    host_port: localhost:39041
    database: semarchyDemoDatabase
    username: SemUser
    password: SemPassword

sink:
  # sink config

Parameters

The following table lists the source parameters.

Parameter

Description

host_port

Database server URI and port. This parameter is mandatory.

username

Database server user.

password

This user’s password.

convert_urns_to_lowercase

Set to True to convert dataset URNs to lowercase. Defaults to False.

database

Database name to harvest. If not set, harvests all databases.

options

Options to pass to SQLAlchemy engine creation as kwargs.

scheme

Default: hana+hdbcli.

sqlalchemy_uri

URI of database to connect to. See this page for more information. Takes precedence over other connection parameters.

include_tables

Set to True (default) to harvest tables.

include_table_location_lineage

Set to True (default) to include table lineage to the underlying storage location.

include_views

Set to True (default) to harvest views.

include_view_lineage

Set to True (default) to parse the views' SQL to populate dataset-level table-to-view and view-to-view lineage.

include_view_column_lineage

Set to True (default) to parse the views' SQL to populate column-level table-to-view and view-to-view lineage, if include_view_lineage is set to True.

incremental_lineage

Set to True to emit lineage as incremental to existing lineage. Defaults to False.

schema_pattern

Lists of regular expressions patterns to define the schemas to include (allow) or exclude (deny) in the harvesting process.
The resular expression is only for the schema name. For example, to match all tables in the customerB2C schema, use the customerB2C regex.
Default value is {'allow': ['.*'], 'deny': [], 'ignoreCase': True}.
The ìgnoreCase option ignores case sensitivity during pattern matching.

table_pattern

Lists of regular expressions patterns to define the tables to include (allow) or exclude (deny) in the harvesting process.
The resular expression should match the full table name (database.schema.table). For example, to match all tables in the customerB2C schema of public schema the semarchy database, use the semarchy.public.customerB2C.* regex.
Default value is {'allow': ['.*'], 'deny': [], 'ignoreCase': True}.
The ìgnoreCase option ignores case sensitivity during pattern matching.

view_pattern

Lists of regular expressions patterns to define the views to include (allow) or exclude (deny) in the harvesting process.
The resular expression should match the full view name (database.schema.view). For example, to match all views in the customerB2C schema of public schema the semarchy database, use the semarchy.public.customerB2C.* regex.
Default value is {'allow': ['.*'], 'deny': [], 'ignoreCase': True}.
The ìgnoreCase option ignores case sensitivity during pattern matching. Note that this option defaults to the table_pattern value, if set.

platform_instance

Platform instance all assets belong to.

env

Environment assets belong to. Defaults to PROD.