Deploy a Semarchy xDM instance in AWS
You will be guided to deploy Semarchy xDM on AWS. The estimated time to complete these steps is two hours.
It is highly recommended not to use the root user for any deployment or operations described below. You are advised to always follow the policy of least privilege for all access granted as part of the deployment, such as security groups.
Step 1. Check the AWS credentials
Before starting the deployment check that the AWS Command Line Interface (CLI) is correctly configured:
-
Run the following command to retrieve your AWS Identity and Access Management (IAM) user:
$ aws iam get-user
Command output{ "User": { "Path": "/", "UserName": "<username>", "UserId": "<user_id>", "Arn": "arn:aws:iam::<account_id>:user/<username>", "CreateDate": "2022-02-08T16:19:26+00:00", "PasswordLastUsed": "2022-04-21T08:41:18+00:00" } }
If the output is different it means that AWS CLI is not configured correctly. For more information, see Getting started with the AWS CLI.
-
All the following commands will be executed on the default AWS region. Run the following command to confirm the default region used by AWS CLI:
$ aws ec2 describe-availability-zones --output text --query 'AvailabilityZones[0].[RegionName]'
Command outputeu-west-1
If the region is not set correctly, see Configuration basics.
Step 2. (Optional) Create the Aurora RDS instance
This step is optional. If you already have a running database instance, you can proceed to the next section.
This guide provides AWS CLI commands to create the database instance. This can also be achieved using the AWS console UI. For more information on AWS CLI for Relational Database Service (RDS), see interface reference for Amazon RDS.
These commands only provide mandatory parameters but you need to ensure they meet your requirements. For more information, see AWS CLI Command Interface.
Start by creating the database instance into which the Semarchy xDM repository and, optionally, the data locations for your applications, are stored.
-
Run the following command to create the database cluster:
$ aws rds create-db-cluster --db-cluster-identifier <db_cluster_name> --engine aurora-postgresql --master-username <db_clsuter_username> --master-user-password <db_cluster_password> --database-name semarchy_repository --tags Key=xdm-eks
-
Run the following command to create the writer instance:
$ aws rds create-db-instance --db-cluster-identifier <db_cluster_name> --db-instance-identifier <writer_instance_name> --db-instance-class db.t4g.medium --engine aurora-postgresql --db-subnet-group-name <db_subnet_group_name> --vpc-security-group-ids <security_group_id> --tags Key=xdm-eks
Make sure that the DB subnet group and the security group are part of the same VPC. -
(Optional) Add reader instances to dispatch the load between multiple reader instance(s) for read-only operations such as dashboards and BI tools:
$ aws rds create-db-instance --db-cluster-identifier <db_cluster_name> --db-instance-identifier <reader_instance_name> --db-instance-class db.t4g.medium --engine aurora-postgresql --db-subnet-group-name <db_subnet_group_name> --vpc-security-group-ids <security_group_id> --tags Key=xdm-eks
-
Follow the cluster and instance creation progress using the AWS console and wait for their status to change to Available (estimated time: 10 minutes). Note the endpoint URLs for later:
By default, the Aurora database cluster is not encrypted. For more information, see Encrypting Amazon Aurora resources.
Step 3. (Optional) Create the EKS cluster
This step is optional. If you already have an Elastic Kubernetes Service (EKS) cluster configured with your AWS account, you can move to the next section.
This section guides you to create the EKS cluster needed to deploy the Semarchy xDM images.
Use the eksctl tool to generate a series of AWS CLI command lines based on a configuration file. The tool will generate and execute two CloudFormation stacks resulting in the creation of the EKS cluster and a group node with three nodes.
-
Download the EKS cluster configuration file template and edit the content of the file to match your specific requirements. For more information, see the Config file schema.
-
Run the following command from the folder containing the configuration file:
$ eksctl create cluster --config-file <cluster_configuration_file>.yaml
The operation takes approximately 10 to 20 minutes.
-
Open the AWS EKS console and check whether the cluster and group node are created and have an Active and Ready status respectively:
-
Run the following command to configure kubectl with your cluster. For more information, see Create a kubeconfig for Amazon EKS.
$ aws eks update-kubeconfig --region <region_id> --name <cluster_name>
Command outputAdded new context arn:aws:eks:<region_id>:<aws_accout_id>:cluster/<cluster_name> to /Users/<username>/.kube/config
-
Run the following command to test your kubectl configuration:
$ kubectl get svc
Command outputNAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE kubernetes ClusterIP 172.20.0.1 <none> 443/TCP 23m
Step 4. (Optional) Create the Ingress load balancer cluster
This step is optional. If you already have a load balancer configured with your EKS cluster, you can proceed to the next section.
The load balancer is necessary to route users to the active instance or the passive instances. In the example, you will be using the NGINX Ingress Controller but it can be replaced by any load balancer supporting sticky sessions (it is mandatory for the passive instances).
-
Run the following command to install the NGINX Ingress Controller:
$ kubectl apply -f https://raw.githubusercontent.com/kubernetes/ingress-nginx/controller-v1.1.2/deploy/static/provider/cloud/deploy.yaml
Command outputnamespace/ingress-nginx created serviceaccount/ingress-nginx created serviceaccount/ingress-nginx-admission created role.rbac.authorization.k8s.io/ingress-nginx created role.rbac.authorization.k8s.io/ingress-nginx-admission created clusterrole.rbac.authorization.k8s.io/ingress-nginx created clusterrole.rbac.authorization.k8s.io/ingress-nginx-admission created rolebinding.rbac.authorization.k8s.io/ingress-nginx created rolebinding.rbac.authorization.k8s.io/ingress-nginx-admission created clusterrolebinding.rbac.authorization.k8s.io/ingress-nginx created clusterrolebinding.rbac.authorization.k8s.io/ingress-nginx-admission created configmap/ingress-nginx-controller created service/ingress-nginx-controller created service/ingress-nginx-controller-admission created deployment.apps/ingress-nginx-controller created job.batch/ingress-nginx-admission-create created job.batch/ingress-nginx-admission-patch created ingressclass.networking.k8s.io/nginx created validatingwebhookconfiguration.admissionregistration.k8s.io/ingress-nginx-admission created
-
Run the following command to retrieve the internal IP of the load balancer:
$ kubectl get svc -n ingress-nginx ingress-nginx-controller
Command outputNAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE ingress-nginx-controller LoadBalancer <Cluster_IP> <External_IP> 80:31347/TCP,443:32449/TCP 29s
Take note of the cluster IP; it will be used later to confirm that the deployment was successful.
Step 5. Update security group
At this step, you need to authorize the EKS cluster to connect to the Aurora cluster before completing the deployment.
Edit the inbound rules of the security group of your Aurora writer instance:
-
Navigate to AWS console > RDS.
-
From your Aurora cluster, select the writer instance, and navigate to Connectivity & Security > Security, and edit the VPC Security Group.
-
Navigate to the Inbound Rules tab and select Edit Inbound rules.
-
Select Add rule and set the following values:
-
Type: PostgreSQL
-
Source: eks-cluster-sg-xdm-eks-
<ressource_name>`
Use the search tool to find the source. -
Name: EKS cluster
-
-
Click Save to validate.
Step 6. Set the ConfigMap
At this step, you need to define a Kubernetes ConfigMap to set environment variables shared with every pod you deploy.
All the following Kubernetes commands are executed in the default namespace. |
Seven environment variables are defined and used for the Semarchy xDM startup configuration:
-
SEMARCHY_SETUP_TOKEN
: the setup token that you need to enter during the Semarchy Repository creation. -
XDM_REPOSITORY_DRIVER
: JDBC driver class for the repository database. Leave the default value for a Postgres database. -
XDM_REPOSITORY_URL
: JDBC URL for the repository database. It needs to match your Database writer instance endpoint. -
XDM_REPOSITORY_USERNAME
: database user to connect the repository database. -
XDM_REPOSITORY_PASSWORD
: database user password. -
XDM_REPOSITORY_READONLY_USERNAME
: database read-only user to connect the repository database. -
XDM_REPOSITORY_READONLY_PASSWORD
: database read-only user password.
It is mandatory to modify the above values (except XDM_REPOSITORY_DRIVER ).
|
-
Download the sample manifest file for the ConfigMap.
-
Edit the file and save your modifications.
-
Run the following command from the folder containing your manifest file:
$ kubectl apply -f <configmap_file>.yaml
Command outputconfigmap/semarchy-config created
Step 7. Execute the SQL init script
At this step, you need to configure the database schemas required to create the Semarchy repository. As the Aurora cluster is not accessible on the internet by default, use the Kubernetes pod to access it and run the SQL initialization script. If necessary you can configure the Aurora cluster to make it available outside of the AWS Virtual Private Cloud (VPC).
-
Download the sample manifest file for the disposable pod (based on a Debian image). This pod will be on the same virtual network as the database instance(s) and will be able to access it.
-
Run the following command to deploy a disposable pod:
$ kubectl apply -f <disposable_pod_file>.yaml
Command outputpod/semarchy-disposable-pod created
-
Run the following command until the pod is started (Status: Running):
$ kubectl get pod semarchy-disposable-pod
Command outputNAME READY STATUS RESTARTS AGE semarchy-disposable-pod 1/1 Running 0 16s
The pod can take about 10 to 20 seconds to get running.
-
Download the SQL script and edit it to match the values you have set in the ConfigMap.
-
semarchy_repository
: database used for the repository. -
semarchy_repository_username
: database username to connect to the repository database. -
semarchy_repository_password
: database user password. -
semarchy_repository_ro_username
: database read-only user to connect to the repository database. -
semarchy_repository_ro_password
: database read-only user password.
-
-
Save the file and run the following command to copy the script file to the disposable pod
tmp
folder:$ kubectl cp init-db.sql semarchy-disposable-pod:/tmp
-
Run the following command to access the disposable pod bash:
$ kubectl exec -it semarchy-disposable-pod -- bash
-
Install the following curl command that you will use later:
$ apk add curl
-
Run the following command to go to the
tmp
folder:$ cd /tmp
-
Run the following command to connect to the Database writer instance and execute the initialization script:
$ psql --host "<writer_instance_endpoint>" --username "<db_cluster_username>" --dbname "semarchy_repository" < init-db.sql
Command outputPassword for user <db_cluster_username>:
-
Enter the database cluster master password (
<db_cluster_password>
) and press Enter:Command outputCREATE SCHEMA GRANT ALTER DEFAULT PRIVILEGES ALTER DATABASE CREATE EXTENSION CREATE EXTENSION CREATE ROLE GRANT ROLE CREATE SCHEMA CREATE ROLE GRANT ALTER ROLE GRANT
-
Run the following command to connect to the database writer instance with the Semarchy repository user (
XDM_REPOSITORY_USERNAME
) created with the initialization script:$ psql --host "<writer_instance_endpoints>" --username "<xdm_repository_username>" --dbname "semarchy_repository"
Example 1. Command outputPassword for user <xdm_repository_username>:
-
Enter the database cluster master password (
<xdm_repository_password>
) and press Enter. -
Run the following command to list the existing database schemas:
$ \l
Command outputsemarchy_repository=> \l List of databases Name | Owner | Encoding | Collate | Ctype | Access privileges --------------------+----------+----------+-------------+-------------+---------------------- postgres | postgres | UTF8 | en_US.UTF-8 | en_US.UTF-8 | xxxxxxxx | xxxxxxxx | UTF8 | en_US.UTF-8 | en_US.UTF-8 | xxxxxx=CTc/xxxxxx semarchy_repository | postgres | UTF8 | en_US.UTF-8 | en_US.UTF-8 | =Tc/postgres + …
-
Exit from the PSQL command with the following command:
$ exit
-
Exit from the disposable pod with the following command. You will re-use this pod later to confirm the deployment completion:
$ exit
Step 8. Deploy the active pod
Once the database instance and EKS cluster are running, you can deploy the docker image of the application server.
You need to start by deploying the active node with a unique pod:
-
Download the sample manifest file for the application server active node.
This file defines the deployment of the application server active node and the service associated with exposing the application. You can edit the content of the file to match your specific requirements. For more information, see the Kubernetes documentation. -
Run the following command from the folder containing the manifest file to deploy the active pod:
$ kubectl apply -f <appserver_active_file>.yaml
Command outputdeployment.apps/semarchy-appserver-active created service/semarchy-appserver-active created
-
Run the following command to check the deployment progress until the status becomes Ready (this can take a few minutes):
$ kubectl get deployments
Command outputNAME READY UP-TO-DATE AVAILABLE AGE semarchy-appserver-active 1/1 1 1 13m
The deployment can take about 10 to 20 seconds to be ready.
Step 9. Deploy the passive pods
At this step, deploy two instances of the passive application server image for a high availability configuration.
-
Download the sample manifest file for the application server passive node. This file defines the deployment of the application server passive node and the service associated with exposing the app. You can edit the content of the file to match your specific requirements. For more information, see the Kubernetes documentation.
-
Run the following command from the folder containing the manifest file:
$ kubectl apply -f <appserver_passive_file>.yaml
Command outputdeployment.apps/semarchy-appserver-passive created service/semarchy-appserver-passive created
-
Execute the following command to ensure that the passive nodes are deployed and ready:
$ kubectl get deployments
Command outputNAME READY UP-TO-DATE AVAILABLE AGE semarchy-appserver-active 1/1 1 1 13m semarchy-appserver-passive 2/2 2 2 12m
Step 10. Configure the load balancer
Finally, you need to expose your Kubernetes pods on your networks.
Set a load balancer using sticky sessions to route the users to the active and passive pods:
-
Download the sample Ingress manifest file and edit it to match your requirements.
This file deploys an Ingress resource and configures it to use the sticky sessions for the passive instances.
For more information, see the Kubernetes documentation. -
Run the following command to apply the configuration:
$ kubectl apply -f <ingress_file>.yaml
Command outputingress.networking.k8s.io/ingress created
Step 11. Check the platform connection
At this step, you have deployed all the required resources to run Semarchy xDM on AWS with Kubernetes. You need the cluster IP you have retrieved at Step 4.
By default, the pods are not exposed to the internet. Hence, you have to use the disposable pod to check the platform connection with the active and passive nodes:
-
Run the following command to confirm that the load balancer is routing to the active instance:
$ kubectl exec -it semarchy-disposable-pod -- curl -v --resolve semarchy-appserver-active:80:<cluster_ip> semarchy-appserver-active:80/semarchy/api/rest/probes/started
Command output* Added semarchy-appserver-active:80:<cluster_ip> to DNS cache * Hostname semarchy-appserver-active was found in DNS cache * Trying <cluster_ip>:80... * Connected to semarchy-appserver-active (<cluster_ip>) port 80 (#0) > GET /semarchy/api/rest/probes/started HTTP/1.1 > Host: semarchy-appserver-active > User-Agent: curl/7.80.0 > Accept: / > * Mark bundle as not supporting multiuse < HTTP/1.1 204 …
This command executes a curl command on the disposable pod to query the REST API prob endpoints. -
Run the following command to confirm that the load balancer is routing to the passive instances:
$ kubectl exec -it semarchy-disposable-pod -- curl -v --resolve semarchy-appserver-passive:80:<cluster_ip> semarchy-appserver-passive:80/semarchy/api/rest/probes/started
Command output* Added semarchy-appserver-passive:80:<cluster_ip> to DNS cache * Hostname semarchy-appserver-passive was found in DNS cache * Trying <cluster_ip>:80... * Connected to semarchy-appserver-passive (<cluster_ip>) port 80 (#0) > GET /semarchy/api/rest/probes/started HTTP/1.1 > Host: semarchy-appserver-passive > User-Agent: curl/7.80.0 > Accept: / > * Mark bundle as not supporting multiuse < HTTP/1.1 204 …
-
Delete the disposable pod with the following command:
$ kubectl delete pod semarchy-disposable-pod
Command outputpod "semarchy-disposable-pod" deleted