This tutorial shows how to deploy a Kafkorama cluster with Kafka support, in conjunction with Azure Event Hubs, on Azure Kubernetes Service (AKS).
Before deploying Kafkorama on AKS, ensure that you have a Microsoft Azure account and have installed the Azure CLI (az) and kubectl, the two tools used throughout this tutorial.
Let's use the following shell variables for this tutorial:
export RESOURCE_GROUP=rg-kafkorama
export AKS_CLUSTER=aks-kafkorama
export EVENTHUBS_NAMESPACE=evhns-kafkorama
export EVENTHUBS_TOPIC=server
Log in to Azure:
az login
Create a new resource group:
az group create --name $RESOURCE_GROUP --location eastus
Create an AKS cluster with at least three and at most five nodes, and enable cluster autoscaling:
az aks create \
--resource-group $RESOURCE_GROUP \
--name $AKS_CLUSTER \
--node-count 3 \
--vm-set-type VirtualMachineScaleSets \
--generate-ssh-keys \
--load-balancer-sku standard \
--enable-cluster-autoscaler \
--min-count 3 \
--max-count 5
Connect to the AKS cluster:
az aks get-credentials \
--resource-group $RESOURCE_GROUP \
--name $AKS_CLUSTER
Check if the nodes of the AKS cluster are up:
kubectl get nodes
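You should see the three nodes in the Ready state, with output similar to the following (node names and versions will differ):
NAME                                STATUS   ROLES   AGE     VERSION
aks-nodepool1-12345678-vmss000000   Ready    agent   5m10s   v1.27.7
aks-nodepool1-12345678-vmss000001   Ready    agent   5m12s   v1.27.7
aks-nodepool1-12345678-vmss000002   Ready    agent   5m9s    v1.27.7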
Create a namespace kafkorama for all the resources created for this environment by copying the following to a file kafkorama-namespace.yaml:
apiVersion: v1
kind: Namespace
metadata:
  name: kafkorama
Then, execute the command:
kubectl apply -f kafkorama-namespace.yaml
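Alternatively, if you prefer not to keep a manifest file around, the same namespace can be created imperatively:
kubectl create namespace kafkorama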
First, create an Event Hubs namespace:
az eventhubs namespace create --name $EVENTHUBS_NAMESPACE \
--resource-group $RESOURCE_GROUP -l eastus
Create a Kafka topic on Azure Event Hubs as follows:
az eventhubs eventhub create --name $EVENTHUBS_TOPIC \
--resource-group $RESOURCE_GROUP \
--namespace-name $EVENTHUBS_NAMESPACE
List the authorization rules (policies) of the Event Hubs namespace and get the value of the name attribute from the JSON response of the following command:
az eventhubs namespace authorization-rule list \
--resource-group $RESOURCE_GROUP \
--namespace-name $EVENTHUBS_NAMESPACE
Supposing the policy obtained above is RootManageSharedAccessKey, get the value of the primaryConnectionString attribute from the JSON response of the following command:
az eventhubs namespace authorization-rule keys list \
--resource-group $RESOURCE_GROUP \
--namespace-name $EVENTHUBS_NAMESPACE \
--name RootManageSharedAccessKey
The value of the primaryConnectionString attribute from the response of the last command should look as follows:
Endpoint=sb://evhns-kafkorama.servicebus.windows.net/;SharedAccessKeyName=RootManageSharedAccessKey;SharedAccessKey=xxxxxxxxxxxxxxxxx
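As a shortcut, the Azure CLI can extract this value directly using its global --query option (a JMESPath expression) with tsv output, avoiding manual copying from the JSON response:
az eventhubs namespace authorization-rule keys list \
  --resource-group $RESOURCE_GROUP \
  --namespace-name $EVENTHUBS_NAMESPACE \
  --name RootManageSharedAccessKey \
  --query primaryConnectionString -o tsv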
Therefore, the JAAS config to authenticate to Azure Event Hubs with SASL should look as follows:
KafkaClient {
  org.apache.kafka.common.security.plain.PlainLoginModule required
  username="$ConnectionString"
  password="Endpoint=sb://evhns-kafkorama.servicebus.windows.net/;SharedAccessKeyName=RootManageSharedAccessKey;SharedAccessKey=xxxxxxxxxxxxxxxxx";
};
Copy the JAAS config to a file jaas.config. We will need this configuration later to connect a Kafka consumer and producer to Azure Event Hubs with SASL.
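For example, assuming the RootManageSharedAccessKey policy from above, the file can be generated in one step. The following sketch stores the connection string in a shell variable and writes jaas.config (note the escaped \$ConnectionString, which must stay literal in the file):
# Fetch the connection string into a shell variable
CONNECTION_STRING=$(az eventhubs namespace authorization-rule keys list \
  --resource-group $RESOURCE_GROUP \
  --namespace-name $EVENTHUBS_NAMESPACE \
  --name RootManageSharedAccessKey \
  --query primaryConnectionString -o tsv)

# Write the JAAS config; \$ConnectionString remains literal,
# while $CONNECTION_STRING is expanded by the shell
cat > jaas.config <<EOF
KafkaClient {
  org.apache.kafka.common.security.plain.PlainLoginModule required
  username="\$ConnectionString"
  password="$CONNECTION_STRING";
};
EOF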
Because the JAAS config file obtained in the previous step must be included in the pod configuration, we should create a secret from jaas.config, which will be mounted as a volume in Kubernetes:
kubectl create secret generic kafkorama-secret \
--from-file=jaas.config -n kafkorama
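You can verify that the secret was created with:
kubectl get secret kafkorama-secret -n kafkorama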
We will use the following Kubernetes manifest to build a cluster of three Kafkorama Gateways. The following command substitutes the variables $EVENTHUBS_NAMESPACE and $EVENTHUBS_TOPIC and creates a file named kafkorama-cluster.yaml:
cat > kafkorama-cluster.yaml <<EOL
apiVersion: v1
kind: Service
metadata:
  namespace: kafkorama
  name: kafkorama-cs
  labels:
    app: kafkorama
spec:
  type: LoadBalancer
  ports:
    - name: client-port
      port: 80
      protocol: TCP
      targetPort: 8800
  selector:
    app: kafkorama
---
apiVersion: apps/v1
kind: Deployment
metadata:
  namespace: kafkorama
  name: kafkorama
spec:
  selector:
    matchLabels:
      app: kafkorama
  replicas: 3
  template:
    metadata:
      labels:
        app: kafkorama
    spec:
      containers:
        - name: kafkorama-cluster
          imagePullPolicy: Always
          image: kafkorama/kafkorama-gateway:latest
          volumeMounts:
            - name: kafkorama-secret
              mountPath: "/kafkorama/secrets/jaas.config"
              subPath: jaas.config
              readOnly: true
          env:
            - name: MIGRATORYDATA_EXTRA_OPTS
              value: "-DMemory=128MB \
                -DLogLevel=INFO \
                -DX.ConnectionOffload=true \
                -DClusterEngine=kafka"
            - name: MIGRATORYDATA_KAFKA_EXTRA_OPTS
              value: "-Dbootstrap.servers=$EVENTHUBS_NAMESPACE.servicebus.windows.net:9093 \
                -Dtopics=$EVENTHUBS_TOPIC \
                -Dsecurity.protocol=SASL_SSL \
                -Dsasl.mechanism=PLAIN \
                -Djava.security.auth.login.config=/kafkorama/secrets/jaas.config"
            - name: MIGRATORYDATA_JAVA_GC_LOG_OPTS
              value: "-XX:+PrintCommandLineFlags -XX:+PrintGC -XX:+PrintGCDetails -XX:+DisableExplicitGC -Dsun.rmi.dgc.client.gcInterval=0x7ffffffffffffff0 -Dsun.rmi.dgc.server.gcInterval=0x7ffffffffffffff0 -verbose:gc"
          resources:
            requests:
              memory: "256Mi"
              cpu: "0.5"
          ports:
            - name: client-port
              containerPort: 8800
          readinessProbe:
            tcpSocket:
              port: 8800
            initialDelaySeconds: 20
            failureThreshold: 5
            periodSeconds: 5
          livenessProbe:
            tcpSocket:
              port: 8800
            initialDelaySeconds: 10
            failureThreshold: 5
            periodSeconds: 5
      volumes:
        - name: kafkorama-secret
          secret:
            secretName: kafkorama-secret
EOL
This manifest contains a Service and a Deployment. The Service is used to handle the clients of the Kafkorama cluster over port 80.
In this manifest, we've used the MIGRATORYDATA_EXTRA_OPTS environment variable, which can define specific parameters or adjust the default value of any parameter listed in the Configuration Guide. Here, we've used it to modify the default values of parameters such as Memory, and to change the default value of the ClusterEngine parameter in order to enable the Kafka native add-on.
To customize Kafkorama's native add-on for Kafka, the MIGRATORYDATA_KAFKA_EXTRA_OPTS environment variable offers the flexibility to define specific parameters or adjust the default value of any parameter of the Kafka native add-on. In the manifest above, we've used this environment variable to modify the default values of the parameters bootstrap.servers and topics, among others, to connect to Azure Event Hubs.
Because the command above already substituted the variables $EVENTHUBS_NAMESPACE and $EVENTHUBS_TOPIC and created the file kafkorama-cluster.yaml, you can deploy the Kafkorama cluster by running the command:
kubectl apply -f kafkorama-cluster.yaml
Because the deployment concerns the namespace kafkorama, switch to this namespace as follows:
kubectl config set-context --current --namespace=kafkorama
Later, to return to the default namespace, you can run:
kubectl config set-context --current --namespace=default
Check the running pods to ensure the kafkorama pods are running:
kubectl get pods
The output of this command should include something similar to the following:
NAME READY STATUS RESTARTS AGE
kafkorama-57848575bd-4tnbz 1/1 Running 0 4m32s
kafkorama-57848575bd-gjmld 1/1 Running 0 4m32s
kafkorama-57848575bd-tcbtf 1/1 Running 0 4m32s
You can check the logs of each cluster member by running a command as follows:
kubectl logs kafkorama-57848575bd-4tnbz
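To follow the logs of all cluster members at once, you can rely on the app=kafkorama label defined in the manifest:
kubectl logs -f -l app=kafkorama --prefix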
Now, you can check that the service of the manifest above is up and running:
kubectl get svc
You should see an output similar to the following:
NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE
kafkorama-cs LoadBalancer 10.0.39.44 YourExternalIP 80:32210/TCP 17s
You should now be able to connect to the address assigned by AKS to the load balancer service, shown under the EXTERNAL-IP column. In this case the external IP address is YourExternalIP and the port is 80. Open the corresponding URL http://YourExternalIP in your browser. You should see a welcome page that features a demo application under the Debug Console menu for publishing to and consuming real-time messages from the Kafkorama cluster.
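You can also verify from the command line that the service responds, replacing YourExternalIP with the address from your own output:
curl -i http://YourExternalIP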
The stateless nature of the Kafkorama cluster when deployed in conjunction with Azure Event Hubs, where each cluster member is independent of the others, greatly simplifies horizontal scaling on AKS.
For example, if the load of your system increases substantially, and supposing your nodes have enough resources available, you can add two new members to the cluster by modifying the replicas field as follows:
kubectl scale deployment kafkorama --replicas=5
If the load of your system decreases significantly, then you might remove three members from the cluster by modifying the replicas field as follows:
kubectl scale deployment kafkorama --replicas=2
Manual scaling is practical if the load of your system changes gradually. Otherwise, you can use the autoscaling feature of Kubernetes.
Kubernetes can monitor the load of your system, typically expressed in CPU usage, and scale your Kafkorama cluster up and down by automatically modifying the replicas field.
In the example above, to add one or more new members up to a maximum of 5 cluster members when the CPU usage of the existing members rises above 50%, or to remove one or more of the existing members when their CPU usage falls below 50%, use the following command:
kubectl autoscale deployment kafkorama \
--cpu-percent=50 --min=3 --max=5
Alternatively, you can use a YAML manifest as follows:
apiVersion: autoscaling/v1
kind: HorizontalPodAutoscaler
metadata:
  namespace: kafkorama
  name: kafkorama-autoscale # you can use any name here
spec:
  maxReplicas: 5
  minReplicas: 3
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: kafkorama
  targetCPUUtilizationPercentage: 50
Save it to a file named, for example, kafkorama-autoscale.yaml, then apply it as follows:
kubectl apply -f kafkorama-autoscale.yaml
Now, you can display information about the autoscaler object above using the following command:
kubectl get hpa
and display CPU usage of cluster members with:
kubectl top pods
While testing cluster autoscaling, it is important to understand that the Kubernetes autoscaler periodically retrieves CPU usage information from the cluster members. As a result, the autoscaling process may not appear instantaneous, but this delay aligns with the normal behavior of Kubernetes.
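To observe this behavior while generating load, you can watch the autoscaler update in place:
kubectl get hpa -w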
Kafkorama clustering tolerates a number of cluster members being down or failing, as detailed in the Clustering section.
To simulate an AKS node failure, drain a node:
kubectl drain <node-name> --force --delete-local-data \
--ignore-daemonsets
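After the drain, the pods that were running on that node are rescheduled onto the remaining nodes; you can verify their placement with:
kubectl get pods -o wide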
Then, to make the node schedulable again, use:
kubectl uncordon <node-name>
Delete the Kubernetes resources created for this deployment with:
kubectl delete -f kafkorama-namespace.yaml
Go back to the default namespace:
kubectl config set-context --current --namespace=default
Finally, when you no longer need the AKS cluster, delete its resource group:
az group delete --name $RESOURCE_GROUP --yes --no-wait
First, please read the documentation of the Kafka native add-on to understand the automatic mapping between Kafkorama subjects and Kafka topics.
Utilize Kafkorama's client APIs to create real-time applications that communicate with your Kafkorama cluster via Azure Event Hubs.
Also, employ the APIs or tools of Azure Event Hubs to generate real-time messages, which are subsequently delivered to Kafkorama's clients. Similarly, consume real-time messages from Azure Event Hubs that originate from Kafkorama's clients.
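For example, here is a sketch using the standard Kafka console tools against the Event Hubs Kafka endpoint, reusing the same SASL settings as above. The file name client.properties is illustrative, and you must replace the connection string placeholder with your own:
# SASL settings matching jaas.config; the quoted heredoc keeps
# $ConnectionString literal
cat > client.properties <<'EOF'
bootstrap.servers=evhns-kafkorama.servicebus.windows.net:9093
security.protocol=SASL_SSL
sasl.mechanism=PLAIN
sasl.jaas.config=org.apache.kafka.common.security.plain.PlainLoginModule required username="$ConnectionString" password="Endpoint=sb://evhns-kafkorama.servicebus.windows.net/;SharedAccessKeyName=RootManageSharedAccessKey;SharedAccessKey=xxxxxxxxxxxxxxxxx";
EOF

# Publish a message to the Event Hubs topic; Kafkorama clients
# subscribed to the corresponding subject receive it in real time
echo "hello" | kafka-console-producer.sh \
  --bootstrap-server evhns-kafkorama.servicebus.windows.net:9093 \
  --topic server --producer.config client.properties

# Consume the messages published by Kafkorama's clients
kafka-console-consumer.sh \
  --bootstrap-server evhns-kafkorama.servicebus.windows.net:9093 \
  --topic server --consumer.config client.properties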