Merge pull request #78 from marcel-dempers/kafka

kafka-intro
marceldempers 2021-06-06 21:27:02 +10:00 committed by GitHub
commit 1bd98cb9a3
18 changed files with 989 additions and 0 deletions

apache/kafka/README.md Normal file

@@ -0,0 +1,4 @@
# Introduction to Kafka
This guide lives in the messaging section alongside other message brokers like `RabbitMQ`. </br>
Check out the guide under the [messaging/kafka](../../messaging/kafka/README.md) folder.

messaging/kafka/README.md Normal file

@@ -0,0 +1,216 @@
# Introduction to Kafka
Official [Docs](https://kafka.apache.org/)
## Building a Dockerfile
As always, we start with a `dockerfile`. </br>
We can build an image from our `dockerfile`:
```
cd .\messaging\kafka\
docker build . -t aimvector/kafka:2.7.0
```
## Exploring the Kafka Install
We can then run it to explore the contents:
```
docker run --rm --name kafka -it aimvector/kafka:2.7.0 bash
ls -l /kafka/bin/
cat /kafka/config/server.properties
```
We can use the `docker cp` command to copy the files out of our container:
```
docker cp kafka:/kafka/config/server.properties ./server.properties
docker cp kafka:/kafka/config/zookeeper.properties ./zookeeper.properties
```
Note: We'll need the Kafka configuration to tune our server, and Kafka also requires
at least one Zookeeper instance in order to function. To achieve high availability, we'll run
multiple Kafka as well as multiple Zookeeper instances in the future.
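The two settings that tie these pieces together are `broker.id` (unique per broker) and `zookeeper.connect` (how a broker finds Zookeeper). A quick, optional way to confirm them in the file we just copied out, assuming a `grep`-capable shell:
```
grep -E "^(broker.id|zookeeper.connect)=" ./server.properties
# stock 2.7.0 defaults, which we override per broker later:
# broker.id=0
# zookeeper.connect=localhost:2181
```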
# Zookeeper
Let's build a Zookeeper image. The Apache folks have made it easy to start a Zookeeper instance the same way as the Kafka instance: our image simply runs a `start-zookeeper.sh` script that wraps the bundled Zookeeper start script.
```
cd ./zookeeper
docker build . -t aimvector/zookeeper:2.7.0
cd ..
```
Let's create a `kafka` docker network and run a single Zookeeper instance:
```
docker network create kafka
docker run -d `
--rm `
--name zookeeper-1 `
--net kafka `
-v ${PWD}/config/zookeeper-1/zookeeper.properties:/kafka/config/zookeeper.properties `
aimvector/zookeeper:2.7.0
docker logs zookeeper-1
```
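As an optional sanity check, we can confirm Zookeeper is answering by using the `zookeeper-shell.sh` script that ships with the Kafka install:
```
docker exec -it zookeeper-1 bash -c "/kafka/bin/zookeeper-shell.sh localhost:2181 ls /"
```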
# Kafka - 1
```
docker run -d `
--rm `
--name kafka-1 `
--net kafka `
-v ${PWD}/config/kafka-1/server.properties:/kafka/config/server.properties `
aimvector/kafka:2.7.0
docker logs kafka-1
```
# Kafka - 2
```
docker run -d `
--rm `
--name kafka-2 `
--net kafka `
-v ${PWD}/config/kafka-2/server.properties:/kafka/config/server.properties `
aimvector/kafka:2.7.0
```
# Kafka - 3
```
docker run -d `
--rm `
--name kafka-3 `
--net kafka `
-v ${PWD}/config/kafka-3/server.properties:/kafka/config/server.properties `
aimvector/kafka:2.7.0
```
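With all three brokers up, each should have registered itself in Zookeeper. As another optional check, we can list the registered broker ids and expect `[1, 2, 3]`, matching the `broker.id` values in our config:
```
docker exec -it zookeeper-1 bash -c "/kafka/bin/zookeeper-shell.sh localhost:2181 ls /brokers/ids"
```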
# Topic
Let's create a topic that allows us to store `Order` information. </br>
The Kafka installation ships with scripts that allow us to create and manage topics. </br>
Access the container:
```
docker exec -it zookeeper-1 bash
```
Create the Topic:
```
/kafka/bin/kafka-topics.sh \
--create \
--zookeeper zookeeper-1:2181 \
--replication-factor 1 \
--partitions 3 \
--topic Orders
```
Describe our Topic:
```
/kafka/bin/kafka-topics.sh \
--describe \
--topic Orders \
--zookeeper zookeeper-1:2181
```
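We can also list all topics on the cluster to confirm `Orders` was created:
```
/kafka/bin/kafka-topics.sh \
--list \
--zookeeper zookeeper-1:2181
```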
# Simple Producer & Consumer
The Kafka installation also ships with scripts that allow us to produce
and consume messages on our Kafka network. </br>
Let's start a consumer that will receive messages on the `Orders` topic:
```
docker exec -it zookeeper-1 bash
/kafka/bin/kafka-console-consumer.sh \
--bootstrap-server kafka-1:9092,kafka-2:9092,kafka-3:9092 \
--topic Orders --from-beginning
```
With a consumer in place, we can start producing messages:
```
docker exec -it zookeeper-1 bash
echo "New Order: 1" | \
/kafka/bin/kafka-console-producer.sh \
--broker-list kafka-1:9092,kafka-2:9092,kafka-3:9092 \
--topic Orders > /dev/null
```
Once we have a message in Kafka, we can explore which partition it was stored in:
```
docker exec -it kafka-1 bash
apt install -y tree
tree /tmp/kafka-logs/
ls -lh /tmp/kafka-logs/Orders-*

# example output:
/tmp/kafka-logs/Orders-0:
total 4.0K
-rw-r--r-- 1 root root 10M May 4 06:54 00000000000000000000.index
-rw-r--r-- 1 root root 0 May 4 06:54 00000000000000000000.log
-rw-r--r-- 1 root root 10M May 4 06:54 00000000000000000000.timeindex
-rw-r--r-- 1 root root 8 May 4 06:54 leader-epoch-checkpoint
/tmp/kafka-logs/Orders-1:
total 4.0K
-rw-r--r-- 1 root root 10M May 4 06:54 00000000000000000000.index
-rw-r--r-- 1 root root 0 May 4 06:54 00000000000000000000.log
-rw-r--r-- 1 root root 10M May 4 06:54 00000000000000000000.timeindex
-rw-r--r-- 1 root root 8 May 4 06:54 leader-epoch-checkpoint
/tmp/kafka-logs/Orders-2:
total 8.0K
-rw-r--r-- 1 root root 10M May 4 06:54 00000000000000000000.index
-rw-r--r-- 1 root root 80 May 4 06:57 00000000000000000000.log
-rw-r--r-- 1 root root 10M May 4 06:54 00000000000000000000.timeindex
-rw-r--r-- 1 root root 8 May 4 06:54 leader-epoch-checkpoint
```
Since partitions 0 and 1 show a 0-byte log file, we know the message is sitting in partition 2, whose log file is 80 bytes. </br>
We can check the message with:
```
cat /tmp/kafka-logs/Orders-2/*.log
```
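`cat` shows the raw segment, including record batch headers. For a cleaner, optional view, Kafka ships a log dump tool that decodes the segment (exact output varies by version):
```
/kafka/bin/kafka-run-class.sh kafka.tools.DumpLogSegments \
--deep-iteration \
--print-data-log \
--files /tmp/kafka-logs/Orders-2/00000000000000000000.log
```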
## Building a Producer: Go
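First, build the producer image. Assuming the producer application sits alongside the consumer at `messaging\kafka\applications\producer`, the build mirrors the consumer steps:
```
cd messaging\kafka\applications\producer
docker build . -t kafka-producer
```
We can then run it, pointing it at our brokers and the `Orders` topic: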
```
docker run -it `
--net kafka `
-e KAFKA_PEERS="kafka-1:9092,kafka-2:9092,kafka-3:9092" `
-e KAFKA_TOPIC="Orders" `
-e KAFKA_PARTITION=1 `
-p 80:80 `
kafka-producer
```
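The producer exposes `POST /publish/:message` on port 80 (as defined in its `main.go`), so once it's running we can publish a message from the host; `hello-kafka` here is just an example value:
```
curl -X POST http://localhost/publish/hello-kafka
```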
## Building a Consumer: Go
```
cd messaging\kafka\applications\consumer
docker build . -t kafka-consumer
docker run -it `
--net kafka `
-e KAFKA_PEERS="kafka-1:9092,kafka-2:9092,kafka-3:9092" `
-e KAFKA_TOPIC="Orders" `
kafka-consumer
```
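For each message, the consumer prints the partition, offset, key and value (per the `Printf` calls in its `main.go`). The output looks roughly like this (values illustrative):
```
Partition:  2
Offset:     0
Key:
Value:      New Order: 1
```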

@@ -0,0 +1,92 @@
package main

import (
    "fmt"
    "os"
    "os/signal"
    "strings"
    "sync"
    "syscall"

    "gopkg.in/Shopify/sarama.v1"
)

var kafkaBrokers = os.Getenv("KAFKA_PEERS")
var kafkaTopic = os.Getenv("KAFKA_TOPIC")

// declared but not used by this consumer
var globalProducer sarama.SyncProducer

func main() {
    // client configuration (the Producer settings below are not used by a consumer)
    config := sarama.NewConfig()
    config.Producer.RequiredAcks = sarama.WaitForAll
    config.Producer.Return.Successes = true
    config.Producer.Partitioner = sarama.NewRandomPartitioner

    consumer, err := sarama.NewConsumer(strings.Split(kafkaBrokers, ","), config)
    if err != nil {
        fmt.Printf("Failed to open Kafka consumer: %s", err)
        panic(err)
    }

    // discover all partitions of the topic so we can consume each one
    partitionList, err := consumer.Partitions(kafkaTopic)
    if err != nil {
        fmt.Printf("Failed to get the list of partitions: %s", err)
        panic(err)
    }

    var bufferSize = 256
    var (
        messages = make(chan *sarama.ConsumerMessage, bufferSize)
        closing  = make(chan struct{})
        wg       sync.WaitGroup
    )

    // trap SIGTERM/interrupt and signal all partition consumers to shut down
    go func() {
        signals := make(chan os.Signal, 1)
        signal.Notify(signals, syscall.SIGTERM, os.Interrupt)
        <-signals
        fmt.Println("Initiating shutdown of consumer...")
        close(closing)
    }()

    // start one PartitionConsumer per partition, reading from the oldest offset
    for _, partition := range partitionList {
        pc, err := consumer.ConsumePartition(kafkaTopic, partition, sarama.OffsetOldest)
        if err != nil {
            fmt.Printf("Failed to start consumer for partition %d: %s\n", partition, err)
            panic(err)
        }

        go func(pc sarama.PartitionConsumer) {
            <-closing
            pc.AsyncClose()
        }(pc)

        wg.Add(1)
        go func(pc sarama.PartitionConsumer) {
            defer wg.Done()
            for message := range pc.Messages() {
                messages <- message
            }
        }(pc)
    }

    // print every consumed message
    go func() {
        for msg := range messages {
            fmt.Printf("Partition:\t%d\n", msg.Partition)
            fmt.Printf("Offset:\t%d\n", msg.Offset)
            fmt.Printf("Key:\t%s\n", string(msg.Key))
            fmt.Printf("Value:\t%s\n", string(msg.Value))
            fmt.Println()
        }
    }()

    wg.Wait()
    fmt.Println("Done consuming topic", kafkaTopic)
    close(messages)

    if err := consumer.Close(); err != nil {
        fmt.Printf("Failed to close consumer: %s", err)
        panic(err)
    }
}

@@ -0,0 +1,20 @@
FROM golang:1.16-alpine as dev-env
RUN apk add --no-cache git gcc musl-dev
WORKDIR /app
FROM dev-env as build-env
COPY go.mod /go.sum /app/
RUN go mod download
COPY . /app/
RUN CGO_ENABLED=0 go build -o /consumer
FROM alpine:3.10 as runtime
COPY --from=build-env /consumer /usr/local/bin/consumer
RUN chmod +x /usr/local/bin/consumer
ENTRYPOINT ["consumer"]

@@ -0,0 +1,17 @@
module producer
go 1.16
require (
github.com/DataDog/zstd v1.4.8 // indirect
github.com/davecgh/go-spew v1.1.1 // indirect
github.com/eapache/go-resiliency v1.2.0 // indirect
github.com/eapache/go-xerial-snappy v0.0.0-20180814174437-776d5712da21 // indirect
github.com/eapache/queue v1.1.0 // indirect
github.com/golang/snappy v0.0.3 // indirect
github.com/julienschmidt/httprouter v1.3.0 // indirect
github.com/pierrec/lz4 v2.6.0+incompatible // indirect
github.com/rcrowley/go-metrics v0.0.0-20201227073835-cf1acfcdf475 // indirect
github.com/sirupsen/logrus v1.8.1 // indirect
gopkg.in/Shopify/sarama.v1 v1.20.1 // indirect
)

@@ -0,0 +1,26 @@
github.com/DataDog/zstd v1.4.8 h1:Rpmta4xZ/MgZnriKNd24iZMhGpP5dvUcs/uqfBapKZY=
github.com/DataDog/zstd v1.4.8/go.mod h1:g4AWEaM3yOg3HYfnJ3YIawPnVdXJh9QME85blwSAmyw=
github.com/davecgh/go-spew v1.1.1 h1:vj9j/u1bqnvCEfJOwUhtlOARqs3+rkHYY13jYWTU97c=
github.com/davecgh/go-spew v1.1.1/go.mod h1:J7Y8YcW2NihsgmVo/mv3lAwl/skON4iLHjSsI+c5H38=
github.com/eapache/go-resiliency v1.2.0 h1:v7g92e/KSN71Rq7vSThKaWIq68fL4YHvWyiUKorFR1Q=
github.com/eapache/go-resiliency v1.2.0/go.mod h1:kFI+JgMyC7bLPUVY133qvEBtVayf5mFgVsvEsIPBvNs=
github.com/eapache/go-xerial-snappy v0.0.0-20180814174437-776d5712da21 h1:YEetp8/yCZMuEPMUDHG0CW/brkkEp8mzqk2+ODEitlw=
github.com/eapache/go-xerial-snappy v0.0.0-20180814174437-776d5712da21/go.mod h1:+020luEh2TKB4/GOp8oxxtq0Daoen/Cii55CzbTV6DU=
github.com/eapache/queue v1.1.0 h1:YOEu7KNc61ntiQlcEeUIoDTJ2o8mQznoNvUhiigpIqc=
github.com/eapache/queue v1.1.0/go.mod h1:6eCeP0CKFpHLu8blIFXhExK/dRa7WDZfr6jVFPTqq+I=
github.com/golang/snappy v0.0.3 h1:fHPg5GQYlCeLIPB9BZqMVR5nR9A+IM5zcgeTdjMYmLA=
github.com/golang/snappy v0.0.3/go.mod h1:/XxbfmMg8lxefKM7IXC3fBNl/7bRcc72aCRzEWrmP2Q=
github.com/julienschmidt/httprouter v1.3.0 h1:U0609e9tgbseu3rBINet9P48AI/D3oJs4dN7jwJOQ1U=
github.com/julienschmidt/httprouter v1.3.0/go.mod h1:JR6WtHb+2LUe8TCKY3cZOxFyyO8IZAc4RVcycCCAKdM=
github.com/pierrec/lz4 v2.6.0+incompatible h1:Ix9yFKn1nSPBLFl/yZknTp8TU5G4Ps0JDmguYK6iH1A=
github.com/pierrec/lz4 v2.6.0+incompatible/go.mod h1:pdkljMzZIN41W+lC3N2tnIh5sFi+IEE17M5jbnwPHcY=
github.com/pmezard/go-difflib v1.0.0/go.mod h1:iKH77koFhYxTK1pcRnkKkqfTogsbg7gZNVY4sRDYZ/4=
github.com/rcrowley/go-metrics v0.0.0-20201227073835-cf1acfcdf475 h1:N/ElC8H3+5XpJzTSTfLsJV/mx9Q9g7kxmchpfZyxgzM=
github.com/rcrowley/go-metrics v0.0.0-20201227073835-cf1acfcdf475/go.mod h1:bCqnVzQkZxMG4s8nGwiZ5l3QUCyqpo9Y+/ZMZ9VjZe4=
github.com/sirupsen/logrus v1.8.1 h1:dJKuHgqk1NNQlqoA6BTlM1Wf9DOH3NBjQyu0h9+AZZE=
github.com/sirupsen/logrus v1.8.1/go.mod h1:yWOB1SBYBC5VeMP7gHvWumXLIWorT60ONWic61uBYv0=
github.com/stretchr/testify v1.2.2/go.mod h1:a8OnRcib4nhh0OaRAV+Yts87kKdq0PP7pXfy6kDkUVs=
golang.org/x/sys v0.0.0-20191026070338-33540a1f6037 h1:YyJpGZS1sBuBCzLAR1VEpK193GlqGZbnPFnPV/5Rsb4=
golang.org/x/sys v0.0.0-20191026070338-33540a1f6037/go.mod h1:h1NjWce9XRLGQEsW7wpKNCjG9DtNlClVuFLEZdDNbEs=
gopkg.in/Shopify/sarama.v1 v1.20.1 h1:Gi09A3fJXm0Jgt8kuKZ8YK+r60GfYn7MQuEmI3oq6hE=
gopkg.in/Shopify/sarama.v1 v1.20.1/go.mod h1:AxnvoaevB2nBjNK17cG61A3LleFcWFwVBHBt+cot4Oc=

@@ -0,0 +1,20 @@
FROM golang:1.16-alpine as dev-env
RUN apk add --no-cache git gcc musl-dev
WORKDIR /app
FROM dev-env as build-env
COPY go.mod /go.sum /app/
RUN go mod download
COPY . /app/
RUN CGO_ENABLED=0 go build -o /producer
FROM alpine:3.10 as runtime
COPY --from=build-env /producer /usr/local/bin/producer
RUN chmod +x /usr/local/bin/producer
ENTRYPOINT ["producer"]

@@ -0,0 +1,17 @@
module producer
go 1.16
require (
github.com/DataDog/zstd v1.4.8 // indirect
github.com/davecgh/go-spew v1.1.1 // indirect
github.com/eapache/go-resiliency v1.2.0 // indirect
github.com/eapache/go-xerial-snappy v0.0.0-20180814174437-776d5712da21 // indirect
github.com/eapache/queue v1.1.0 // indirect
github.com/golang/snappy v0.0.3 // indirect
github.com/julienschmidt/httprouter v1.3.0 // indirect
github.com/pierrec/lz4 v2.6.0+incompatible // indirect
github.com/rcrowley/go-metrics v0.0.0-20201227073835-cf1acfcdf475 // indirect
github.com/sirupsen/logrus v1.8.1 // indirect
gopkg.in/Shopify/sarama.v1 v1.20.1 // indirect
)

@@ -0,0 +1,26 @@
github.com/DataDog/zstd v1.4.8 h1:Rpmta4xZ/MgZnriKNd24iZMhGpP5dvUcs/uqfBapKZY=
github.com/DataDog/zstd v1.4.8/go.mod h1:g4AWEaM3yOg3HYfnJ3YIawPnVdXJh9QME85blwSAmyw=
github.com/davecgh/go-spew v1.1.1 h1:vj9j/u1bqnvCEfJOwUhtlOARqs3+rkHYY13jYWTU97c=
github.com/davecgh/go-spew v1.1.1/go.mod h1:J7Y8YcW2NihsgmVo/mv3lAwl/skON4iLHjSsI+c5H38=
github.com/eapache/go-resiliency v1.2.0 h1:v7g92e/KSN71Rq7vSThKaWIq68fL4YHvWyiUKorFR1Q=
github.com/eapache/go-resiliency v1.2.0/go.mod h1:kFI+JgMyC7bLPUVY133qvEBtVayf5mFgVsvEsIPBvNs=
github.com/eapache/go-xerial-snappy v0.0.0-20180814174437-776d5712da21 h1:YEetp8/yCZMuEPMUDHG0CW/brkkEp8mzqk2+ODEitlw=
github.com/eapache/go-xerial-snappy v0.0.0-20180814174437-776d5712da21/go.mod h1:+020luEh2TKB4/GOp8oxxtq0Daoen/Cii55CzbTV6DU=
github.com/eapache/queue v1.1.0 h1:YOEu7KNc61ntiQlcEeUIoDTJ2o8mQznoNvUhiigpIqc=
github.com/eapache/queue v1.1.0/go.mod h1:6eCeP0CKFpHLu8blIFXhExK/dRa7WDZfr6jVFPTqq+I=
github.com/golang/snappy v0.0.3 h1:fHPg5GQYlCeLIPB9BZqMVR5nR9A+IM5zcgeTdjMYmLA=
github.com/golang/snappy v0.0.3/go.mod h1:/XxbfmMg8lxefKM7IXC3fBNl/7bRcc72aCRzEWrmP2Q=
github.com/julienschmidt/httprouter v1.3.0 h1:U0609e9tgbseu3rBINet9P48AI/D3oJs4dN7jwJOQ1U=
github.com/julienschmidt/httprouter v1.3.0/go.mod h1:JR6WtHb+2LUe8TCKY3cZOxFyyO8IZAc4RVcycCCAKdM=
github.com/pierrec/lz4 v2.6.0+incompatible h1:Ix9yFKn1nSPBLFl/yZknTp8TU5G4Ps0JDmguYK6iH1A=
github.com/pierrec/lz4 v2.6.0+incompatible/go.mod h1:pdkljMzZIN41W+lC3N2tnIh5sFi+IEE17M5jbnwPHcY=
github.com/pmezard/go-difflib v1.0.0/go.mod h1:iKH77koFhYxTK1pcRnkKkqfTogsbg7gZNVY4sRDYZ/4=
github.com/rcrowley/go-metrics v0.0.0-20201227073835-cf1acfcdf475 h1:N/ElC8H3+5XpJzTSTfLsJV/mx9Q9g7kxmchpfZyxgzM=
github.com/rcrowley/go-metrics v0.0.0-20201227073835-cf1acfcdf475/go.mod h1:bCqnVzQkZxMG4s8nGwiZ5l3QUCyqpo9Y+/ZMZ9VjZe4=
github.com/sirupsen/logrus v1.8.1 h1:dJKuHgqk1NNQlqoA6BTlM1Wf9DOH3NBjQyu0h9+AZZE=
github.com/sirupsen/logrus v1.8.1/go.mod h1:yWOB1SBYBC5VeMP7gHvWumXLIWorT60ONWic61uBYv0=
github.com/stretchr/testify v1.2.2/go.mod h1:a8OnRcib4nhh0OaRAV+Yts87kKdq0PP7pXfy6kDkUVs=
golang.org/x/sys v0.0.0-20191026070338-33540a1f6037 h1:YyJpGZS1sBuBCzLAR1VEpK193GlqGZbnPFnPV/5Rsb4=
golang.org/x/sys v0.0.0-20191026070338-33540a1f6037/go.mod h1:h1NjWce9XRLGQEsW7wpKNCjG9DtNlClVuFLEZdDNbEs=
gopkg.in/Shopify/sarama.v1 v1.20.1 h1:Gi09A3fJXm0Jgt8kuKZ8YK+r60GfYn7MQuEmI3oq6hE=
gopkg.in/Shopify/sarama.v1 v1.20.1/go.mod h1:AxnvoaevB2nBjNK17cG61A3LleFcWFwVBHBt+cot4Oc=

@@ -0,0 +1,77 @@
package main

import (
    "fmt"
    "net/http"
    "os"
    "strconv"
    "strings"

    "github.com/julienschmidt/httprouter"
    log "github.com/sirupsen/logrus"
    "gopkg.in/Shopify/sarama.v1"
)

var kafkaBrokers = os.Getenv("KAFKA_PEERS")
var kafkaTopic = os.Getenv("KAFKA_TOPIC")
var kafkaPartition = os.Getenv("KAFKA_PARTITION")
var partition int32 = -1

var globalProducer sarama.SyncProducer

func main() {
    // producer configuration: wait for all in-sync replicas to ack,
    // and distribute messages with the random partitioner
    config := sarama.NewConfig()
    config.Producer.RequiredAcks = sarama.WaitForAll
    config.Producer.Return.Successes = true
    config.Producer.Partitioner = sarama.NewRandomPartitioner

    p, err := strconv.Atoi(kafkaPartition)
    if err != nil {
        fmt.Println("Failed to convert KAFKA_PARTITION to Int32")
        panic(err)
    }
    partition = int32(p)

    producer, err := sarama.NewSyncProducer(strings.Split(kafkaBrokers, ","), config)
    if err != nil {
        fmt.Printf("Failed to open Kafka producer: %s", err)
        panic(err)
    }
    globalProducer = producer

    defer func() {
        fmt.Println("Closing Kafka producer...")
        if err := globalProducer.Close(); err != nil {
            fmt.Printf("Failed to close Kafka producer cleanly: %s", err)
            panic(err)
        }
    }()

    // expose POST /publish/:message and publish the path parameter to Kafka
    router := httprouter.New()
    router.POST("/publish/:message", func(w http.ResponseWriter, r *http.Request, p httprouter.Params) {
        submit(w, r, p)
    })

    fmt.Println("Running...")
    log.Fatal(http.ListenAndServe(":80", router))
}

func submit(writer http.ResponseWriter, request *http.Request, p httprouter.Params) {
    messageValue := p.ByName("message")

    message := &sarama.ProducerMessage{Topic: kafkaTopic, Partition: partition}
    message.Value = sarama.StringEncoder(messageValue)
    fmt.Println("Received message: " + messageValue)

    partition, offset, err := globalProducer.SendMessage(message)
    if err != nil {
        log.Fatalf("%s: %s", "Failed to connect to Kafka", err)
    }
    fmt.Printf("publish success! topic=%s\tpartition=%d\toffset=%d\n", kafkaTopic, partition, offset)
}

@@ -0,0 +1,136 @@
# Licensed to the Apache Software Foundation (ASF) under one or more
# contributor license agreements. See the NOTICE file distributed with
# this work for additional information regarding copyright ownership.
# The ASF licenses this file to You under the Apache License, Version 2.0
# (the "License"); you may not use this file except in compliance with
# the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# see kafka.server.KafkaConfig for additional details and defaults
############################# Server Basics #############################
# The id of the broker. This must be set to a unique integer for each broker.
broker.id=1
############################# Socket Server Settings #############################
# The address the socket server listens on. It will get the value returned from
# java.net.InetAddress.getCanonicalHostName() if not configured.
# FORMAT:
# listeners = listener_name://host_name:port
# EXAMPLE:
# listeners = PLAINTEXT://your.host.name:9092
#listeners=PLAINTEXT://:9092
# Hostname and port the broker will advertise to producers and consumers. If not set,
# it uses the value for "listeners" if configured. Otherwise, it will use the value
# returned from java.net.InetAddress.getCanonicalHostName().
#advertised.listeners=PLAINTEXT://your.host.name:9092
# Maps listener names to security protocols, the default is for them to be the same. See the config documentation for more details
#listener.security.protocol.map=PLAINTEXT:PLAINTEXT,SSL:SSL,SASL_PLAINTEXT:SASL_PLAINTEXT,SASL_SSL:SASL_SSL
# The number of threads that the server uses for receiving requests from the network and sending responses to the network
num.network.threads=3
# The number of threads that the server uses for processing requests, which may include disk I/O
num.io.threads=8
# The send buffer (SO_SNDBUF) used by the socket server
socket.send.buffer.bytes=102400
# The receive buffer (SO_RCVBUF) used by the socket server
socket.receive.buffer.bytes=102400
# The maximum size of a request that the socket server will accept (protection against OOM)
socket.request.max.bytes=104857600
############################# Log Basics #############################
# A comma separated list of directories under which to store log files
log.dirs=/tmp/kafka-logs
# The default number of log partitions per topic. More partitions allow greater
# parallelism for consumption, but this will also result in more files across
# the brokers.
num.partitions=1
# The number of threads per data directory to be used for log recovery at startup and flushing at shutdown.
# This value is recommended to be increased for installations with data dirs located in RAID array.
num.recovery.threads.per.data.dir=1
############################# Internal Topic Settings #############################
# The replication factor for the group metadata internal topics "__consumer_offsets" and "__transaction_state"
# For anything other than development testing, a value greater than 1 is recommended to ensure availability such as 3.
offsets.topic.replication.factor=1
transaction.state.log.replication.factor=1
transaction.state.log.min.isr=1
############################# Log Flush Policy #############################
# Messages are immediately written to the filesystem but by default we only fsync() to sync
# the OS cache lazily. The following configurations control the flush of data to disk.
# There are a few important trade-offs here:
# 1. Durability: Unflushed data may be lost if you are not using replication.
# 2. Latency: Very large flush intervals may lead to latency spikes when the flush does occur as there will be a lot of data to flush.
# 3. Throughput: The flush is generally the most expensive operation, and a small flush interval may lead to excessive seeks.
# The settings below allow one to configure the flush policy to flush data after a period of time or
# every N messages (or both). This can be done globally and overridden on a per-topic basis.
# The number of messages to accept before forcing a flush of data to disk
#log.flush.interval.messages=10000
# The maximum amount of time a message can sit in a log before we force a flush
#log.flush.interval.ms=1000
############################# Log Retention Policy #############################
# The following configurations control the disposal of log segments. The policy can
# be set to delete segments after a period of time, or after a given size has accumulated.
# A segment will be deleted whenever *either* of these criteria are met. Deletion always happens
# from the end of the log.
# The minimum age of a log file to be eligible for deletion due to age
log.retention.hours=168
# A size-based retention policy for logs. Segments are pruned from the log unless the remaining
# segments drop below log.retention.bytes. Functions independently of log.retention.hours.
#log.retention.bytes=1073741824
# The maximum size of a log segment file. When this size is reached a new log segment will be created.
log.segment.bytes=1073741824
# The interval at which log segments are checked to see if they can be deleted according
# to the retention policies
log.retention.check.interval.ms=300000
############################# Zookeeper #############################
# Zookeeper connection string (see zookeeper docs for details).
# This is a comma separated host:port pairs, each corresponding to a zk
# server. e.g. "127.0.0.1:3000,127.0.0.1:3001,127.0.0.1:3002".
# You can also append an optional chroot string to the urls to specify the
# root directory for all kafka znodes.
zookeeper.connect=zookeeper-1:2181
# Timeout in ms for connecting to zookeeper
zookeeper.connection.timeout.ms=18000
############################# Group Coordinator Settings #############################
# The following configuration specifies the time, in milliseconds, that the GroupCoordinator will delay the initial consumer rebalance.
# The rebalance will be further delayed by the value of group.initial.rebalance.delay.ms as new members join the group, up to a maximum of max.poll.interval.ms.
# The default value for this is 3 seconds.
# We override this to 0 here as it makes for a better out-of-the-box experience for development and testing.
# However, in production environments the default value of 3 seconds is more suitable as this will help to avoid unnecessary, and potentially expensive, rebalances during application startup.
group.initial.rebalance.delay.ms=0

@@ -0,0 +1,136 @@
# Licensed to the Apache Software Foundation (ASF) under one or more
# contributor license agreements. See the NOTICE file distributed with
# this work for additional information regarding copyright ownership.
# The ASF licenses this file to You under the Apache License, Version 2.0
# (the "License"); you may not use this file except in compliance with
# the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# see kafka.server.KafkaConfig for additional details and defaults
############################# Server Basics #############################
# The id of the broker. This must be set to a unique integer for each broker.
broker.id=2
############################# Socket Server Settings #############################
# The address the socket server listens on. It will get the value returned from
# java.net.InetAddress.getCanonicalHostName() if not configured.
# FORMAT:
# listeners = listener_name://host_name:port
# EXAMPLE:
# listeners = PLAINTEXT://your.host.name:9092
#listeners=PLAINTEXT://:9092
# Hostname and port the broker will advertise to producers and consumers. If not set,
# it uses the value for "listeners" if configured. Otherwise, it will use the value
# returned from java.net.InetAddress.getCanonicalHostName().
#advertised.listeners=PLAINTEXT://your.host.name:9092
# Maps listener names to security protocols, the default is for them to be the same. See the config documentation for more details
#listener.security.protocol.map=PLAINTEXT:PLAINTEXT,SSL:SSL,SASL_PLAINTEXT:SASL_PLAINTEXT,SASL_SSL:SASL_SSL
# The number of threads that the server uses for receiving requests from the network and sending responses to the network
num.network.threads=3
# The number of threads that the server uses for processing requests, which may include disk I/O
num.io.threads=8
# The send buffer (SO_SNDBUF) used by the socket server
socket.send.buffer.bytes=102400
# The receive buffer (SO_RCVBUF) used by the socket server
socket.receive.buffer.bytes=102400
# The maximum size of a request that the socket server will accept (protection against OOM)
socket.request.max.bytes=104857600
############################# Log Basics #############################
# A comma separated list of directories under which to store log files
log.dirs=/tmp/kafka-logs
# The default number of log partitions per topic. More partitions allow greater
# parallelism for consumption, but this will also result in more files across
# the brokers.
num.partitions=1
# The number of threads per data directory to be used for log recovery at startup and flushing at shutdown.
# This value is recommended to be increased for installations with data dirs located in RAID array.
num.recovery.threads.per.data.dir=1
############################# Internal Topic Settings #############################
# The replication factor for the group metadata internal topics "__consumer_offsets" and "__transaction_state"
# For anything other than development testing, a value greater than 1 is recommended to ensure availability such as 3.
offsets.topic.replication.factor=1
transaction.state.log.replication.factor=1
transaction.state.log.min.isr=1
############################# Log Flush Policy #############################
# Messages are immediately written to the filesystem but by default we only fsync() to sync
# the OS cache lazily. The following configurations control the flush of data to disk.
# There are a few important trade-offs here:
# 1. Durability: Unflushed data may be lost if you are not using replication.
# 2. Latency: Very large flush intervals may lead to latency spikes when the flush does occur as there will be a lot of data to flush.
# 3. Throughput: The flush is generally the most expensive operation, and a small flush interval may lead to excessive seeks.
# The settings below allow one to configure the flush policy to flush data after a period of time or
# every N messages (or both). This can be done globally and overridden on a per-topic basis.
# The number of messages to accept before forcing a flush of data to disk
#log.flush.interval.messages=10000
# The maximum amount of time a message can sit in a log before we force a flush
#log.flush.interval.ms=1000
############################# Log Retention Policy #############################
# The following configurations control the disposal of log segments. The policy can
# be set to delete segments after a period of time, or after a given size has accumulated.
# A segment will be deleted whenever *either* of these criteria are met. Deletion always happens
# from the end of the log.
# The minimum age of a log file to be eligible for deletion due to age
log.retention.hours=168
# A size-based retention policy for logs. Segments are pruned from the log unless the remaining
# segments drop below log.retention.bytes. Functions independently of log.retention.hours.
#log.retention.bytes=1073741824
# The maximum size of a log segment file. When this size is reached a new log segment will be created.
log.segment.bytes=1073741824
# The interval at which log segments are checked to see if they can be deleted according
# to the retention policies
log.retention.check.interval.ms=300000
############################# Zookeeper #############################
# Zookeeper connection string (see zookeeper docs for details).
# This is a comma separated host:port pairs, each corresponding to a zk
# server. e.g. "127.0.0.1:3000,127.0.0.1:3001,127.0.0.1:3002".
# You can also append an optional chroot string to the urls to specify the
# root directory for all kafka znodes.
zookeeper.connect=zookeeper-1:2181
# Timeout in ms for connecting to zookeeper
zookeeper.connection.timeout.ms=18000
############################# Group Coordinator Settings #############################
# The following configuration specifies the time, in milliseconds, that the GroupCoordinator will delay the initial consumer rebalance.
# The rebalance will be further delayed by the value of group.initial.rebalance.delay.ms as new members join the group, up to a maximum of max.poll.interval.ms.
# The default value for this is 3 seconds.
# We override this to 0 here as it makes for a better out-of-the-box experience for development and testing.
# However, in production environments the default value of 3 seconds is more suitable as this will help to avoid unnecessary, and potentially expensive, rebalances during application startup.
group.initial.rebalance.delay.ms=0

@@ -0,0 +1,136 @@
# Licensed to the Apache Software Foundation (ASF) under one or more
# contributor license agreements. See the NOTICE file distributed with
# this work for additional information regarding copyright ownership.
# The ASF licenses this file to You under the Apache License, Version 2.0
# (the "License"); you may not use this file except in compliance with
# the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# see kafka.server.KafkaConfig for additional details and defaults
############################# Server Basics #############################
# The id of the broker. This must be set to a unique integer for each broker.
broker.id=3
############################# Socket Server Settings #############################
# The address the socket server listens on. It will get the value returned from
# java.net.InetAddress.getCanonicalHostName() if not configured.
# FORMAT:
# listeners = listener_name://host_name:port
# EXAMPLE:
# listeners = PLAINTEXT://your.host.name:9092
#listeners=PLAINTEXT://:9092
# Hostname and port the broker will advertise to producers and consumers. If not set,
# it uses the value for "listeners" if configured. Otherwise, it will use the value
# returned from java.net.InetAddress.getCanonicalHostName().
#advertised.listeners=PLAINTEXT://your.host.name:9092
# Maps listener names to security protocols, the default is for them to be the same. See the config documentation for more details
#listener.security.protocol.map=PLAINTEXT:PLAINTEXT,SSL:SSL,SASL_PLAINTEXT:SASL_PLAINTEXT,SASL_SSL:SASL_SSL
# The number of threads that the server uses for receiving requests from the network and sending responses to the network
num.network.threads=3
# The number of threads that the server uses for processing requests, which may include disk I/O
num.io.threads=8
# The send buffer (SO_SNDBUF) used by the socket server
socket.send.buffer.bytes=102400
# The receive buffer (SO_RCVBUF) used by the socket server
socket.receive.buffer.bytes=102400
# The maximum size of a request that the socket server will accept (protection against OOM)
socket.request.max.bytes=104857600
############################# Log Basics #############################
# A comma separated list of directories under which to store log files
log.dirs=/tmp/kafka-logs
# The default number of log partitions per topic. More partitions allow greater
# parallelism for consumption, but this will also result in more files across
# the brokers.
num.partitions=1
# The number of threads per data directory to be used for log recovery at startup and flushing at shutdown.
# This value is recommended to be increased for installations with data dirs located in RAID array.
num.recovery.threads.per.data.dir=1
############################# Internal Topic Settings #############################
# The replication factor for the group metadata internal topics "__consumer_offsets" and "__transaction_state"
# For anything other than development testing, a value greater than 1 is recommended to ensure availability such as 3.
offsets.topic.replication.factor=1
transaction.state.log.replication.factor=1
transaction.state.log.min.isr=1
############################# Log Flush Policy #############################
# Messages are immediately written to the filesystem but by default we only fsync() to sync
# the OS cache lazily. The following configurations control the flush of data to disk.
# There are a few important trade-offs here:
# 1. Durability: Unflushed data may be lost if you are not using replication.
# 2. Latency: Very large flush intervals may lead to latency spikes when the flush does occur as there will be a lot of data to flush.
# 3. Throughput: The flush is generally the most expensive operation, and a small flush interval may lead to excessive seeks.
# The settings below allow one to configure the flush policy to flush data after a period of time or
# every N messages (or both). This can be done globally and overridden on a per-topic basis.
# The number of messages to accept before forcing a flush of data to disk
#log.flush.interval.messages=10000
# The maximum amount of time a message can sit in a log before we force a flush
#log.flush.interval.ms=1000
############################# Log Retention Policy #############################
# The following configurations control the disposal of log segments. The policy can
# be set to delete segments after a period of time, or after a given size has accumulated.
# A segment will be deleted whenever *either* of these criteria are met. Deletion always happens
# from the end of the log.
# The minimum age of a log file to be eligible for deletion due to age
log.retention.hours=168
# A size-based retention policy for logs. Segments are pruned from the log unless the remaining
# segments drop below log.retention.bytes. Functions independently of log.retention.hours.
#log.retention.bytes=1073741824
# The maximum size of a log segment file. When this size is reached a new log segment will be created.
log.segment.bytes=1073741824
# The interval at which log segments are checked to see if they can be deleted according
# to the retention policies
log.retention.check.interval.ms=300000
############################# Zookeeper #############################
# Zookeeper connection string (see zookeeper docs for details).
# This is a comma separated host:port pairs, each corresponding to a zk
# server. e.g. "127.0.0.1:3000,127.0.0.1:3001,127.0.0.1:3002".
# You can also append an optional chroot string to the urls to specify the
# root directory for all kafka znodes.
zookeeper.connect=zookeeper-1:2181
# Timeout in ms for connecting to zookeeper
zookeeper.connection.timeout.ms=18000
############################# Group Coordinator Settings #############################
# The following configuration specifies the time, in milliseconds, that the GroupCoordinator will delay the initial consumer rebalance.
# The rebalance will be further delayed by the value of group.initial.rebalance.delay.ms as new members join the group, up to a maximum of max.poll.interval.ms.
# The default value for this is 3 seconds.
# We override this to 0 here as it makes for a better out-of-the-box experience for development and testing.
# However, in production environments the default value of 3 seconds is more suitable as this will help to avoid unnecessary, and potentially expensive, rebalances during application startup.
group.initial.rebalance.delay.ms=0

@@ -0,0 +1,24 @@
# Licensed to the Apache Software Foundation (ASF) under one or more
# contributor license agreements. See the NOTICE file distributed with
# this work for additional information regarding copyright ownership.
# The ASF licenses this file to You under the Apache License, Version 2.0
# (the "License"); you may not use this file except in compliance with
# the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# the directory where the snapshot is stored.
dataDir=/tmp/zookeeper
# the port at which the clients will connect
clientPort=2181
# disable the per-ip limit on the number of connections since this is a non-production config
maxClientCnxns=0
# Disable the adminserver by default to avoid port conflicts.
# Set the port to something non-conflicting if choosing to enable this
admin.enableServer=false
# admin.serverPort=8080

@@ -0,0 +1,19 @@
FROM openjdk:11.0.10-jre-buster
RUN apt-get update && \
apt-get install -y curl
ENV KAFKA_VERSION 2.7.0
ENV SCALA_VERSION 2.13
RUN mkdir /tmp/kafka && \
curl "https://archive.apache.org/dist/kafka/${KAFKA_VERSION}/kafka_${SCALA_VERSION}-${KAFKA_VERSION}.tgz" \
-o /tmp/kafka/kafka.tgz && \
mkdir /kafka && cd /kafka && \
tar -xvzf /tmp/kafka/kafka.tgz --strip 1
COPY start-kafka.sh /usr/bin
RUN chmod +x /usr/bin/start-kafka.sh
CMD ["start-kafka.sh"]

@@ -0,0 +1,3 @@
#!/bin/bash -e
exec "/kafka/bin/kafka-server-start.sh" "/kafka/config/server.properties"

@@ -0,0 +1,17 @@
FROM openjdk:11.0.10-jre-buster
ENV KAFKA_VERSION 2.7.0
ENV SCALA_VERSION 2.13
RUN mkdir /tmp/kafka && \
apt-get update && \
apt-get install -y curl
RUN curl "https://archive.apache.org/dist/kafka/${KAFKA_VERSION}/kafka_${SCALA_VERSION}-${KAFKA_VERSION}.tgz" \
-o /tmp/kafka/kafka.tgz && \
mkdir /kafka && cd /kafka && \
tar -xvzf /tmp/kafka/kafka.tgz --strip 1
COPY start-zookeeper.sh /usr/bin
RUN chmod +x /usr/bin/start-zookeeper.sh
CMD ["start-zookeeper.sh"]

@@ -0,0 +1,3 @@
#!/bin/bash -e
exec "/kafka/bin/zookeeper-server-start.sh" "/kafka/config/zookeeper.properties"