How to store your clickstream data
In this blog post, I would like to talk about clickstream data.
Why is it so important to store clickstreams?
There are many answers to this question. In my opinion, the most important reason is that it helps you better understand your visitors' behavior, so you can redesign your business model according to the data.
Table of Contents:
- Which technologies will we use?
- Divolte overview
- Divolte configuration
- Writing clickstream data to Kafka
1. Which technologies will we use?
Divolte:
Divolte is a scalable clickstream collection platform. It provides a JavaScript API and an HTTP endpoint for the client side, and you can store your data in Kafka, HDFS, or on Google Cloud Platform.
Kafka:
Kafka is a distributed streaming platform; we will send the clickstream data from Divolte directly to Kafka.
2. Divolte Overview
Divolte is a collection server for your clickstream data. You can store clickstream data directly in HDFS or Kafka, and it provides a JavaScript tag for the client side.
Divolte uses the Apache Avro serialization system to store the data.
Divolte also provides ip2location support and user-agent parsing, and you can define your custom events in the configuration file.
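For reference, the client side only needs to load the Divolte tag from the collector; divolte.js is the default name and 8290 the default port (this snippet assumes the local demo setup used later in this post):

<script src="http://localhost:8290/divolte.js" defer async></script>

Once the tag is loaded, a global divolte object is available for signaling custom events.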
3. Divolte Configuration
First, you can check out my sample code from GitHub to follow my configuration.
You can find the Divolte configuration files in the “data/divolte” folder.
You will see three different files:
- divolte-collector.conf: You can specify the server configuration in this file. It has four sections; a minimal sketch of how they fit together follows this list.
a. sources: You can define different sources for events, along with the JavaScript file name, cookie name, cookie expiry time, and so on.
b. mappings: You can map your sources to sinks. In my case, I mapped the click_stream source to Kafka.
c. global: Global server settings are stored in this section.
d. sinks: Sink configurations are stored in this section.
- eventRecord.avsc: This file defines the Avro schema of your data.
- mapping.groovy: This script maps your clickstream data onto the Avro schema.
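Here is a minimal, simplified sketch of how these three files could fit together for this setup. The source, sink, and topic names and the item_id/event_date fields follow the examples later in this post, while the file paths and the exact field list are my assumptions; check the actual files in data/divolte for the real values.

divolte-collector.conf, wiring one browser source through one mapping to a Kafka sink:

divolte {
  global {
    kafka {
      // Enable the Kafka flusher; localhost:9092 matches the docker-compose setup.
      enabled = true
      producer = {
        bootstrap.servers = "localhost:9092"
      }
    }
  }
  sources {
    click_stream {
      type = browser
      javascript.name = divolte.js
    }
  }
  mappings {
    click_stream_mapping {
      schema_file = "/opt/divolte/conf/eventRecord.avsc"
      mapping_script_file = "/opt/divolte/conf/mapping.groovy"
      sources = [click_stream]
      sinks = [kafka_sink]
    }
  }
  sinks {
    kafka_sink {
      type = kafka
      topic = click_stream
    }
  }
}

eventRecord.avsc, the Avro schema; I keep the custom parameters as nullable strings to keep the sketch simple:

{
  "namespace": "io.divolte.record",
  "type": "record",
  "name": "EventRecord",
  "fields": [
    { "name": "timestamp",  "type": "long" },
    { "name": "remoteHost", "type": "string" },
    { "name": "eventType",  "type": ["null", "string"], "default": null },
    { "name": "location",   "type": ["null", "string"], "default": null },
    { "name": "item_id",    "type": ["null", "string"], "default": null },
    { "name": "event_date", "type": ["null", "string"], "default": null }
  ]
}

mapping.groovy, filling those fields from the request and the custom event parameters:

mapping {
  map timestamp() onto 'timestamp'
  map remoteHost() onto 'remoteHost'
  map eventType() onto 'eventType'
  map location() onto 'location'
  // Custom values passed to divolte.signal(...) arrive as event parameters.
  map eventParameters().value('item_id') onto 'item_id'
  map eventParameters().value('event_date') onto 'event_date'
}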
In my case, I only customized some basic values. You can configure more if you need to; I recommend the official Divolte documentation for the details.
You will find Kafka already configured in the Docker Compose file.
If you are ready, we can test whether our clickstream data is going to Kafka.
Bring up the docker-compose file via the command below:
docker-compose up -d
You will see the Divolte homepage when you visit http://localhost:8290. You can actually disable this page from the configuration, because you don’t need the welcome page, only the JavaScript tag. I enabled it for the demo :)
Sending a clickstream event through Divolte:
You can run the command below in the browser console to test it:
divolte.signal('event', {"item_id": 10, "event_date":3213213})
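In a real page you would call divolte.signal from your own event handlers instead of the console. Here is a small hypothetical sketch (the product-link selector, the item_click event name, and the data-item-id attribute are my own illustrations, and the Divolte tag from the overview section must already be loaded):

// Hypothetical sketch: signal a custom event whenever a product link is clicked.
document.querySelectorAll('a.product-link').forEach(function (link) {
  link.addEventListener('click', function () {
    divolte.signal('item_click', {
      item_id: link.dataset.itemId,  // read by eventParameters() in mapping.groovy
      event_date: Date.now()
    });
  });
});

The first argument is the event type, and the second is a map of parameters that the mapping script can pick up with eventParameters().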
4. Writing clickstream data to Kafka
Now we can connect to the Kafka server to check the data via this command:
docker-compose exec kafka /opt/kafka/bin/kafka-console-consumer.sh --bootstrap-server localhost:9092 --topic click_stream --from-beginning
You will see strange characters in the messages because they are Avro-serialized binary; don’t worry about the format.
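If you would rather see decoded records than raw bytes, you can deserialize the messages with the same Avro schema. Here is a minimal Node.js sketch, assuming the kafkajs and avsc packages (my choice, not part of this post's setup) and the eventRecord.avsc sketched above; as far as I know, Divolte's Kafka sink writes each record as plain Avro binary without a schema registry:

// Minimal sketch: consume the click_stream topic and decode the Avro records.
const fs = require('fs');
const avro = require('avsc');
const { Kafka } = require('kafkajs');

// Parse the same schema that Divolte used to serialize the records.
const schema = JSON.parse(fs.readFileSync('data/divolte/eventRecord.avsc', 'utf8'));
const type = avro.Type.forSchema(schema);

const kafka = new Kafka({ brokers: ['localhost:9092'] });
const consumer = kafka.consumer({ groupId: 'clickstream-check' });

(async () => {
  await consumer.connect();
  await consumer.subscribe({ topic: 'click_stream', fromBeginning: true });
  await consumer.run({
    eachMessage: async ({ message }) => {
      // Each Kafka message value is one Avro-encoded EventRecord.
      console.log(type.fromBuffer(message.value));
    },
  });
})();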
Thanks for reading.