raystack
diff --git a/‎README.md
Lines changed: 1 addition & 1 deletion b/‎README.md
Lines changed: 1 addition & 1 deletion
diff --git a/‎docs/docs/examples/aggregation_tumble_window.md
Lines changed: 36 additions & 0 deletions b/‎docs/docs/examples/aggregation_tumble_window.md
Lines changed: 36 additions & 0 deletions
diff --git a/‎docs/docs/examples/deduplication_transformer.md
Lines changed: 36 additions & 0 deletions b/‎docs/docs/examples/deduplication_transformer.md
Lines changed: 36 additions & 0 deletions
diff --git a/‎docs/docs/examples/distance_java_udf.md
Lines changed: 36 additions & 0 deletions b/‎docs/docs/examples/distance_java_udf.md
Lines changed: 36 additions & 0 deletions
diff --git a/‎docs/docs/examples/elasticsearch_enrichment.md
Lines changed: 35 additions & 0 deletions b/‎docs/docs/examples/elasticsearch_enrichment.md
Lines changed: 35 additions & 0 deletions
diff --git a/‎docs/docs/examples/kafka_inner_join.md
Lines changed: 36 additions & 0 deletions b/‎docs/docs/examples/kafka_inner_join.md
Lines changed: 36 additions & 0 deletions
diff --git a/‎docs/docs/examples/overview.md
Lines changed: 9 additions & 0 deletions b/‎docs/docs/examples/overview.md
Lines changed: 9 additions & 0 deletions
diff --git a/‎docs/docs/guides/quickstart.md
Lines changed: 56 additions & 30 deletions b/‎docs/docs/guides/quickstart.md
Lines changed: 56 additions & 30 deletions
diff --git a/‎docs/docs/intro.md
Lines changed: 1 addition & 0 deletions b/‎docs/docs/intro.md
Lines changed: 1 addition & 0 deletions
diff --git a/‎docs/sidebars.js
Lines changed: 12 additions & 0 deletions b/‎docs/sidebars.js
Lines changed: 12 additions & 0 deletions
@@ -39,7 +39,7 @@ Explore the following resources to get started with Dagger:
 * [Reference](https://odpf.github.io/dagger/docs/reference/overview) contains details about configurations, metrics and other aspects of Dagger.
 * [Contribute](https://odpf.github.io/dagger/docs/contribute/contribution) contains resources for anyone who wants to contribute to Dagger.
 * [Usecase](https://odpf.github.io/dagger/docs/usecase/overview) describes examples use cases which can be solved via Dagger.
-
+* [Examples](https://odpf.github.io/dagger/docs/examples/overview) contains tutorials to try out some of Dagger's features with real-world usecases
 ## Running locally
 
 Please follow this [Dagger Quickstart Guide](https://odpf.github.io/dagger/docs/guides/quickstart) for setting up a local running Dagger consuming from Kafka or to set up a Docker Compose for Dagger.
 
@@ -0,0 +1,36 @@
+# Data Aggregation using a Tumble Window
+
+## About this example
+In this example, we will count the number of booking orders,(as Kafka records) in every 30 second interval. By the end of this example we will understand how to use Dagger to aggregate data over a specified time window.
+
+
+## Before Trying This Example
+
+
+1. **You must have Docker installed**. We can follow [this guide](https://docs.docker.com/get-docker/) on how to install and set up Docker in your local machine.
+2. Clone Dagger repository into your local
+
+   ```shell
+   git clone https://github.com/odpf/dagger.git
+   ```
+
+## Steps
+
+Following are the steps for setting up dagger in docker compose -
+
+1. cd into the aggregation directory:
+   ```shell
+   cd dagger/quickstart/examples/aggregation/tumble_window 
+   ```
+2. fire this command to spin up the docker compose:
+   ```shell
+   docker compose up 
+   ```
+   Hang on for a while as it installs all the required dependencies and starts all the required services. After a while we should see the output of the Dagger SQL query in the terminal, which will be the count of booking orders in every 30 second interval.
+3. fire this command to gracefully close the docker compose:
+   ```shell
+   docker compose down 
+   ```
+   This will stop all services and remove all the containers.
+
+Congratulations, we are now able to use Dagger for performing aggregation over a tumble window!
@@ -0,0 +1,36 @@
+# Removing duplicate records using Transformers
+
+## About this example
+In this example, we will use the DeDuplication Transformer in Dagger to remove the booking orders (as Kafka records) having duplicate `order_number`. By the end of this example we will understand how to use Dagger to remove duplicate data from Kafka source.
+
+
+## Before Trying This Example
+
+
+1. **We must have Docker installed**. We can follow [this guide](https://docs.docker.com/get-docker/) on how to install and set up Docker in your local machine.
+2. Clone Dagger repository into your local
+
+   ```shell
+   git clone https://github.com/odpf/dagger.git
+   ```
+
+## Steps
+
+Following are the steps for setting up dagger in docker compose -
+
+1. cd into the aggregation directory:
+   ```shell
+   cd dagger/quickstart/examples/aggregation/tumble_window 
+   ```
+2. fire this command to spin up the docker compose:
+   ```shell
+   docker compose up 
+   ```
+   Hang on for a while as it installs all the required dependencies and starts all the required services. After a while we should see the output of the Dagger SQL query in the terminal, which will be the booking logs without any duplicate `order_number`.
+3. fire this command to gracefully close the docker compose:
+   ```shell
+   docker compose down 
+   ```
+   This will stop and remove all the containers.
+
+Congratulations, we are now able to use Dagger to remove duplicate data from Kafka source!   
@@ -0,0 +1,36 @@
+# Distance computation using Java UDF
+
+## About this example
+In this example, we will use a User-Defined Function in Dagger to compute the distance between the driver pickup location and the driver dropoff location for each booking log (as Kafka record) . By the end of this example we will understand how to use Dagger UDFs to add more functionality and simplify our queries.
+
+
+## Before Trying This Example
+
+
+1. **We must have Docker installed**. We can follow [this guide](https://docs.docker.com/get-docker/) on how to install and set up Docker in your local machine.
+2. Clone Dagger repository into your local
+
+   ```shell
+   git clone https://github.com/odpf/dagger.git
+   ```
+
+## Steps
+
+Following are the steps for setting up dagger in docker compose -
+
+1. cd into the aggregation directory:
+   ```shell
+   cd dagger/quickstart/examples/aggregation/tumble_window 
+   ```
+2. fire this command to spin up the docker compose:
+   ```shell
+   docker compose up 
+   ```
+   Hang on for a while as it installs all the required dependencies and starts all the required services. After a while we should see the output of the Dagger SQL query in the terminal, which will be the distance between the driver pickup location and the driver dropoff location for each booking log.
+3. fire this command to gracefully close the docker compose:
+   ```shell
+   docker compose down 
+   ```
+   This will stop and remove all the containers.
+
+Congratulations, we are now able to use Dagger UDF to calculate distance easily!   
@@ -0,0 +1,35 @@
+# Stream enrichment using ElasticSearch source
+
+## About this example
+In this example, we will use Dagger Post-processors to enrich the payment transaction logs (from Kafka source), in the input stream with user profile information from an external source i.e. Elasticsearch, to get the user profile information in each record. At the end of this example, we will be able to use Dagger to enrich our data stream from Kafka with the data on any remote ElasticSearch server.
+
+## Before Trying This Example
+
+
+1. **You must have Docker installed**. We can follow [this guide](https://docs.docker.com/get-docker/) on how to install and set up Docker in your local machine.
+2. Clone Dagger repository into your local
+
+   ```shell
+   git clone https://github.com/odpf/dagger.git
+   ```
+
+## Steps
+
+Following are the steps for setting up dagger in docker compose -
+
+1. cd into the aggregation directory:
+   ```shell
+   cd dagger/quickstart/examples/enrichment/elasticsearch_enrichment 
+   ```
+2. fire this command to spin up the docker compose:
+   ```shell
+   docker compose up 
+   ```
+   Hang on for a while as it installs all the required dependencies and starts all the required services. After a while we should see the output of the Dagger SQL query in the terminal, which will be the enriched booking log with the customer profile information.
+3. fire this command to gracefully close the docker compose:
+   ```shell
+   docker compose down 
+   ```
+   This will stop all services and remove all the containers.
+
+Congratulations, we are now able to use Dagger to enrich our data stream from Kafka with the data on any remote ElasticSearch server.   
@@ -0,0 +1,36 @@
+# Joining two Kafka topics using Inner join
+
+## About this example
+In this example, we will use the Inner joins in Dagger to join the data streams from two different Kafka topics and count the number of booking logs in every 30 second interval from both the sources combined for each service type. By the end of this example we will understand how to use inner joins to combine 2 or more Kafka streams.
+
+
+## Before Trying This Example
+
+
+1. **We must have Docker installed**. We can follow [this guide](https://docs.docker.com/get-docker/) on how to install and set up Docker in your local machine.
+2. Clone Dagger repository into your local
+
+   ```shell
+   git clone https://github.com/odpf/dagger.git
+   ```
+
+## Steps
+
+Following are the steps for setting up dagger in docker compose -
+
+1. cd into the aggregation directory:
+   ```shell
+   cd dagger/quickstart/examples/aggregation/tumble_window 
+   ```
+2. fire this command to spin up the docker compose:
+   ```shell
+   docker compose up 
+   ```
+   Hang on for a while as it installs all the required dependencies and starts all the required services. After a while we should see the output of the Dagger SQL query in the terminal, which will be the number of booking logs in every 30 second interval from both the Kafka sources combined, for each service type.
+3. fire this command to gracefully close the docker compose:
+   ```shell
+   docker compose down 
+   ```
+   This will stop and remove all the containers.
+
+Congratulations, we are now able to use Dagger to combine 2 or more Kafka streams.!   
@@ -0,0 +1,9 @@
+# Overview
+
+The following example tutorials will help you to quickly try out some of Dagger's most useful features with real-world usecases - 
+
+- [Data Aggregation using a Tumble Window](../examples/aggregation_tumble_window.md)
+- [Removing duplicate records using Transformers](../examples/deduplication_transformer.md)
+- [Distance computation using Java UDF](../examples/distance_java_udf.md)
+- [Stream enrichment using ElasticSearch source](../examples/elasticsearch_enrichment.md)
+- [Joining two Kafka topics using Inner join](../examples/kafka_inner_join.md)
@@ -1,14 +1,67 @@
 # Dagger Quickstart
 
-## Prerequisites
+There are 2 ways to set up and get dagger running in your machine in no time - 
+1. **[Docker Compose Setup](quickstart.md#docker-compose-setup)** - recommended for beginners
+2. **[Local Installation Setup](quickstart.md#local-installation-setup)** - for more advanced usecases
+
+## Docker Compose Setup
+
+### Prerequisites
+
+1. **You must have docker installed**
+
+Following are the steps for setting up dagger in docker compose -
+1. Clone Dagger repository into your local
+
+   ```shell
+   git clone https://github.com/odpf/dagger.git
+   ```
+2. cd into the docker-compose directory:
+   ```shell
+   cd dagger/quickstart/docker-compose 
+   ```
+3. fire this command to spin up the docker compose:
+   ```shell
+   docker compose up 
+   ```
+This will spin up docker containers for the kafka, zookeeper, stencil, kafka-producer and the dagger.
+4. fire this command to gracefully stop all the docker containers. This will save the container state and help to speed up the setup next time. All the kafka records and topics will also be saved  :
+   ```shell
+   docker compose stop 
+   ```
+   To start the containers from their saved state run this command
+   ```shell
+   docker compose start 
+   ```
+5. fire this command to gracefully remove all the containers. This will delete all the kafka topics/ saved data as well:
+   ```shell
+   docker compose down 
+   ```
+   
+### Workflow
+
+Following are the containers that are created, in chronological order, when you run `docker compose up`  - 
+
+1. **Zookeeper** -  Container for the Zookeeper service is created and listening on port 2187. Zookeeper is a service required by the Kafka server.
+2. **Kafka** - Container for Kafka server is created and is exposed on port 29094. This will serve as the input data source for the Dagger.
+3. **init-kafka** - This container creates the kafka topic `dagger-test-topic-v1` from which the dagger will pull the Kafka messages.
+4. **Stencil** - It compiles the proto file and creates a proto descriptor. Also it sets up an http server serving the proto descriptors required by dagger to parse the Kafka messages. 
+5. **kafka-producer** - It runs a script to generate the random kafka messages and sends one message to the kafka topic every second.
+6. **Dagger** - Clones the Dagger Github repository and builds the jar. Then it creates an in-memory flink cluster and uploads the dagger job jar and starts the job.
+
+The dagger environment variables are present in the `local.properties` file inside the `quickstart/docker-compose/resources` directory. The dagger runs a simple aggregation query which will count the number of bookings , i.e. kafka messages, in every 30 seconds interval. The output will be visible in the logs in the terminal itself. You can edit this query (`FLINK_SQL_QUERY` variable) in the `local.properties` file inside the `quickstart/docker-compose/resources` directory.
+
+## Local Installation Setup
+
+### Prerequisites
 
 1. **Your Java version is Java 8**: Dagger as of now works only with Java 8. Some features might not work with older or later versions.
 2. Your **Kafka** version is **3.0.0** or a minor version of it
 3. You have **kcat** installed: We will use kcat to push messages to Kafka from the CLI. You can follow the installation steps [here](https://github.com/edenhill/kcat). Ensure the version you install is 1.7.0 or a minor version of it.
 4. You have **protobuf** installed: We will use protobuf to push messages encoded in protobuf format to Kafka topic. You can follow the installation steps for MacOS [here](https://formulae.brew.sh/formula/protobuf). For other OS, please download the corresponding release from [here](https://github.com/protocolbuffers/protobuf/releases). Please note, this quickstart has been written to work with[ 3.17.3](https://github.com/protocolbuffers/protobuf/releases/tag/v3.17.3) of protobuf. Compatibility with other versions is unknown.
 5. You have **Python 2.7+** and **simple-http-server** installed: We will use Python along with simple-http-server to spin up a mock Stencil server which can serve the proto descriptors to Dagger. To install **simple-http-server**, please follow these [installation steps](https://pypi.org/project/simple-http-server/).
 
-## Quickstart
+### Quickstart
 
 1. Clone Dagger repository into your local
 
@@ -52,7 +105,7 @@ The Stencil client being used in Dagger will fetch it by calling this URL. This
 
 After some initialization logs, you should see the output of the SQL query getting printed.
 
-## Troubleshooting
+### Troubleshooting
 
 1. **I am pushing messages to the kafka topic but not seeing any output in the logs.** 
 
@@ -65,30 +118,3 @@ After some initialization logs, you should see the output of the SQL query getti
 2. **I see an exception `java.lang.RuntimeException: Unable to retrieve any partitions with KafkaTopicsDescriptor: Topic Regex Pattern`**
 
    This can happen if the topic configured under `STREAMS` -> `SOURCE_KAFKA_TOPIC_NAMES` in `local.properties` is new and you have not pushed any messages to it yet. Ensure that you have pushed atleast one message to the topic before you start dagger.
-
-## Docker Compose Setup
-
-### Prerequisites
-
-1. **You must have docker installed**
-
-Following are the steps for setting up dagger in docker compose - 
-1. Clone Dagger repository into your local
-
-   ```shell
-   git clone https://github.com/odpf/dagger.git
-   ```
-2. cd into the docker-compose directory:
-   ```shell
-   cd dagger/quickstart/docker-compose 
-   ```
-3. fire this command to spin up the docker compose:
-   ```shell
-   docker compose up 
-   ```
-This will spin up docker containers for the kafka, zookeeper, stencil, kafka-producer and the dagger.
-4. fire this command to gracefully close the docker compose:
-   ```shell
-   docker compose down 
-   ```
-   
 
@@ -41,3 +41,4 @@ Explore the following resources to get started with Dagger:
 - [Reference](./reference/overview.md) contains details about configurations, metrics and other aspects of Dagger.
 - [Contribute](./contribute/contribution.md) contains resources for anyone who wants to contribute to Dagger.
 - [Usecase](./usecase/overview.md) describes examples use cases which can be solved via Dagger.
+- [Examples](./examples/overview.md) contains tutorials to try out some of Dagger's features with real-world usecases
@@ -62,6 +62,18 @@ module.exports = {
         "reference/udfs"
       ],
     },
+    {
+      type: "category",
+      label: "Examples",
+      items: [
+        "examples/overview",
+        "examples/aggregation_tumble_window",
+        "examples/deduplication_transformer",
+        "examples/distance_java_udf",
+        "examples/elasticsearch_enrichment",
+        "examples/kafka_inner_join"
+      ],
+    },
     {
       type: "category",
       label: "Contribute",