add steps page

rtso · rtso · commit 5ac661e85d99 · 2025-01-08T17:15:33.000-05:00
diff --git a/apps/nextra/next.config.mjs b/apps/nextra/next.config.mjs
@@ -407,6 +407,11 @@ export default withBundleAnalyzer(
         destination: "/en/build/indexer/indexer-sdk/documentation/setup",
         permanent: true,
       },
+      {
+        source: "/indexer/indexer-sdk/documentation/steps",
+        destination: "/en/build/indexer/indexer-sdk/documentation/steps",
+        permanent: true,
+      },
       {
         source: "/indexer/indexer-sdk/documentation/parsing-txns",
         destination: "/en/build/indexer/indexer-sdk/documentation/parsing-txns",
diff --git a/apps/nextra/pages/en/build/indexer/indexer-sdk/documentation/_meta.tsx b/apps/nextra/pages/en/build/indexer/indexer-sdk/documentation/_meta.tsx
@@ -1,5 +1,8 @@
 export default {
   setup: { title: "Initial Setup" },
+  "steps": {
+    title: "Creating a Step",
+  },
   "parsing-txns": {
     title: "Parsing Transactions",
   },
diff --git a/apps/nextra/pages/en/build/indexer/indexer-sdk/documentation/setup.mdx b/apps/nextra/pages/en/build/indexer/indexer-sdk/documentation/setup.mdx
@@ -1,8 +1,6 @@
 ---
 title: "Initial Setup"
 ---
-import { Callout } from 'nextra/components'
-import { IndexerBetaNotice } from '@components/index';
 
 # Initial Setup
 
diff --git a/apps/nextra/pages/en/build/indexer/indexer-sdk/documentation/steps.mdx b/apps/nextra/pages/en/build/indexer/indexer-sdk/documentation/steps.mdx
@@ -0,0 +1,121 @@
+---
+title: "Creating a Step"
+---
+
+# Creating a Step
+
+## What is a step?
+A step is a unit of processing lgoic in the SDK. It can be used to extract, transform, or store data. Steps are the building blocks of a processor.
+
+There are two types of steps in the SDK:
+1. **AsyncStep**: Processes a batch of input items and returns a batch of output items.
+2. **PollableAsyncStep**: Does the same as `AsyncStep`, but it also periodically polls its internal state and returns a batch of output items if available.
+
+## How to create a Step
+To create a step with the SDK, follow these instructions:
+
+1. Implement the `Processable` trait. This trait defines several important details about the step: the input and output types, the processing logic, and the run type (either `AsyncStepRunType` or `PollableAsyncStepRunType`).
+    
+    ```rust
+    #[async_trait]
+    impl Processable for MyExtractorStep {
+        // The Input is a batch of Transaction 
+        type Input = Transaction;
+        // The Output is a batch of MyData
+        type Output = MyData;
+
+        // Depending on the type of step this is, the RunType is either
+        // - AsyncRunType
+        // - PollableAsyncRunType
+        type RunType = AsyncRunType;
+    
+    	// Processes a batch of input items and returns a batch of output items.
+        async fn process(
+            &mut self,
+            input: TransactionContext<Transaction>,
+        ) -> Result<Option<TransactionContext<MyData>>, ProcessorError> {
+            let transactions = input.data;
+            let data = transactions.iter().map(|transaction| {
+                // Define the processing logic to extract MyData from a Transaction
+            }).collect();
+            
+            Ok(Some(TransactionContext {
+            data,
+            metadata: input.metadata,
+            }))
+        }
+    }
+    ```
+
+    In the example code above, you'll notice that the input and output types are wrapped within a `TransactionContext`.
+    `TransactionContext` contains relevant metadata about the batch of data being processed, such as the transaction versions and timestamp, and are used for metrics and logging. 
+    
+2. Implement the `NamedStep` trait. This is used for logging.
+    
+    ```rust
+    impl NamedStep for MyExtractorStep {
+        fn name(&self) -> String {
+            "MyExtractorStep".to_string()
+        }
+    }
+    ```
+    
+3. Implement either `AsyncStep` trait or `PollableAsyncStep` trait, which defines how the step will be run in the processor.
+    1. If you're using `AsyncStep`, add this to your code:
+        
+        ```rust        
+        impl AsyncStep for MyExtractorStep {}
+        ```
+        
+    2. If you're creating a `PollableAsyncStep`, you will need to define the poll interval and what the step should do every time it polls.
+        
+        ```rust        
+        #[async_trait]
+        impl<T: Send + 'static> PollableAsyncStep for MyPollStep<T>
+        where
+            Self: Sized + Send + Sync + 'static,
+            T: Send + 'static,
+        {
+            fn poll_interval(&self) -> std::time::Duration {
+                // Define duration
+            }
+        
+            async fn poll(&mut self) -> Result<Option<Vec<TransactionContext<T>>>, ProcessorError> {
+                // Define code here on what this step should do every time it polls
+                // Optionally return a batch of output items
+            }
+        }
+        ```
+
+## How to connect step's 
+
+Now that you have created a step, you can connect it to other steps in the processor. 
+`ProcessorBuilder` is used to connect a graph of steps to construct a processor. 
+It uses trait bounds to ensure that the output type of each step matches the input type of its connected step. 
+
+### How to use `ProcessorBuilder`
+
+1. Initialize the processor with the first step using `ProcessorBuilder::new_with_inputless_first_step`. 
+2. Connect the next step using `.connect_to(second_step.into_runnable_step(), channel_size)`. 
+- `.into_runnable_step()` converts your step into a `RunnableStep`, which enables it to store the step's input and output channels and allows the step to be spawned in a task. 
+- When you use `.connect_to`, a channel gets created with size `channel_size` and connected to the previous and current steps, and the previous step is spawned in a task. 
+3. To close off the `ProcessorBuilder`, use `.end_and_return_output_receiver(channel_size)`. This returns a channel receiver that can be used to receive the final output of the processor.
+
+### Example
+```rust
+let (pb, buffer_receiver) = ProcessorBuilder::new_with_inputless_first_step(
+      first_step.into_runnable_step(),
+  )
+  .connect_to(second_step.into_runnable_step(), channel_size)
+  .connect_to(third_step.into_runnable_step(), channel_size)
+  .end_and_return_output_receiver(channel_size);
+```
+
+## Common steps
+
+The SDK provides several common steps that you can use in your processor. 
+
+1. `TransactionStreamStep` provides a stream of Aptos transactions to the processor
+2. `TimedBufferStep` buffers a batch of items and periodically polls to release the items to the next step
+
+{/* <!-- Add the rest of the common SDK steps --> */}
diff --git a/apps/nextra/pages/en/build/indexer/indexer-sdk/quickstart.mdx b/apps/nextra/pages/en/build/indexer/indexer-sdk/quickstart.mdx
@@ -1,9 +1,11 @@
 # Quickstart Guide on Aptos Indexer SDK
-In this guide, we’re going to walk you through all the steps involved with creating a basic events processor in Rust to
-track events on the Aptos blockchain. At the end of this guide, you’ll be able to run a simple events processor and customize
-the processor for your indexing needs.
 
-# Getting started
+## What to expect from this guide
+This guide will walk you through setting up and running a Rust processor to index events on the Aptos blockchain into PostgreSQL. 
+We provide a template processor that you can customize to index events from your custom contracts.
+By the end of the guide, you should have a basic understanding of how a processor works and be able to customize the processor for your indexing needs.
+
+## Getting started
 
 To get started, clone
 the [aptos-indexer-processors-example](https://github.com/aptos-labs/aptos-indexer-processor-example/tree/main) repo.
@@ -48,7 +50,7 @@ towards PostgreSQL for the sake of simplicity. We use the following database con
 Explaining how to create a database is beyond the scope of this tutorial. If you are not sure how to do it, consider
 checking out tutorials on how to create a database with the `psql` tool.
 
-# Setting up your environment
+## Setting up your environment
 
 Make sure to start the `postgresql` service:
 
@@ -64,7 +66,7 @@ For mac, if you’re using brew, start it up with:
 brew services start postgresql
 ```
 
-# **Configuring your processor**
+## **Configuring your processor**
 
 Now let’s set up the configuration details for the actual indexer processor we’re going to use.
 
@@ -122,7 +124,7 @@ indexer_grpc_data_service_address: grpc.testnet.aptoslabs.com:443
 indexer_grpc_data_service_address: grpc.mainnet.aptoslabs.com:443
 ```
 
-# Explanation of the processor
+## Explanation of the processor
 
 At a high level, each processor is responsible for receiving a stream of transactions, parsing and transforming the
 relevant data, and storing the data into a database.