|
1 | 1 | ---
|
2 |
| -title: Integrating New Networks |
| 2 | +title: New Chain Integration |
3 | 3 | ---
|
4 | 4 |
|
5 |
| -Graph Node can currently index data from the following chain types: |
| 5 | +Chains can bring subgraph support to their ecosystem by starting a new `graph-node` integration. Subgraphs are a powerful indexing tool opening a world of possibilities for developers. Graph Node already indexes data from the chains listed here. If you are interested in a new integration, there are 2 integration strategies: |
6 | 6 |
|
7 |
| -- Ethereum, via EVM JSON-RPC and [Ethereum Firehose](https://github.com/streamingfast/firehose-ethereum) |
8 |
| -- NEAR, via a [NEAR Firehose](https://github.com/streamingfast/near-firehose-indexer) |
9 |
| -- Cosmos, via a [Cosmos Firehose](https://github.com/graphprotocol/firehose-cosmos) |
10 |
| -- Arweave, via an [Arweave Firehose](https://github.com/graphprotocol/firehose-arweave) |
| 7 | +1. **EVM JSON-RPC** |
| 8 | +2. **Firehose**: All Firehose integration solutions include Substreams, a large-scale streaming engine based off Firehose with native `graph-node` support, allowing for parallelized transforms. |
11 | 9 |
|
12 |
| -If you are interested in any of those chains, integration is a matter of Graph Node configuration and testing. |
| 10 | +> Note that while the recommended approach is to develop a new Firehose for all new chains, it is only required for non-EVM chains. |
13 | 11 |
|
14 |
| -If you are interested in a different chain type, a new integration with Graph Node must be built. Our recommended approach is developing a new Firehose for the chain in question and then the integration of that Firehose with Graph Node. More info below. |
| 12 | +## Integration Strategies |
15 | 13 |
|
16 |
| -**1. EVM JSON-RPC** |
| 14 | +### 1. EVM JSON-RPC |
17 | 15 |
|
18 |
| -If the blockchain is EVM equivalent and the client/node exposes the standard EVM JSON-RPC API, Graph Node should be able to index the new chain. For more information, refer to [Testing an EVM JSON-RPC](new-chain-integration#testing-an-evm-json-rpc). |
| 16 | +If the blockchain is EVM equivalent and the client/node exposes the standard EVM JSON-RPC API, Graph Node should be able to index the new chain. |
19 | 17 |
|
20 |
| -**2. Firehose** |
| 18 | +#### Testing an EVM JSON-RPC |
21 | 19 |
|
22 |
| -For non-EVM-based chains, Graph Node must ingest blockchain data via gRPC and known type definitions. This can be done via [Firehose](firehose/), a new technology developed by [StreamingFast](https://www.streamingfast.io/) that provides a highly-scalable indexing blockchain solution using a files-based and streaming-first approach. Reach out to the [StreamingFast team](mailto:[email protected]/) if you need help with Firehose development. |
| 20 | +For Graph Node to be able to ingest data from an EVM chain, the RPC node must expose the following EVM JSON-RPC methods: |
23 | 21 |
|
24 |
| -## Difference between EVM JSON-RPC & Firehose |
| 22 | +- `eth_getLogs` |
| 23 | +- `eth_call` (for historical blocks, with EIP-1898 - requires archive node) |
| 24 | +- `eth_getBlockByNumber` |
| 25 | +- `eth_getBlockByHash` |
| 26 | +- `net_version` |
| 27 | +- `eth_getTransactionReceipt`, in a JSON-RPC batch request |
| 28 | +- `trace_filter` *(optionally required for Graph Node to support call handlers)* |
25 | 29 |
|
26 |
| -While the two are suitable for subgraphs, a Firehose is always required for developers wanting to build with [Substreams](substreams/), like building [Substreams-powered subgraphs](cookbook/substreams-powered-subgraphs/). In addition, Firehose allows for improved indexing speeds when compared to JSON-RPC. |
| 30 | +### 2. Firehose Integration |
27 | 31 |
|
28 |
| -New EVM chain integrators may also consider the Firehose-based approach, given the benefits of substreams and its massive parallelized indexing capabilities. Supporting both allows developers to choose between building substreams or subgraphs for the new chain. |
| 32 | +[Firehose](https://firehose.streamingfast.io/firehose-setup/overview) is a next-generation extraction layer. It collects history in flat files and streams in real time. Firehose technology replaces those polling API calls with a stream of data utilizing a push model that sends data to the indexing node faster. This helps increase the speed of syncing and indexing. |
29 | 33 |
|
30 |
| -> **NOTE**: A Firehose-based integration for EVM chains will still require Indexers to run the chain's archive RPC node to properly index subgraphs. This is due to the Firehose's inability to provide smart contract state typically accessible by the `eth_call` RPC method. (It's worth reminding that eth_calls are [not a good practice for developers](https://thegraph.com/blog/improve-subgraph-performance-reduce-eth-calls/)) |
| 34 | +The primary method to integrate the Firehose into chains is to use an RPC polling strategy. Our polling algorithm will predict when a new block will arrive and increase the rate at which it checks for a new block near that time, making it a very low-latency and efficient solution. For help with the integration and maintenance of the Firehose, contact the [StreamingFast team](https://www.streamingfast.io/firehose-integration-program). New chains and their integrators will appreciate the [fork awareness](https://substreams.streamingfast.io/documentation/consume/reliability-guarantees) and massive parallelized indexing capabilities that Firehose and Substreams bring to their ecosystem. |
31 | 35 |
|
32 |
| ---- |
| 36 | +> NOTE: All integrations done by the StreamingFast team include maintenance for the Firehose replication protocol into the chain's codebase. StreamingFast tracks any changes and releases binaries when you change code and when StreamingFast changes code. This includes releasing Firehose/Substreams binaries for the protocol, maintaining Substreams modules for the block model of the chain, and releasing binaries for the blockchain node with instrumentation if need be. |
33 | 37 |
|
34 |
| -## Testing an EVM JSON-RPC |
| 38 | +#### Specific Firehose Instrumentation for EVM (`geth`) chains |
35 | 39 |
|
36 |
| -For Graph Node to be able to ingest data from an EVM chain, the RPC node must expose the following EVM JSON RPC methods: |
| 40 | +For EVM chains, there exists a deeper level of data that can be achieved through the `geth` [live-tracer](https://github.com/ethereum/go-ethereum/releases/tag/v1.14.0), a collaboration between Go-Ethereum and StreamingFast, in building a high-throughput and rich transaction tracing system. The Live Tracer is the most comprehensive solution, resulting in [Extended](https://streamingfastio.medium.com/new-block-model-to-accelerate-chain-integration-9f65126e5425) block details. This enables new indexing paradigms, like pattern matching of events based on state changes, calls, parent call trees, or triggering of events based on changes to the actual variables in a smart contract. |
37 | 41 |
|
38 |
| -- `eth_getLogs` |
39 |
| -- `eth_call` \_(for historical blocks, with EIP-1898 - requires archive node): |
40 |
| -- `eth_getBlockByNumber` |
41 |
| -- `eth_getBlockByHash` |
42 |
| -- `net_version` |
43 |
| -- `eth_getTransactionReceipt`, in a JSON-RPC batch request |
44 |
| -- _`trace_filter`_ _(optionally required for Graph Node to support call handlers)_ |
| 42 | + |
| 43 | + |
| 44 | +> NOTE: This improvement upon the Firehose requires chains make use of the EVM engine `geth version 1.13.0` and up. |
45 | 45 |
|
46 |
| -### Graph Node Configuration |
| 46 | +## EVM considerations - Difference between JSON-RPC & Firehose |
47 | 47 |
|
48 |
| -**Start by preparing your local environment** |
| 48 | +While the JSON-RPC and Firehose are both suitable for subgraphs, a Firehose is always required for developers wanting to build with [Substreams](https://substreams.streamingfast.io). Supporting Substreams allows developers to build [Substreams-powered subgraphs](/cookbook/substreams-powered-subgraphs) for the new chain, and has the potential to improve the performance of your subgraphs. Additionally, Firehose — as a drop-in replacement for the JSON-RPC extraction layer of `graph-node` — reduces by 90% the number of RPC calls required for general indexing. |
| 49 | + |
| 50 | +- All those `getLogs` calls and roundtrips get replaced by a single stream arriving into the heart of `graph-node`; a single block model for all subgraphs it processes. |
| 51 | + |
| 52 | +> NOTE: A Firehose-based integration for EVM chains will still require Indexers to run the chain's archive RPC node to properly index subgraphs. This is due to the Firehose's inability to provide smart contract state typically accessible by the `eth_call` RPC method. (It's worth reminding that `eth_calls` are not a good practice for developers) |
| 53 | +
|
| 54 | +## Graph Node Configuration |
| 55 | + |
| 56 | +Configuring Graph Node is as easy as preparing your local environment. Once your local environment is set, you can test the integration by locally deploying a subgraph. |
49 | 57 |
|
50 | 58 | 1. [Clone Graph Node](https://github.com/graphprotocol/graph-node)
|
51 |
| -2. Modify [this line](https://github.com/graphprotocol/graph-node/blob/master/docker/docker-compose.yml#L22) to include the new network name and the EVM JSON RPC compliant URL |
52 |
| - > Do not change the env var name itself. It must remain `ethereum` even if the network name is different. |
53 |
| -3. Run an IPFS node or use the one used by The Graph: https://api.thegraph.com/ipfs/ |
| 59 | +2. Modify [this line](https://github.com/graphprotocol/graph-node/blob/master/docker/docker-compose.yml#L22) to include the new network name and the EVM JSON-RPC compliant URL |
| 60 | + |
| 61 | + > Do not change the env var name itself. It must remain `ethereum` even if the network name is different. |
| 62 | +
|
| 63 | +3. Run an IPFS node or use the one used by The Graph: https://api.thegraph.com/ipfs/ |
54 | 64 |
|
55 |
| -**Test the integration by locally deploying a subgraph** |
| 65 | +### Testing an EVM JSON-RPC by locally deploying a subgraph |
56 | 66 |
|
57 |
| -1. Install [graph-cli](https://github.com/graphprotocol/graph-tooling/tree/main/packages/cli) |
| 67 | +1. Install [graph-cli](https://github.com/graphprotocol/graph-cli) |
58 | 68 | 2. Create a simple example subgraph. Some options are below:
|
59 |
| - 1. The pre-packed [Gravitar](https://github.com/graphprotocol/example-subgraph/tree/f89bdd4628efa4badae7367d4919b3f648083323) smart contract and subgraph is a good starting point |
60 |
| - 2. Bootstrap a local subgraph from any existing smart contract or solidity dev environment [using Hardhat with a Graph plugin](https://github.com/graphprotocol/hardhat-graph) |
61 |
| -3. Adapt the resulting `subgraph.yaml` by changing `dataSources.network` to the same name previously passed on to Graph Node. |
62 |
| -4. Create your subgraph in Graph Node: `graph create $SUBGRAPH_NAME --node $GRAPH_NODE_ENDPOINT` |
63 |
| -5. Publish your subgraph to Graph Node: `graph deploy $SUBGRAPH_NAME --ipfs $IPFS_ENDPOINT --node $GRAPH_NODE_ENDPOINT` |
| 69 | + 1. The pre-packed [Gravitar](https://github.com/graphprotocol/example-subgraph/tree/f89bdd4628efa4badae7367d4919b3f648083323) smart contract and subgraph is a good starting point |
| 70 | + 2. Bootstrap a local subgraph from any existing smart contract or solidity dev environment [using Hardhat with a Graph plugin](https://github.com/graphprotocol/hardhat-graph) |
| 71 | +3. Adapt the resulting `subgraph.yaml` by changing `dataSources.network` to the same name previously passed on to Graph Node. |
| 72 | +4. Create your subgraph in Graph Node: `graph create $SUBGRAPH_NAME --node $GRAPH_NODE_ENDPOINT` |
| 73 | +5. Publish your subgraph to Graph Node: `graph deploy $SUBGRAPH_NAME --ipfs $IPFS_ENDPOINT --node $GRAPH_NODE_ENDPOINT` |
64 | 74 |
|
65 | 75 | Graph Node should be syncing the deployed subgraph if there are no errors. Give it time to sync, then send some GraphQL queries to the API endpoint printed in the logs.
|
66 | 76 |
|
67 |
| ---- |
| 77 | +## Substreams-powered Subgraphs |
| 78 | + |
| 79 | +For StreamingFast-led Firehose/Substreams integrations, basic support for foundational Substreams modules (e.g. decoded transactions, logs and smart-contract events) and Substreams-powered subgraph codegen tools are included (check out [Injective](https://substreams.streamingfast.io/documentation/intro-getting-started/intro-injective/injective-first-sps) for an example). |
| 80 | + |
| 81 | +There are two options to consume Substreams data through a subgraph: |
| 82 | + |
| 83 | +- **Using Substreams triggers:** Consume from any Substreams module by importing the Protobuf model through a subgraph handler and move all your logic into a subgraph. This method creates the subgraph entities directly in the subgraph. |
| 84 | +- **Using EntityChanges:** By writing more of the logic into Substreams, you can consume the module's output directly into `graph-node`. In `graph-node`, you can use the Substreams data to create your subgraph entities. |
| 85 | + |
| 86 | +It is really a matter of where you put your logic, in the subgraph or the Substreams. Keep in mind that having more of your logic in Substreams benefits from a parallelized model, whereas triggers will be linearly consumed in `graph-node`. Consider the following example implementing a subgraph handler: |
| 87 | + |
| 88 | +```ts |
| 89 | +export function handleTransactions(bytes: Uint8Array): void { |
| 90 | + let transactions = assembly.eth.transaction.v1.Transactions.decode(bytes.buffer).trasanctions // 1. |
| 91 | + if (transactions.length == 0) { |
| 92 | + log.info('No transactions found', []) |
| 93 | + return |
| 94 | + } |
68 | 95 |
|
69 |
| -## Integrating a new Firehose-enabled chain |
| 96 | + for (let i = 0; i < transactions.length; i++) { |
| 97 | + // 2. |
| 98 | + let transaction = transactions[i] |
70 | 99 |
|
71 |
| -Integrating a new chain is also possible using the Firehose approach. This is currently the best option for non-EVM chains and a requirement for substreams support. Additional documentation focuses on how Firehose works, adding Firehose support for a new chain and integrating it with Graph Node. Recommended docs for integrators: |
| 100 | + let entity = new Transaction(transaction.hash) // 3. |
| 101 | + entity.from = transaction.from |
| 102 | + entity.to = transaction.to |
| 103 | + entity.save() |
| 104 | + } |
| 105 | +} |
| 106 | +``` |
72 | 107 |
|
73 |
| -1. [General docs on Firehose](firehose/) |
74 |
| -2. [Adding Firehose support for a new chain](https://firehose.streamingfast.io/integrate-new-chains/integration-overview) |
75 |
| -3. [Integrating Graph Node with a new chain via Firehose](https://github.com/graphprotocol/graph-node/blob/master/docs/implementation/add-chain.md) |
| 108 | +The `handleTransactions` function is a subgraph handler that receives the raw Substreams bytes as parameter and decodes them into a `Transactions` object. Then, for every transaction, a new subgraph entity is created. For more information about Substreams triggers, visit the [StreamingFast documentation](https://substreams.streamingfast.io/documentation/consume/subgraph/triggers) or check out community modules at [substreams.dev](https://substreams.dev/). |
0 commit comments