docs(ai): format pipeline doc and add VRAM and price recomendations

rickstaa · rickstaa · commit 84339d92895e · 2024-12-12T16:10:10.000+01:00
This commit formats the pipeline doc with prettier and adds VRAM and
price recomendation sections.
diff --git a/ai/pipelines/image-to-text.mdx b/ai/pipelines/image-to-text.mdx
@@ -4,7 +4,10 @@ title: Image-to-Text
 
 ## Overview
 
-The `image-to-text` pipeline converts images into text captions. This pipeline is powered by the latest models in the HuggingFace [text-to-image](https://huggingface.co/models?pipeline_tag=text-to-image) pipeline.
+The `image-to-text` pipeline converts images into text captions. This pipeline
+is powered by the latest models in the HuggingFace
+[text-to-image](https://huggingface.co/models?pipeline_tag=text-to-image)
+pipeline.
 
 <div align="center">
 
@@ -19,10 +22,10 @@ The current warm model requested for the `image-to-text` pipeline is:
 - [Salesforce/blip-image-captioning-large](https://huggingface.co/Salesforce/blip-image-captioning-large)
 
 <Tip>
-    For faster responses with different
-    [image-to-text](https://huggingface.co/models?pipeline_tag=text-to-image)
-    diffusion models, ask Orchestrators to load it on their GPU via the `ai-video`
-    channel in [Discord Server](https://discord.gg/livepeer).
+  For faster responses with different
+  [image-to-text](https://huggingface.co/models?pipeline_tag=text-to-image)
+  diffusion models, ask Orchestrators to load it on their GPU via the `ai-video`
+  channel in [Discord Server](https://discord.gg/livepeer).
 </Tip>
 
 ### On-Demand Models
@@ -31,9 +34,9 @@ The following models have been tested and verified for the `image-to-text`
 pipeline:
 
 <Note>
-    If a specific model you wish to use is not listed, please submit a [feature
-    request](https://github.com/livepeer/ai-worker/issues/new?assignees=&labels=enhancement%2Cmodel&projects=&template=model_request.yml)
-    on GitHub to get the model verified and added to the list.
+  If a specific model you wish to use is not listed, please submit a [feature
+  request](https://github.com/livepeer/ai-worker/issues/new?assignees=&labels=enhancement%2Cmodel&projects=&template=model_request.yml)
+  on GitHub to get the model verified and added to the list.
 </Note>
 
 {/* prettier-ignore */}
@@ -44,13 +47,13 @@ pipeline:
 ## Basic Usage Instructions
 
 <Tip>
-    For a detailed understanding of the `image-to-text` endpoint and to experiment
-    with the API, see the [Livepeer AI API
-    Reference](/ai/api-reference/image-to-text).
+  For a detailed understanding of the `image-to-text` endpoint and to experiment
+  with the API, see the [Livepeer AI API
+  Reference](/ai/api-reference/image-to-text).
 </Tip>
 
-To create an image caption using the `image-to-text` pipeline, submit a
-`POST` request to the Gateway's `image-to-text` API endpoint:
+To create an image caption using the `image-to-text` pipeline, submit a `POST`
+request to the Gateway's `image-to-text` API endpoint:
 
 ```bash
 curl -X POST "https://<GATEWAY_IP>/image-to-text" \
@@ -64,9 +67,7 @@ In this command:
 - `model_id` is the diffusion model to use.
 - `image` is the path to the image file to be captioned.
 
-<Note>
-    Maximum request size: 50 MB
-</Note>
+<Note>Maximum request size: 50 MB</Note>
 
 For additional optional parameters, refer to the
 [Livepeer AI API Reference](/ai/api-reference/image-to-text).
@@ -80,16 +81,32 @@ the [Orchestrator Configuration](/ai/orchestrators/get-started) guide.
 
 The following system requirements are recommended for optimal performance:
 
-- [NVIDIA GPU](https://developer.nvidia.com/cuda-gpus) with **at least 12GB** of
-VRAM.
+- [NVIDIA GPU](https://developer.nvidia.com/cuda-gpus) with **at least 4GB** of
+  VRAM.
+
+
+## Recommended Pipeline Pricing
+
+<Note>
+  We are planning to simplify the pricing in the future so orchestrators can set
+  one AI price per compute unit and have the system automatically scale based on
+  the model's compute requirements.
+</Note>
+
+The pricing for the `image-to-text` pipeline is based on competitor pricing.
+However, we strongly encourage orchestrators to set their own pricing based on
+their costs and requirements. Setting a competitive price will help attract more
+jobs, as Gateways can set their maximum price for a job. The current recommended
+pricing for this pipeline is `2.5e-10 USD` per **input pixel**
+(`height * width`).
 
 ## API Reference
 
 <Card
-    title="API Reference"
-    icon="rectangle-terminal"
-    href="/ai/api-reference/image-to-text"
+  title="API Reference"
+  icon="rectangle-terminal"
+  href="/ai/api-reference/image-to-text"
 >
-    Explore the `image-to-text` endpoint and experiment with the API in the
-    Livepeer AI API Reference.
+  Explore the `image-to-text` endpoint and experiment with the API in the
+  Livepeer AI API Reference.
 </Card>