Skip to content

Commit 8127dcd

Browse files
docs(ai): improve recommended orchestrator pricing (#690)
This commit enhances the recommended pricing documentation, providing orchestrators with clearer guidance for setting their initial prices effectively. --------- Co-authored-by: Rick Staa <[email protected]>
1 parent c8166a7 commit 8127dcd

8 files changed

+115
-34
lines changed

ai/orchestrators/models-config.mdx

+9-32
Original file line numberDiff line numberDiff line change
@@ -25,50 +25,27 @@ currently **recommended** models and their respective prices.
2525
{
2626
"pipeline": "text-to-image",
2727
"model_id": "SG161222/RealVisXL_V4.0_Lightning",
28-
"price_per_unit": 4768371
29-
},
30-
{
31-
"pipeline": "image-to-image",
32-
"model_id": "timbrooks/instruct-pix2pix",
33-
"price_per_unit": 4768371
28+
"price_per_unit": 4768371,
3429
},
3530
{
3631
"pipeline": "upscale",
3732
"model_id": "stabilityai/stable-diffusion-x4-upscaler",
38-
"price_per_unit": 4768371
33+
"price_per_unit": "0.5e-2USD",
34+
"warm": true,
35+
"optimization_flags": {
36+
"SFAST": true,
37+
"DEEPCACHE": false
38+
}
3939
},
4040
{
4141
"pipeline": "audio-to-text",
4242
"model_id": "openai/whisper-large-v3",
4343
"price_per_unit": 12882811,
44+
"pixels_per_unit": 1,
45+
"currency": "USD",
4446
"url": "<CONTAINER_URL>:<PORT>",
4547
"token": "<OPTIONAL_BEARER_TOKEN>",
4648
"capacity": 1
47-
},
48-
{
49-
"pipeline": "segment-anything-2",
50-
"model_id": "facebook/sam2-hiera-large",
51-
"price_per_unit": 3565,
52-
"pixels_per_unit": 1e13,
53-
"currency": "USD",
54-
"warm": true
55-
},
56-
{
57-
"pipeline": "image-to-video",
58-
"model_id": "stabilityai/stable-video-diffusion-img2vid-xt-1-1",
59-
"price_per_unit": 3390842,
60-
"warm": true,
61-
"optimization_flags": {
62-
"SFAST": true,
63-
"DEEPCACHE": false
64-
}
65-
},
66-
{
67-
"pipeline": "text-to-speech",
68-
"model_id": "parler-tts/parler-tts-large-v1",
69-
"price_per_unit": 11,
70-
"pixels_per_unit": 1e2,
71-
"currency": "USD"
7249
}
7350
]
7451
```

ai/pipelines/audio-to-text.mdx

+15
Original file line numberDiff line numberDiff line change
@@ -107,6 +107,21 @@ The following system requirements are recommended for optimal performance:
107107
- [NVIDIA GPU](https://developer.nvidia.com/cuda-gpus) with **at least 12GB** of
108108
VRAM.
109109

110+
## Recommended Pipeline Pricing
111+
112+
<Note>
113+
We are planning to simplify the pricing in the future so orchestrators can set
114+
one AI price per compute unit and have the system automatically scale based on
115+
the model's compute requirements.
116+
</Note>
117+
118+
The pricing for the `audio-to-text` pipeline is based on competitor pricing.
119+
However, we strongly encourage orchestrators to set their own pricing based on
120+
their costs and requirements. Setting a competitive price will help attract more
121+
jobs, as Gateways can set their maximum price for a job. The currently
122+
recommended pricing for this pipeline is `0.02e-6 USD` per **milliseconds** of
123+
audio input.
124+
110125
## API Reference
111126

112127
<Card

ai/pipelines/image-to-image.mdx

+15
Original file line numberDiff line numberDiff line change
@@ -141,6 +141,21 @@ The following system requirements are recommended for optimal performance:
141141
- [NVIDIA GPU](https://developer.nvidia.com/cuda-gpus) with **at least 20GB** of
142142
VRAM.
143143

144+
## Recommended Pipeline Pricing
145+
146+
<Note>
147+
We are planning to simplify the pricing in the future so orchestrators can set
148+
one AI price per compute unit and have the system automatically scale based on
149+
the model's compute requirements.
150+
</Note>
151+
152+
The pricing for the `image-to-image` pipeline is based on competitor pricing.
153+
However, we strongly encourage orchestrators to set their own pricing based on
154+
their costs and requirements. Setting a competitive price will help attract more
155+
jobs, as Gateways can set their maximum price for a job. The current recommended
156+
pricing for this pipeline is `1.9073484e-08 USD` per **input pixel**
157+
(`height * width * output images`).
158+
144159
## API Reference
145160

146161
<Card

ai/pipelines/image-to-video.mdx

+15
Original file line numberDiff line numberDiff line change
@@ -126,6 +126,21 @@ The following system requirements are recommended for optimal performance:
126126
- [NVIDIA GPU](https://developer.nvidia.com/cuda-gpus) with **at least 24GB** of
127127
VRAM.
128128

129+
## Recommended Pipeline Pricing
130+
131+
<Note>
132+
We are planning to simplify the pricing in the future so orchestrators can set
133+
one AI price per compute unit and have the system automatically scale based on
134+
the model's compute requirements.
135+
</Note>
136+
137+
The pricing for the `image-to-video` pipeline is based on competitor pricing.
138+
However, we strongly encourage orchestrators to set their own pricing based on
139+
their costs and requirements. Setting a competitive price will help attract more
140+
jobs, as Gateways can set their maximum price for a job. The current recommended
141+
pricing for this pipeline is `1.3563368e-08 USD` per **output pixel**
142+
(`height * width * frames`).
143+
129144
## API Reference
130145

131146
<Card

ai/pipelines/segment-anything-2.mdx

+15
Original file line numberDiff line numberDiff line change
@@ -104,6 +104,21 @@ The following system requirements are recommended for optimal performance:
104104
- [NVIDIA GPU](https://developer.nvidia.com/cuda-gpus) with **at least 6GB** of
105105
VRAM.
106106

107+
## Recommended Pipeline Pricing
108+
109+
<Note>
110+
We are planning to simplify the pricing in the future so orchestrators can set
111+
one AI price per compute unit and have the system automatically scale based on
112+
the model's compute requirements.
113+
</Note>
114+
115+
The pricing for the `segment-anything-2` pipeline is based on competitor
116+
pricing. However, we strongly encourage orchestrators to set their own pricing
117+
based on their costs and requirements. Setting a competitive price will help
118+
attract more jobs, as Gateways can set their maximum price for a job. The
119+
current recommended pricing for this pipeline is `3.22e-11 USD` per **input
120+
pixel** (`height * width`).
121+
107122
### Pipeline-Specific Image
108123

109124
To serve the `segment-anything-2` pipeline, you must use a pipeline specific AI

ai/pipelines/text-to-image.mdx

+17-2
Original file line numberDiff line numberDiff line change
@@ -37,8 +37,8 @@ The current warm model requested for the `text-to-image` pipeline is:
3737
Furthermore, several Orchestrators are currently maintaining the following model
3838
in a ready state:
3939

40-
- [ByteDance/SDXL-Lightning](https://huggingface.co/ByteDance/SDXL-Lightning):
41-
A high-performance diffusion model developed by ByteDance.
40+
- [ByteDance/SDXL-Lightning](https://huggingface.co/ByteDance/SDXL-Lightning): A
41+
high-performance diffusion model developed by ByteDance.
4242

4343
<Tip>
4444
For faster responses with different
@@ -157,6 +157,21 @@ The following system requirements are recommended for optimal performance:
157157
- [NVIDIA GPU](https://developer.nvidia.com/cuda-gpus) with **at least 24GB** of
158158
VRAM.
159159

160+
## Recommended Pipeline Pricing
161+
162+
<Note>
163+
We are planning to simplify the pricing in the future so orchestrators can set
164+
one AI price per compute unit and have the system automatically scale based on
165+
the model's compute requirements.
166+
</Note>
167+
168+
The pricing for the `text-to-image` pipeline is based on competitor pricing.
169+
However, we strongly encourage orchestrators to set their own pricing based on
170+
their costs and requirements. Setting a competitive price will help attract more
171+
jobs, as Gateways can set their maximum price for a job. The current recommended
172+
pricing for this pipeline is `1.9073484e-08 USD` per **output pixel**
173+
(`height * width * output images`).
174+
160175
## API Reference
161176

162177
<Card

ai/pipelines/text-to-speech.mdx

+14
Original file line numberDiff line numberDiff line change
@@ -83,6 +83,20 @@ The following system requirements are recommended for optimal performance:
8383
- [NVIDIA GPU](https://developer.nvidia.com/cuda-gpus) with **at least 12GB** of
8484
VRAM.
8585

86+
## Recommended Pipeline Pricing
87+
88+
<Note>
89+
We are planning to simplify the pricing in the future so orchestrators can set
90+
one AI price per compute unit and have the system automatically scale based on
91+
the model's compute requirements.
92+
</Note>
93+
94+
The pricing for the `text-to-speech` pipeline is based on competitor pricing.
95+
However, we strongly encourage orchestrators to set their own pricing based on
96+
their costs and requirements. Setting a competitive price will help attract more
97+
jobs, as Gateways can set their maximum price for a job. The current recommended
98+
pricing for this pipeline is `1.5e-6 USD` per **character**.
99+
86100
### Pipeline-Specific Image
87101

88102
To serve the `text-to-speech` pipeline, you must use a pipeline specific AI

ai/pipelines/upscale.mdx

+15
Original file line numberDiff line numberDiff line change
@@ -121,6 +121,21 @@ The following system requirements are recommended for optimal performance:
121121
- [NVIDIA GPU](https://developer.nvidia.com/cuda-gpus) with **at least 24GB** of
122122
VRAM.
123123

124+
## Recommended Pipeline Pricing
125+
126+
<Note>
127+
We are planning to simplify the pricing in the future so orchestrators can set
128+
one AI price per compute unit and have the system automatically scale based on
129+
the model's compute requirements.
130+
</Note>
131+
132+
The pricing for the `upscale` pipeline is based on competitor pricing. However,
133+
we strongly encourage orchestrators to set their own pricing based on their
134+
costs and requirements. Setting a competitive price will help attract more jobs,
135+
as Gateways can set their maximum price for a job. The current recommended
136+
pricing for this pipeline is `1.9073484e-08 USD` per **input pixel**
137+
(`height * width`).
138+
124139
## API Reference
125140

126141
<Card

0 commit comments

Comments
 (0)