Amazon Bedrock Inference Cost Monitoring & Allocation

📌 Overview

This project provides a structured approach to monitor and allocate inference costs for applications utilizing Amazon Bedrock. By leveraging Application Inference Profiles (AIPs), AWS tagging, and CloudWatch dashboards, it enables detailed cost tracking across various dimensions such as applications, tenants, and environments.

🧰 Features

Application Inference Profiles (AIPs): Create AIPs for each combination of application, tenant, and environment to isolate and monitor usage.
AWS Tagging Integration: Utilize AWS tags to associate metadata with each AIP, facilitating granular cost allocation.
Automated Setup: Deploy necessary AWS resources including Lambda functions, API Gateway endpoints, CloudWatch dashboards, and SNS alerts using a setup script.
Real-Time Monitoring: Visualize inference usage and costs through a Streamlit dashboard integrated with CloudWatch metrics.

⚙️ Prerequisites

Before setting up the project, ensure the following:

AWS Account: An active AWS account with permissions to create and manage resources such as Lambda functions, API Gateway, CloudWatch, and SNS.
Python Environment: Python 3.12 or higher installed on your local machine.
Virtual Environment Setup: It's recommended to use a virtual environment to manage project dependencies.

📝 Configuration

Prior to executing the setup script, update the configuration files to reflect your specific use case.

Update config/config.json: Define your applications, profiles, environments, and associated tags.

Example structure:

{
  "profiles": [
    {
      "name": "CustomerOneWebSearchBot", 
      "description": "For Customer-1 using Websearch Bot",
      "model_id": "anthropic.claude-3-haiku-20240307-v1:0",
      "tags": [
                {
                    "key": "CreatedBy",
                    "value": "Dev-Account"
                },
                {
                    "key": "ApplicationID",
                    "value": "Web-Search-Bot"
                },
                {
                    "key": "Environment",
                    "value": "Dev"
                }
         ...
      ]
    },
    {
      "name": "CustomerOneCodeAssistant",
      "description": "For Customer-1 using Coding Assistant Bot",
      "model_id": "amazon.nova-pro-v1:0",
      "tags": [
                {
                    "key": "CreatedBy",
                    "value": "Prod-Account"
                },
                {
                    "key": "ApplicationID",
                    "value": "Coding-Assistant-Bot"
                },
                {
                    "key": "Environment",
                    "value": "Prod"
                }
         ...
      ]
    }
  ]
}

Update config/models.json: Specify the pricing details for each model, including input and output token costs.

Example structure:

{
  "anthropic.claude-3-haiku-20240307-v1:0": {
    "input_cost": 0.00163,
    "output_cost": 0.00551
  },
  "amazon.nova-pro-v1:0": {
    "input_cost": 0.00075,
    "output_cost": 0.001
  }
}

🚀 Setup Instructions

Follow these steps to set up the project:

Clone the Repository:

git clone https://github.com/aws-samples/amazon-bedrock-samples.git
cd amazon-bedrock-samples/poc-to-prod/inference-profiles/inference-profile-cost-tracing

Set Up Virtual Environment:

python3 -m venv venv
source venv/bin/activate  # On Windows, use 'venv\Scripts\activate'

Install Dependencies:

pip install -r requirements.txt

Execute Setup Script:

python setup.py

This script will:

Create Application Inference Profiles based on your configuration.
Deploy Lambda functions responsible for capturing metadate.
Deploy API Gateway endpoints (you will use this to run your inferences).
Set up CloudWatch dashboards and SNS alerts for monitoring.

📊 CloudWatch Dashboard

An example of the CloudWatch dashboard displaying inference usage and cost metrics.

🎥 Video Tutorial

For a comprehensive walkthrough of the solution, watch the following video:

🧾 License

This project is licensed under the MIT License.

🤝 Contributing

Contributions are welcome! Please fork the repository and submit a pull request for any enhancements or bug fixes.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Amazon Bedrock Inference Cost Monitoring & Allocation

📌 Overview

🧰 Features

⚙️ Prerequisites

📝 Configuration

🚀 Setup Instructions

📊 CloudWatch Dashboard

🎥 Video Tutorial

🧾 License

🤝 Contributing

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Amazon Bedrock Inference Cost Monitoring & Allocation

📌 Overview

🧰 Features

⚙️ Prerequisites

📝 Configuration

🚀 Setup Instructions

📊 CloudWatch Dashboard

🎥 Video Tutorial

🧾 License

🤝 Contributing