|
1 | | -# Azure OpenAI and AI Agent service chat baseline reference implementation |
| 1 | +# AI Agent service chat baseline reference implementation |
2 | 2 |
|
3 | | -This reference implementation illustrates an approach running a chat application and an AI orchestration layer in a single region. It uses Azure AI Agent service as the orchestrator and Azure OpenAI foundation models. This repository directly supports the [Baseline end-to-end chat reference architecture](https://learn.microsoft.com/azure/architecture/ai-ml/architecture/baseline-openai-e2e-chat) on Microsoft Learn. |
| 3 | +This reference implementation illustrates an approach running a chat application and an AI orchestration layer in a single region. It uses Azure AI Agent service as the orchestrator and OpenAI foundation models. This repository directly supports the [Baseline end-to-end chat reference architecture](https://learn.microsoft.com/azure/architecture/ai-ml/architecture/baseline-openai-e2e-chat) on Microsoft Learn. |
4 | 4 |
|
5 | | -Follow this implementation to deploy an agent in [Azure AI Foundry](https://learn.microsoft.com/azure/ai-studio/how-to/prompt-flow) and uses Bing for grounding data. You'll be exposed to common generative AI chat application characteristics such as: |
| 5 | +Follow this implementation to deploy an agent in [Azure AI Foundry](https://learn.microsoft.com/azure/ai-foundry/) and uses Bing for grounding data. You'll be exposed to common generative AI chat application characteristics such as: |
6 | 6 |
|
7 | 7 | - Creating agents and agent prompts |
8 | 8 | - Querying data stores for grounding data |
@@ -70,7 +70,7 @@ Follow these instructions to deploy this example to your Azure subscription, try |
70 | 70 | - App Service Plans: P1v3 (AZ), 3 instances |
71 | 71 | - Azure AI Search (S - Standard): 1 |
72 | 72 | - Azure Cosmos DB: 1 account |
73 | | - - Azure OpenAI: GPT-4o model deployment with 50k tokens per minute (TPM) capacity |
| 73 | + - OpenAI model: GPT-4o model deployment with 50k tokens per minute (TPM) capacity |
74 | 74 | - DDoS Protection Plans: 1 |
75 | 75 | - Public IPv4 Addresses - Standard: 4 |
76 | 76 | - Standard DSv3 Family vCPU: 2 |
@@ -292,7 +292,7 @@ For this deployment guide, you'll continue using your jump box to simulate part |
292 | 292 |
|
293 | 293 | ### 5. Try it out! Test the deployed application that calls into the Azure AI Agent service |
294 | 294 |
|
295 | | -This section will help you to validate that the workload is exposed correctly and responding to HTTP requests. This will validate that traffic is flowing through Application Gateway, into your Web App, and from your Web App, into the Azure Machine Learning managed online endpoint, which contains the hosted prompt flow. The hosted prompt flow will interface with Wikipedia for grounding data and Azure OpenAI for generative responses. |
| 295 | +This section will help you to validate that the workload is exposed correctly and responding to HTTP requests. This will validate that traffic is flowing through Application Gateway, into your Web App, and from your Web App, into the Azure AI Foundry agent API endpoint, which hosts the agent and its chat history. The agent will interface with Bing for grounding data and an OpenAI model for generative responses. |
296 | 296 |
|
297 | 297 | | :computer: | Unless otherwise noted, the following steps are all performed from your original workstation, not from the jump box. | |
298 | 298 | | :--------: | :------------------------- | |
|
0 commit comments