Bedrock Agent 활용하기

여기에서는 Knowledge Base로 RAG를 구성하고 bedrock agent를 활용하는 방법을 설명합니다. Bedrock agent는 완전관리형 서비스로 한번 구현하면 추가적인 노력없이 편리하게 agent를 이용한 서비스를 구현할 수 있습니다. 이를 위해 action group으로 tools를 등록하고, knowledge base로 편리하게 RAG를 구성할 수 있어야 합니다. Bedrock agent의 LLM 모델로는 Anthropic의 Claude와 Amazon의 Nova를 선택하여 활용할 수 있도록 하였고, agent로 구현하는 code interpreter를 테스트 해볼 수 있습니다.

전체 Architecture

아래 그림에서는 bedrock agent로 구현된 architecture를 보여주고 있습니다. Knowledge base로 RAG를 구성하였고, AWS lambda로 구현된 tools에서는 인터넷 검색, 날씨정보 API를 호출할 수 있습니다. 테스트용 애플리케이션은 streamlit으로 구성하였고, 안전하게 접속할 수 있도록 CloudFront와 API Gateway를 이용해 HTTPS 연결을 제공합니다. 애플리케이션에서 문서를 선택하면 Amazon S3에 업로드 되고, Knowledge base를 이용해 OpenSearch Serverless에 자동 동기화 됩니다. 이후 RAG로 검색을 수행하면 CloudFront - S3의 연결로 파일을 공유 할 수 있습니다.

상세 구현

일반적인 대화

일반적인 대화에서는 프롬프트의 chatbot의 이름을 지정하고 원하는 동작을 수행하도록 요청할 수 있습니다. 또한 아래와 같이 history를 이용해 이전 대화내용을 참조하여 답변하도록 합니다. 결과는 stream으로 전달되어 streamlit으로 표시됩니다.

def general_conversation(query):
    chat = get_chat()
    system = (
        "당신의 이름은 서연이고, 질문에 대해 친절하게 답변하는 사려깊은 인공지능 도우미입니다."
        "상황에 맞는 구체적인 세부 정보를 충분히 제공합니다." 
        "모르는 질문을 받으면 솔직히 모른다고 말합니다."
    )    
    human = "Question: {input}"
    
    prompt = ChatPromptTemplate.from_messages([
        ("system", system), 
        MessagesPlaceholder(variable_name="history"), 
        ("human", human)
    ])                
    history = memory_chain.load_memory_variables({})["chat_history"]
    chain = prompt | chat | StrOutputParser()
    stream = chain.stream(
        {
            "history": history,
            "input": query,
        }
    )          
    return stream

RAG

RAG은 retrive, grade, generation의 단계로 수행됩니다. Knowledge Base를 이용해 관련된 문서를 가져오는 retrieve 동작을 수행합니다. 문서 원본을 확인할 수 있도록 파일의 url은 cloudfront의 도메인을 기준으로 제공합니다. 문서의 조회는 LangChain의 AmazonKnowledgeBasesRetriever을 이용하여 numberOfResults만큼 가져옵니다. 이때 overrideSearchType로 'HYBRID'와 'SEMANTIC"을 선택할 수 있습니다. LangChain을 사용하지 않을 경우에 boto3의 retrieve_and_generate을 이용해 유사하게 구현이 가능합니다.

def retrieve_documents_from_knowledge_base(query, top_k):
    relevant_docs = []
    retriever = AmazonKnowledgeBasesRetriever(
        knowledge_base_id=knowledge_base_id, 
        retrieval_config={"vectorSearchConfiguration": {
            "numberOfResults": top_k,
            "overrideSearchType": "HYBRID"   # SEMANTIC
        }},
        region_name=bedrock_region
    )
    
    documents = retriever.invoke(query)    
    relevant_docs = []
    for doc in documents:
        content = ""
        if doc.page_content:
            content = doc.page_content        
        score = doc.metadata["score"]        
        if "s3Location" in doc.metadata["location"]:
            link = doc.metadata["location"]["s3Location"]["uri"] if doc.metadata["location"]["s3Location"]["uri"] is not None else ""
            pos = link.find(f"/{doc_prefix}")
            name = link[pos+len(doc_prefix)+1:]
            encoded_name = parse.quote(name)
            link = f"{path}/{doc_prefix}{encoded_name}"
        elif "webLocation" in doc.metadata["location"]:
            link = doc.metadata["location"]["webLocation"]["url"] if doc.metadata["location"]["webLocation"]["url"] is not None else ""
            name = "WEB"
        url = link
        relevant_docs.append(
            Document(
                page_content=content,
                metadata={
                    'name': name,
                    'score': score,
                    'url': url,
                    'from': 'RAG'
                },
            )
        )    
    return relevant_docs

적절한 문서를 선택하기 위하여 grade 동작을 수행합니다. 아래와 같이 retieve에서 얻어진 관련된 문서들이 실제 관련이 있는데 확인합니다. 이때 확인된 문서들만 filtered_docs로 정리합니다.

def grade_documents(question, documents):
    filtered_docs = []
    chat = get_chat()
    retrieval_grader = get_retrieval_grader(chat)
    for i, doc in enumerate(documents):
        score = retrieval_grader.invoke({"question": question, "document": doc.page_content})
        grade = score.binary_score
        if grade.lower() == "yes":
            print("---GRADE: DOCUMENT RELEVANT---")
            filtered_docs.append(doc)
        else:
            print("---GRADE: DOCUMENT NOT RELEVANT---")
            continue
    return filtered_docs

문서가 실제 관련이 있는지는 아래와 같이 prompt를 이용합니다. 이때 "yes", "no"와 같은 결과를 추출하기 위하여 structured_output을 활용하였습니다.

class GradeDocuments(BaseModel):
    """Binary score for relevance check on retrieved documents."""

    binary_score: str = Field(description="Documents are relevant to the question, 'yes' or 'no'")

def get_retrieval_grader(chat):
    system = """You are a grader assessing relevance of a retrieved document to a user question. \n 
    If the document contains keyword(s) or semantic meaning related to the question, grade it as relevant. \n
    Give a binary score 'yes' or 'no' score to indicate whether the document is relevant to the question."""
    grade_prompt = ChatPromptTemplate.from_messages(
        [
            ("system", system),
            ("human", "Retrieved document: \n\n {document} \n\n User question: {question}"),
        ]
    )    
    structured_llm_grader = chat.with_structured_output(GradeDocuments)
    retrieval_grader = grade_prompt | structured_llm_grader
    return retrieval_grader

관련된 문서를 가지고 답변을 생성합니다.

chat = get_chat()
system = (
    "당신의 이름은 서연이고, 질문에 대해 친절하게 답변하는 사려깊은 인공지능 도우미입니다."
    "다음의 Reference texts을 이용하여 user의 질문에 답변합니다."
    "모르는 질문을 받으면 솔직히 모른다고 말합니다."
    "답변의 이유를 풀어서 명확하게 설명합니다."
)
human = (
    "Question: {question}"

    "Reference texts: "
    "{context}"
) 
 prompt = ChatPromptTemplate.from_messages([("system", system), ("human", human)])
rag_chain = prompt | chat
result = rag_chain.invoke(
    {
        "question": text,
        "context": relevant_context                
    }
)
msg = result.content

Bedrock Agent

Agent를 위해서는 cdk-bedrock-agent-stack.ts와 같이 Bedrock에 대한 invoke, retrieve, inference, agent-alias를 허용하도록 하여야 합니다.

const agent_role = new iam.Role(this,  `role-agent-for-${projectName}`, {
  roleName: `role-agent-for-${projectName}-${region}`,
  assumedBy: new iam.CompositePrincipal(
    new iam.ServicePrincipal("bedrock.amazonaws.com")
  )
});

const agentInvokePolicy = new iam.PolicyStatement({ 
  effect: iam.Effect.ALLOW,
  resources: [
    `arn:aws:bedrock:*::foundation-model/*`
  ],
  actions: [
    "bedrock:InvokeModel"
  ],
});        
agent_role.attachInlinePolicy( 
  new iam.Policy(this, `agent-invoke-policy-for-${projectName}`, {
    statements: [agentInvokePolicy],
  }),
);  

const bedrockRetrievePolicy = new iam.PolicyStatement({ 
  effect: iam.Effect.ALLOW,
  resources: [
    `arn:aws:bedrock:${region}:${accountId}:knowledge-base/*`
  ],
  actions: [
    "bedrock:Retrieve"
  ],
});        
agent_role.attachInlinePolicy( 
  new iam.Policy(this, `bedrock-retrieve-policy-for-${projectName}`, {
    statements: [bedrockRetrievePolicy],
  }),
);  

const agentInferencePolicy = new iam.PolicyStatement({ 
  effect: iam.Effect.ALLOW,
  resources: [
    `arn:aws:bedrock:${region}:${accountId}:inference-profile/*`,
    `arn:aws:bedrock:*::foundation-model/*`
  ],
  actions: [
    "bedrock:InvokeModel",
    "bedrock:GetInferenceProfile",
    "bedrock:GetFoundationModel"
  ],
});        
agent_role.attachInlinePolicy( 
  new iam.Policy(this, `agent-inference-policy-for-${projectName}`, {
    statements: [agentInferencePolicy],
  }),
);

const agentAliasPolicy = new iam.PolicyStatement({ 
  effect: iam.Effect.ALLOW,
  resources: [
    `arn:aws:bedrock:${region}:${accountId}:agent-alias/*`
  ],
  actions: [
    "bedrock:GetAgentAlias",
    "bedrock:InvokeAgent"
  ],
});        
agent_role.attachInlinePolicy( 
  new iam.Policy(this, `agent-alias-policy-for-${projectName}`, {
    statements: [agentAliasPolicy],
  }),
);

Bedrock agent는 console에서 생성할 수도 있지만 아래와 같이 boto3의 create_agent을 이용해 생성할 수 있습니다. 이때 agent의 instruction을 지정하고 agent role을 활용합니다. 생성된 agentId는 이후 agent에 추가 설정을 하거나 실행할때 활용됩니다.

client = boto3.client(
    service_name='bedrock-agent',
    region_name=bedrock_region
)  
agent_instruction = (
    "당신의 이름은 서연이고, 질문에 친근한 방식으로 대답하도록 설계된 대화형 AI입니다. "
    "상황에 맞는 구체적인 세부 정보를 충분히 제공합니다. "
    "모르는 질문을 받으면 솔직히 모른다고 말합니다. "
)
response = client.create_agent(
    agentResourceRoleArn=agent_role_arn,
    instruction=agent_instruction,
    foundationModel=modelId,
    description=f"Bedrock Agent (Knowledge Base) 입니다. 사용 모델은 {modelName}입니다.",
    agentName=agentName,
    idleSessionTTLInSeconds=600
)
agentId = response['agent']['agentId']

Bedrock agent에서 실행할 tool들은 action group에서 정의합니다. Action group이 이미 있는지를 list_agent_action_groups로 확인하고, create_agent_action_group을 이용해 action group을 생성할 수 있습니다.

response = client.list_agent_action_groups(
    agentId=agentId,
    agentVersion='DRAFT',
    maxResults=10
)
actionGroupSummaries = response['actionGroupSummaries']

isExist = False
for actionGroup in actionGroupSummaries:
    if actionGroup['actionGroupId'] == actionGroupName:
        isExist = True
        break
if not isExist:
    response = client.create_agent_action_group(
        actionGroupName=actionGroupName,
        actionGroupState='ENABLED',
        agentId=agentId,
        agentVersion='DRAFT',
        description=f"Action Group의 이름은 {actionGroupName} 입니다.",
        actionGroupExecutor={'lambda': lambda_tools_arn},
        functionSchema={
            'functions': [
                {
                    'name': 'book_list',
                    'description': 'Search book list by keyword and then return book list',                        
                    'parameters': {
                        'keyword': {
                            'description': 'Search keyword',
                            'required': True,
                            'type': 'string'
                        }
                    },
                    'requireConfirmation': 'DISABLED'
                },
                {
                    'name': 'current_time',
                    'description': "Returns the current date and time in the specified format such as %Y-%m-%d %H:%M:%S",
                    'parameters': {
                        'format': {
                            'description': 'time format of the current time',
                            'required': True,
                            'type': 'string'
                        }
                    },
                    'requireConfirmation': 'DISABLED'
                },
                {
                    'name': 'weather',
                    'description': "Retrieve weather information by city name and then return weather statement.",
                    'parameters': {
                        'city': {
                            'description': 'the name of city to retrieve',
                            'required': True,
                            'type': 'string'
                        }
                    },
                    'requireConfirmation': 'DISABLED'
                },
                {
                    'name': 'search_internet',
                    'description': "Search general information by keyword and then return the result as a string.",
                    'parameters': {
                        'keyword': {
                            'description': 'search keyword',
                            'required': True,
                            'type': 'string'
                        }
                    },
                    'requireConfirmation': 'DISABLED'
                },
                {
                    'name': 'search_rag',
                    'description': "Search technical information by keyword and then return the result as a string.",
                    'parameters': {
                        'keyword': {
                            'description': 'search keyword',
                            'required': True,
                            'type': 'string'
                        }
                    },
                    'requireConfirmation': 'DISABLED'
                }
            ]
        },            
    )

생성된 Bedrock agent에 RAG를 직접 연결할 때에는 아래와 같이 associate_agent_knowledge_base을 이용하여 연결합니다.

rag_prompt = (
    "당신의 이름은 서연이고, 질문에 대해 친절하게 답변하는 사려깊은 인공지능 도우미입니다."
    "다음의 Reference texts을 이용하여 user의 질문에 답변합니다."
    "모르는 질문을 받으면 솔직히 모른다고 말합니다."
    "답변의 이유를 풀어서 명확하게 설명합니다."
)
response = client.associate_agent_knowledge_base(
    agentId=agentId,
    agentVersion='DRAFT',
    description=rag_prompt,
    knowledgeBaseId=knowledge_base_id,
    knowledgeBaseState='ENABLED'
)
print(f'response of associate_agent_knowledge_base(): {response}')

Bedrock agent를 이용하려면 실행전에 prepared 상태이어야 합니다. 따라서 설정 후에는 아래처럼 prepare 상태를 변경합니다.

response = client.prepare_agent(
    agentId=agentId
)
print('response of prepare_agent(): ', response)

Bedrock agent를 사용하기 위해서는 배포를 하여야 하는데, 여기서는 원할한 데모를 위해 기존 배포가 있다지 확인해서 있다면 지우고 새로 생성합니다. 기존 배포의 확인은 list_agent_aliases, 삭제는 delete_agent_alias, 생성은 create_agent_alias을 이용합니다.

# retrieve agent alias
response_agent_alias = client.list_agent_aliases(
    agentId = agentId,
    maxResults=10
)
for summary in response_agent_alias["agentAliasSummaries"]:
    if summary["agentAliasName"] == agentAliasName:
        agentAliasId = summary["agentAliasId"]
        break
if agentAliasId:
    response = client.delete_agent_alias(
        agentAliasId=agentAliasId,
        agentId=agentId
    )            

# create agent alias 
response = client.create_agent_alias(
    agentAliasName=agentAliasName,
    agentId=agentId,
    description='the lastest deployment'
)
agentAliasId = response['agentAlias']['agentAliasId']

Bedrock agent는 "bedrock-agent-runtime"을 이용하여 invoke_agent로 실행합니다.

client_runtime = boto3.client(
    service_name='bedrock-agent-runtime',
    region_name=bedrock_region
)
response = client_runtime.invoke_agent( 
    agentAliasId=agentAliasId,
    agentId=agentId,
    inputText=text, 
    enableTrace=True,
    sessionId=sessionId[userId], 
    memoryId='memory-'+userId,
    sessionState=sessionState
)
event_stream = response['completion']

결과는 stream으로 얻을수 있습니다. "enableTrace"를 이용해 중간 결과를 화면에 표시할 수 있습니다. 최종 결과는 chunk난 trace의 "observation"의 "finalResponse"로 알 수 있습니다.

for index, event in enumerate(event_stream):
    # Handle text chunks
    if "chunk" in event:
        chunk = event["chunk"]
        if "bytes" in chunk:
            text = chunk["bytes"].decode("utf-8")
            stream_result += text
    # Handle file outputs
    if "files" in event:
        files = event["files"]["files"]
        for file in files:
            st.image(file["bytes"], caption=file["name"])
    # Check trace
    if "trace" in event:
        if ("trace" in event["trace"] and "orchestrationTrace" in event["trace"]["trace"]):
            trace_event = event["trace"]["trace"]["orchestrationTrace"]
            if "rationale" in trace_event:
                trace_text = trace_event["rationale"]["text"]
                st.info(f"rationale: {trace_text}")

            if "modelInvocationInput" in trace_event:
                if "text" in trace_event["modelInvocationInput"]:
                    trace_text = trace_event["modelInvocationInput"]["text"]
                    print("trace_text: ", trace_text)
                if "rawResponse" in trace_event["modelInvocationInput"]:
                    rawResponse = trace_event["modelInvocationInput"]["rawResponse"]                        
                    print("rawResponse: ", rawResponse)
            if "modelInvocationOutput" in trace_event:
                if "rawResponse" in trace_event["modelInvocationOutput"]:
                    trace_text = trace_event["modelInvocationOutput"]["rawResponse"]["content"]
                    print("trace_text: ", trace_text)

            if "invocationInput" in trace_event:
                if "codeInterpreterInvocationInput" in trace_event["invocationInput"]:
                    trace_code = trace_event["invocationInput"]["codeInterpreterInvocationInput"]["code"]
                    print("trace_code: ", trace_code)
                if "knowledgeBaseLookupInput" in trace_event["invocationInput"]:
                    trace_text = trace_event["invocationInput"]["knowledgeBaseLookupInput"]["text"]
                    st.info(f"RAG를 검색합니다. 검색어: {trace_text}")
                if "actionGroupInvocationInput" in trace_event["invocationInput"]:
                    trace_function = trace_event["invocationInput"]["actionGroupInvocationInput"]["function"]
                    st.info(f"actionGroupInvocation: {trace_function}")

            if "observation" in trace_event:
                if "finalResponse" in trace_event["observation"]:
                    trace_resp = trace_event["observation"]["finalResponse"]["text"]
                    final_result = trace_resp
                if ("codeInterpreterInvocationOutput" in trace_event["observation"]):
                    if "executionOutput" in trace_event["observation"]["codeInterpreterInvocationOutput"]:
                        trace_resp = trace_event["observation"]["codeInterpreterInvocationOutput"]["executionOutput"]
                        st.info(f"observation: {trace_resp}")
                    if "executionError" in trace_event["observation"]["codeInterpreterInvocationOutput"]:
                        trace_resp = trace_event["observation"]["codeInterpreterInvocationOutput"]["executionError"]
                        if "image_url" in trace_resp:
                            print("got image")
                            image_url = trace_resp["image_url"]
                            st.image(image_url)
                if "knowledgeBaseLookupOutput" in trace_event["observation"]:
                    if "retrievedReferences" in trace_event["observation"]["knowledgeBaseLookupOutput"]:
                        references = trace_event["observation"]["knowledgeBaseLookupOutput"]["retrievedReferences"]
                        st.info(f"{len(references)}개의 문서가 검색되었습니다.")
                if "actionGroupInvocationOutput" in trace_event["observation"]:
                    trace_resp = trace_event["observation"]["actionGroupInvocationOutput"]["text"]
                    st.info(f"actionGroupInvocationOutput: {trace_resp}")

        elif "guardrailTrace" in event["trace"]["trace"]:
            guardrail_trace = event["trace"]["trace"]["guardrailTrace"]
            if "inputAssessments" in guardrail_trace:
                assessments = guardrail_trace["inputAssessments"]
                for assessment in assessments:
                    if "contentPolicy" in assessment:
                        filters = assessment["contentPolicy"]["filters"]
                        for filter in filters:
                            if filter["action"] == "BLOCKED":
                                st.error(f"Guardrail blocked {filter['type']} confidence: {filter['confidence']}")
                    if "topicPolicy" in assessment:
                        topics = assessment["topicPolicy"]["topics"]
                        for topic in topics:
                            if topic["action"] == "BLOCKED":
                                st.error(f"Guardrail blocked topic {topic['name']}")

Agent에서 사용할 Tool의 구현

Action group에 정의된 Tool들은 Lambda를 이용해 실행됩니다. cdk-bedrock-agent-stack.ts에서는 아래와 같이 tools을 위한 lambda를 정의합니다. 이때, lambda는 "lambda:InvokeFunction"에 대한 권한을 가지고 있어야 합니다.

const lambdaTools = new lambda.DockerImageFunction(this, `lambda-tools-for-${projectName}`, {
  description: 'action group - tools',
  functionName: `lambda-tools-for-${projectName}`,
  code: lambda.DockerImageCode.fromImageAsset(path.join(__dirname, '../../lambda-tools')),
  timeout: cdk.Duration.seconds(60),
  environment: {
    bedrock_region: String(region),
    projectName: projectName,
    "sharing_url": 'https://'+distribution_sharing.domainName,
  }
});         
lambdaTools.grantInvoke(new cdk.aws_iam.ServicePrincipal("bedrock.amazonaws.com"));

Bedrock agent가 Tool을 선택하면 event로 전달됩니다. 이때 event에서 "function"으로 tool의 이름을 확인하고, "parameters"의 "value"로 tool을 실행시키고 결과를 리턴합니다. 아래의 코드는 lambda_function.py을 참조합니다.

def lambda_handler(event, context):
    agent = event['agent']
    actionGroup = event['actionGroup']
    function = event['function']
    parameters = event.get('parameters',[])
    name = parameters[0]['name']
    value = parameters[0]['value']
    
    if function == 'current_time':
        output = get_current_time(value)        
    elif function == 'book_list':
        output = get_book_list(value)            
    elif function == 'weather':        
        output = get_weather_info(value)
    elif function == 'search_internet':
        output = search_by_tavily(value)
    elif function == 'search_rag':
        output = search_by_knowledge_base(value)

    responseBody =  {
        "TEXT": {
            "body": output
        }
    }
    action_response = {
        'actionGroup': actionGroup,
        'function': function,
        'functionResponse': {
            'responseBody': responseBody
        }
    }
    response = {
        'response': action_response, 
        'messageVersion': event['messageVersion']
    }
    return response

주식정보를 가져오는 함수의 예제입니다. stock_data_lookup의 경우에 ticker와 country를 받아서 country가 한국인 경우에 "KS"를 붙여서 1개월의 정보를 주식정보를 가져옵니다.

import yfinance as yf
def stock_data_lookup(ticker, country):
    """
    Retrieve accurate stock trends for a given ticker.
    ticker: the ticker to retrieve price history for
    country: the english country name of the stock
    return: the information of ticker
    """ 
    com = re.compile('[a-zA-Z]') 
    alphabet = com.findall(ticker)
    if len(alphabet)==0:        
        if country == "South Korea" or country == "Korea":
            ticker += ".KS"
        elif country == "Japan":
            ticker += ".T"
    stock = yf.Ticker(ticker)
    
    # get the price history for past 1 month
    history = stock.history(period="1mo")
    
    result = f"## Trading History\n{history}"
    result += f"\n\n## Financials\n{stock.financials}"
    result += f"\n\n## Major Holders\n{stock.major_holders}"

    return result

일반 인터넷 검색을 수행하는 함수는 아래와 같습니다.

def search_by_tavily(keyword: str) -> str:
    answer = ""    
    keyword = keyword.replace('\'','')
    
    search = TavilySearchResults(
        max_results=2,
        include_answer=True,
        include_raw_content=True,
        api_wrapper=tavily_api_wrapper,
        search_depth="advanced", # "basic"
    )
    output = search.invoke(keyword)
    
    for result in output:
        print('result: ', result)
        if result:
            content = result.get("content")
            url = result.get("url")                    
            answer = answer + f"{content}, URL: {url}\n\n"    
    if answer == "":
        answer = "관련된 정보를 찾지 못하였습니다."
    return answer

도서 검색용 Tool은 아래와 같이 정의할 수 있습니다.

def get_book_list(keyword: str) -> str:
    """
    Search book list by keyword and then return book list
    keyword: search keyword
    return: book list
    """
    
    keyword = keyword.replace('\'','')

    answer = ""
    url = f"https://search.kyobobook.co.kr/search?keyword={keyword}&gbCode=TOT&target=total"
    response = requests.get(url)
    if response.status_code == 200:
        soup = BeautifulSoup(response.text, "html.parser")
        prod_info = soup.find_all("a", attrs={"class": "prod_info"})
        
        if len(prod_info):
            answer = "추천 도서는 아래와 같습니다.\n"
            
        for prod in prod_info[:5]:
            title = prod.text.strip().replace("\n", "")       
            link = prod.get("href")
            answer = answer + f"{title}, URL: {link}\n\n"
    
    return answer

날씨 Tool은 OpenWeather에서 무료로 제공하는 API를 이용해 구현하였습니다.

def get_weather_info(city: str) -> str:
    """
    retrieve weather information by city name and then return weather statement.
    city: the name of city to retrieve
    return: weather statement
    """    
    
    city = city.replace('\n','')
    city = city.replace('\'','')
    city = city.replace('\"','')
                
    chat = get_chat()
    if isKorean(city):
        place = traslation(chat, city, "Korean", "English")
        print('city (translated): ', place)
    else:
        place = city
        city = traslation(chat, city, "English", "Korean")
        
    weather_str: str = f"{city}에 대한 날씨 정보가 없습니다."
    if weather_api_key: 
        apiKey = weather_api_key
        lang = 'en' 
        units = 'metric' 
        api = f"https://api.openweathermap.org/data/2.5/weather?q={place}&APPID={apiKey}&lang={lang}&units={units}"
        result = requests.get(api)
        result = json.loads(result.text)    
        if 'weather' in result:
            overall = result['weather'][0]['main']
            current_temp = result['main']['temp']
            min_temp = result['main']['temp_min']
            max_temp = result['main']['temp_max']
            humidity = result['main']['humidity']
            wind_speed = result['wind']['speed']
            cloud = result['clouds']['all']            
            weather_str = f"{city}의 현재 날씨의 특징은 {overall}이며, 현재 온도는 {current_temp} 입니다. 현재 습도는 {humidity}% 이고, 바람은 초당 {wind_speed} 미터 입니다. 구름은 {cloud}% 입니다."                        
    return weather_str

Custom Orchestration

Customize your Amazon Bedrock Agent's behavior with custom orchestration와 같이 아래와 같은 event를 lambda를 이용해 처리 할 수 있습니다. START, MODEL_INVOKED,TOOL_INVOKED의 event와 stopReason == "end_turn"로 end turn의 동작을 지정할 수 있습니다.

Lambda payload의 "actionEvent"의 종류는 "INVOKE_MODEL | INVOKE_TOOL | APPLY_GUARDRAIL | FINISH | user defined"가 있습니다.

Code Interpreter

Code Interpreter를 위한 action group을 생성합니다. 이때, Amazon Bedrock에서 코드 해석 활성화와 같이 parentActionGroupSignature을 'AMAZON.CodeInterpreter'로 설정합니다. 이때 description, actionGroupExecutor은 사용할 수 없습니다.

def create_action_group_for_code_interpreter(agentId, st):
    actionGroupName = "CodeInterpreter"
    response = client.list_agent_action_groups(
        agentId=agentId,
        agentVersion='DRAFT',
        maxResults=10
    )
    actionGroupSummaries = response['actionGroupSummaries']

    isExist = False
    for actionGroup in actionGroupSummaries:
        if actionGroup['actionGroupId'] == actionGroupName:
            print('action group already exists')
            isExist = True
            break
    if not isExist:
        response = client.create_agent_action_group(
            actionGroupName=actionGroupName,
            actionGroupState='ENABLED',
            agentId=agentId,
            agentVersion='DRAFT',
            parentActionGroupSignature='AMAZON.CodeInterpreter'
        )

Test code interpretation in Amazon Bedrock와 같이 sessionState을 이용해 code interpreter를 실행시킬 수 있습니다. app.py와 같이 sessionState에 "files"를 정의 후에 실행합니다.

sessionState = {
    'files': [
        {
            'name': file_name,
            'source': {
                'byteContent': {
                    'data': uploaded_file.getvalue(),
                    'mediaType': 'text/csv'
                },
                'sourceType': 'BYTE_CONTENT'
            },
            'useCase': 'CODE_INTERPRETER'
        },
    ]
}
with st.status("thinking...", expanded=True, state="running") as status:
    response, reference_docs = chat.run_bedrock_agent(prompt, chat.agent_name, sessionState, st)
    st.write(response)

Multi Agent Collaboration

Multi agent에서 supervisor와 collaborator들은 agent id, agent name, agent alias name, agent alias arn을 가지고 있습니다. 여기에서는 stock와 search agent들을 가지고 collaborator로 등록합니다.

# supervisor
supervisor_agent_id = supervisor_alias_id = None
supervisor_agent_name = "agent-supervisor"
supervisor_agent_alias_name = "latest_version"
supervisor_agent_alias_arn = ""

# collaborator
stock_agent_id = stock_agent_alias_id = None
stock_agent_name = "stock-agent"
stock_agent_alias_name = "latest_version"
stock_agent_alias_arn = ""

search_agent_id = search_agent_alias_id = None
search_agent_name = "search-agent"
search_agent_alias_name = "latest_version"
search_agent_alias_arn = ""

Multi agent에서 invoke 동작은 single agent에서와 동일하게 supervisor의 agent id, agent alias를 이용해 invoke를 수행합니다. 이때 필요시 trace를 이용해 중간값들을 가져오고, session id를 이용해 파일들을 활용할 수 있으므로 memory id를 이용해 이전 대화이력을 활용할 수 있습니다.

response = client_runtime.invoke_agent( 
    agentAliasId=supervisor_agent_alias_id,
    agentId=supervisor_agent_id,
    inputText=text, 
    enableTrace=True,
    sessionId=sessionId[userId], 
    memoryId='memory-'+userId
)
logger.info(f"response of invoke_agent(): {response}")

response_stream = response['completion']

아래는 collaborator를 생성하는 함수입니다. stock agent는 stock_data_lookup라는 이름을 가지고 있고 주어진 ticker로 부터 stock 정보를 가져옵니다. 이때 ticker와 country 정보가 필요한데, supervisor가 collaborator의 description을 보고 적절한 값을 넣게 됩니다. 한국과 같은 경우에 ticker가 숫자이므로 country를 보고 'ko'를 추가합니다. 아래와 같이 create_agent()로 agent를 생성하고, create_action_group()으로 action group을 생성하고, prepare_agent()를 이용해 prepare 상태를 만들고, deploy_agent()로 배포합니다. 각 state를 정의하는 함수 중간에 의도적으로 delay를 부여합니다.

def create_bedrock_agent_collaborator(modelId, modelName, agentName, agentAliasName, st):
    if agentName == "stock-agent":
        functionSchema = {
            'functions': [
                {
                    'name': 'stock_data_lookup',
                    'description': "Retrieve accurate stock trends for a given ticker.",
                    'parameters': {
                        'ticker': {
                            'description': 'the ticker to retrieve price history for',
                            'required': True,
                            'type': 'string'
                        },
                        'country': {
                            'description': 'the english country name of the stock',
                            'required': True,
                            'type': 'string'
                        }
                    },
                    'requireConfirmation': 'DISABLED'
                }
            ]
        }
    elif agentName == "search-agent": 
        functionSchema = {
            'functions': [
                {
                    'name': 'search_by_tavily',
                    'description': "Search general information by keyword and then return the result as a string.",
                    'parameters': {
                        'keyword': {
                            'description': 'search keyword',
                            'required': True,
                            'type': 'string'
                        }
                    },
                    'requireConfirmation': 'DISABLED'
                }
            ]
        }

    agent_instruction = (
        "당신의 이름은 서연이고, 질문에 친근한 방식으로 대답하도록 설계된 대화형 AI입니다. "
        "상황에 맞는 구체적인 세부 정보를 충분히 제공합니다. "
        "모르는 질문을 받으면 솔직히 모른다고 말합니다. "
    )
    response = client.create_agent(
        agentResourceRoleArn=agent_role_arn,
        instruction=agent_instruction,
        foundationModel=modelId,
        description=f"Collaborator Agent인 {agentName}입니다. 사용 모델은 {modelName}입니다.",
        agentName=agentName,
        idleSessionTTLInSeconds=600
    )

    agentId = response['agent']['agentId']
    time.sleep(5)   

    create_action_group(agentId, action_group_name_for_multi_agent, lambda_tools_arn, functionSchema, st)     

    prepare_agent(agentId)
    
    agentAliasId, agentAliasArn = deploy_agent(agentId, agentAliasName)
    time.sleep(3) 

    return agentId, agentAliasId, agentAliasArn

아래와 같이 supervisor agent를 생성합니다. agentCollaboration으로 SUPERVISOR를 선택하고 agent의 role과 instruction을 연결합니다. 결과에서 agent를 추출한 후에 supervisor에서 code로 분석할 수 있도록 code interpreter를 action group으로 등록합니다. associate_agent_collaborator()로 collaborator들을 supervisor에 각각 연결합니다. 이후 prepare_agent()로 prepare 상태로 바꾸고, deploy_agent()로 배포합니다.

def create_bedrock_agent_supervisor(modelId, modelName, agentName, agentAliasName, st):
    agent_instruction = (
        "당신의 이름은 서연이고, 질문에 친근한 방식으로 대답하도록 설계된 대화형 AI입니다. "
        "상황에 맞는 구체적인 세부 정보를 충분히 제공합니다. "
        "모르는 질문을 받으면 솔직히 모른다고 말합니다. "
    )
    response = client.create_agent(
        agentCollaboration = 'SUPERVISOR', # SUPERVISOR_ROUTER
        orchestrationType = 'DEFAULT',
        agentName=agentName,
        agentResourceRoleArn=agent_role_arn,
        instruction=agent_instruction,
        foundationModel=modelId,
        description=f"Supervisor Agent인 {agentName}입니다. 사용 모델은 {modelName}입니다.",
        idleSessionTTLInSeconds=600
    )
    agentId = response['agent']['agentId']
    time.sleep(3)

    create_action_group_for_code_interpreter(agentId, st)
                
    response = client.associate_agent_collaborator(
        agentDescriptor={
            'aliasArn': stock_agent_alias_arn
        },
        agentId=agentId,
        agentVersion='DRAFT',
        collaborationInstruction=f"{stock_agent_name} retrieves accurate stock trends for a given ticker.",
        collaboratorName=stock_agent_name
    )
    
    response = client.associate_agent_collaborator(
        agentDescriptor={
            'aliasArn': search_agent_alias_arn
        },
        agentId=agentId,
        agentVersion='DRAFT',
        collaborationInstruction=f"{search_agent_name} searchs general information by keyword and then return the result as a string.",
        collaboratorName=search_agent_name
    )
    time.sleep(3)

    prepare_agent(agentId)
    time.sleep(3)
    
    agentAliasId, agentAliasArn = deploy_agent(agentId, agentAliasName)    
    time.sleep(3)

    return agentId, agentAliasId, agentAliasArn

왼쪽 메뉴에서 "multi agent collaboration"을 선택하면 supervisor agent이 collaborator인 stock agent와 search agent를 이용해 답변을 구합니다. 아래와 같이 superviser agent에게 네이버 주식에 대해 문의하면 stock agent가 실행됩니다. stock agent은 질문을 보고 action group을 실행하는데, 여기서는 stock_data_lookup이 선택되어 주식정보를 가져옵니다.

활용 방법 (Debugging)

EC2는 Private Subnet에 있으므로 SSL로 접속할 수 없습니다. 따라서, Console-EC2에 접속하여 "app-for-bedrock-agent"를 선택한 후에 Connect에서 sesseion manager를 선택하여 접속합니다.

Github에서 app에 대한 코드를 업데이트 하였다면, session manager에 접속하여 아래 명령어로 업데이트 합니다.

sudo runuser -l ec2-user -c 'cd /home/ec2-user/bedrock-agent && git pull'

Streamlit의 재시작이 필요하다면 아래 명령어로 service를 stop/start 시키고 동작을 확인할 수 있습니다.

sudo systemctl stop streamlit
sudo systemctl start streamlit
sudo systemctl status streamlit -l

Local에서 디버깅을 빠르게 진행하고 싶다면 Local에서 실행하기에 따라서 Local에 필요한 패키지와 환경변수를 업데이트 합니다. 이후 아래 명령어서 실행합니다.

streamlit run application/app.py

EC2에서 debug을 하면서 개발할때 사용하는 명령어입니다.

먼저, 시스템에 등록된 streamlit을 종료합니다.

sudo systemctl stop streamlit

이후 EC2를 session manager를 이용해 접속한 이후에 아래 명령어를 이용해 실행하면 로그를 보면서 수정을 할 수 있습니다.

sudo runuser -l ec2-user -c "/home/ec2-user/.local/bin/streamlit run /home/ec2-user/bedrock-agent/application/app.py"

참고로 lambda-tools로 docker image가 아닌 코드를 압축해서 올리는 경우에 참조할 명령어는 아래와 같습니다. 압축후에 my_deployment_package.zip을 console에서 업로드합니다.

cd lambda-tools/
pip install --target ./package requests beautifulsoup4 # package 설치
cd package && zip -r ../my_deployment_package.zip .
cd .. && zip my_deployment_package.zip lambda_function.py info.py # add lambda_function.py

MCP

Bedrock agent에서 MCP를 사용하기 위해서는 InlineAgent SDSK를 이용합니다. 상세한 내용은 MCP를 이용해 Bedrock Agent 이용하기을 참조합니다.

To-Do: MCP는 python3.10 이상인데 현재 EC2는 3.9이므로 python 버전을 바꾸거나 docker로 전환을 하여야 함

직접 실습 해보기

사전 준비 사항

이 솔루션을 사용하기 위해서는 사전에 아래와 같은 준비가 되어야 합니다.

AWS Account 생성에 따라 계정을 준비합니다.

CDK를 이용한 인프라 설치

본 실습에서는 us-west-2 리전을 사용합니다. 인프라 설치에 따라 CDK로 인프라 설치를 진행합니다.

실행결과

CloudFront의 도메인 주소로 접속시 아래와 같은 화면을 볼 수 있습니다. 메뉴에는 "일상적인대화", "RAG", "Agent", "Agent with Knowledge Base", "번역하기", "문법 검토하기"가 있습니다. 여기서 "Agent"는 Bedrock agent이고, "Agent with Knowledge Base"는 "Agent"에 Knodwledge Base를 추가한 형태입니다. "Agent"는 Knowledge base를 Tool의 형태로 action group에 지정하여 활용하고 "Agent with Knowledge Base"는 Bedrock agent에 Knodwledge base를 등록합니다. "Agent with Knowledge Base"은 먼저 action group의 tool들을 조회하고 없는 경우에 "Knowledge Base"를 조회하고, "Agent"는 tool의 하나로 Knowledge base를 사용하는 차이가 있습니다.

또한, 아래와 같이 메뉴에서 사용 모델을 선택하면, 6개의 모델을 선택하여 사용할 수 있습니다. 모델을 선택하면 Bedrock agent의 설정을 바꾸기위한 provisioning이 수행됩니다.

문서 업로드의 Browse files를 선택하여 업로드할 파일을 지정하면 아래와 같이 파일이 Amazon S3로 업로드되고, 이후 자동으로 Knodwledge base와 sync하는 동작이 수행됩니다.

Code Interpreter를 사용하고자 하는 경우에는 아래와 같이 "Code Interpreter"를 지정하고 csv 파일을 업로드 합니다. 이 버튼이 Enable되면 문서 업로드가 Code Interpreter로만 동작하므로 일반 문서를 업로드 할 경우에는 "Disable"을 하여야 합니다.

메뉴에서 "Agent"를 선택하고 "여행과 관련된 책 추천해줘."라고 입력한 후에 결과를 확인합니다. 아래와 같이 action group에서 "get_book_list"를 호출하여 얻은 값을 가지고 답변을 생성하였습니다.

Bedrock agetn의 code interpreter의 기능을 테스트하기 위하여 "code interpreter"를 enable하고 주식 CSV 파일파일을 업로드합니다. 이후 "가장 변화량이 큰 주식의 지난 1년간의 변화량을 일자별로 표시하는 그래프를 그려주세요."라고 입력합니다. 이때의 결과는 아래와 같습니다.

"네이버 주가 동향을 그래프로 그려주세요. 그림의 글자는 충분히 크게 하고 영어를 사용합니다."라고 입력하면 API를 통해 주식 정보를 확인하고, 얻어진 정보를 기반으로 그래프를 아래와 같이 그려줍니다.

"네이버와 카카오의 일일 주가동향을 그래프로 그려주세요. 네이버와 카카오의 향후 투자 전략도 간단히 세워주세요. 그래프 안의 제목등은 영어로 작성하세요."라고 입력후 결과를 확인합니다. Matplotlib로 만들어진 그림의 한글이 깨지는 현상이 있어서 프롬프트에 해당 내용을 추가합니다.

메뉴에서 [이미지 분석]과 모델로 [Claude 3.5 Sonnet]을 선택한 후에 기다리는 사람들 사진을 다운받아서 업로드합니다. 이후 "사진속에 있는 사람들은 모두 몇명인가요?"라고 입력후 결과를 확인하면 아래와 같습니다.

리소스 정리하기

더이상 인프라를 사용하지 않는 경우에 아래처럼 모든 리소스를 삭제할 수 있습니다. 아래의 명령어로 전체 삭제를 합니다.

cd ~/bedrock-agent/cdk-bedrock-agent && cdk destroy --all

References

generative-ai-cdk-constructs

On AWS CDK and Agents for Amazon Bedrock

Bedrock Agents-Based AI AppBuilder Assistant with Boto3 SDK

Agents for Amazon Bedrock - create agent

AWS re:Invent 2024 - Building an AWS solutions architect agentic app with Amazon Bedrock (DEV331)

Building Agentic Workflows on AWS

Building an AWS Solutions Architect Agentic App with Amazon Bedrock

종속 항목이 있는 .zip 배포 패키지 생성

Amazon Bedrock Agent Samples

Amazon Bedrock Agents 워크샵

Amazon Bedrock 다중 에이전트 협업

AI Agent Stack

Extended thinking - Claude 3.7

Name		Name	Last commit message	Last commit date
Latest commit History 413 Commits
application		application
cdk-bedrock-agent		cdk-bedrock-agent
contents		contents
lambda-tools		lambda-tools
LICENSE		LICENSE
README.md		README.md
deployment.md		deployment.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Bedrock Agent 활용하기

전체 Architecture

상세 구현

일반적인 대화

RAG

Bedrock Agent

Agent에서 사용할 Tool의 구현

Custom Orchestration

Code Interpreter

Multi Agent Collaboration

활용 방법 (Debugging)

MCP

직접 실습 해보기

사전 준비 사항

CDK를 이용한 인프라 설치

실행결과

리소스 정리하기

References

About

Uh oh!

Releases

Packages

Languages

License

kyopark2014/bedrock-agent

Folders and files

Latest commit

History

Repository files navigation

Bedrock Agent 활용하기

전체 Architecture

상세 구현

일반적인 대화

RAG

Bedrock Agent

Agent에서 사용할 Tool의 구현

Custom Orchestration

Code Interpreter

Multi Agent Collaboration

활용 방법 (Debugging)

MCP

직접 실습 해보기

사전 준비 사항

CDK를 이용한 인프라 설치

실행결과

리소스 정리하기

References

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages