Reducing Hallucinations in LLM Agents with Amazon Bedrock Knowledge Bases

This repository contains sample code demonstrating how to implement a verified semantic cache using Amazon Bedrock Knowledge Bases to prevent hallucinations in Large Language Model (LLM) responses while improving latency and reducing costs.

Overview

The solution implements a read-only semantic cache that acts as an intelligent intermediary layer between users and Amazon Bedrock Agents. When a user submits a query, the system:

Evaluates semantic similarity with existing verified questions in the knowledge base
For highly similar queries (>80% match), returns the curated and verified answer directly
For partial matches (60-80%), uses verified answers as few-shot examples
For low similarity matches (<60%), falls back to standard LLM processing
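The three tiers above can be sketched as a small routing function. This is a minimal illustration, not the repository's implementation: the `CacheMatch` shape, the `Route` names, and the assumption that similarity is reported as a score between 0.0 and 1.0 are all hypothetical.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Route(Enum):
    """The three outcomes described in the overview (names are illustrative)."""
    CACHE_HIT = "return_verified_answer"
    FEW_SHOT = "use_as_few_shot_example"
    LLM_FALLBACK = "standard_llm_processing"


@dataclass
class CacheMatch:
    """Best match found in the verified cache (hypothetical shape)."""
    question: str
    answer: str
    score: float  # assumed similarity in [0.0, 1.0]


# Thresholds from the tiers above, expressed as fractions.
STRONG_MATCH = 0.80
PARTIAL_MATCH = 0.60


def choose_route(match: Optional[CacheMatch]) -> Route:
    """Map the best cache match, if any, to one of the three tiers."""
    if match is None or match.score < PARTIAL_MATCH:
        return Route.LLM_FALLBACK
    if match.score > STRONG_MATCH:
        return Route.CACHE_HIT
    # Scores from 0.60 to 0.80 inclusive are treated as partial matches.
    return Route.FEW_SHOT
```

In this sketch, a score of exactly 0.80 falls into the partial-match tier, because the overview reserves direct cache answers for matches above 80%.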

Benefits

Reduced Hallucinations: Uses verified answers for known queries
Lower Latency: Provides near-instantaneous responses for cached queries
Cost Optimization: Avoids unnecessary LLM invocations
Improved Accuracy: Uses few-shot examples to guide LLM responses
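As a rough illustration of the few-shot idea, verified question and answer pairs can be placed ahead of the user's query in the prompt. The function name and the prompt wording below are assumptions made for this sketch; the notebook may format its prompts differently.

```python
from typing import List, Tuple


def build_few_shot_prompt(examples: List[Tuple[str, str]], user_query: str) -> str:
    """Prepend verified Q&A pairs to the user's query as worked examples."""
    lines = ["Use the verified examples below as guidance for style and content.", ""]
    for question, answer in examples:
        lines.append(f"Example question: {question}")
        lines.append(f"Example answer: {answer}")
        lines.append("")  # blank line between examples
    lines.append(f"Question: {user_query}")
    return "\n".join(lines)
```

The resulting string would then be sent to the LLM in place of the bare query, so the model sees how similar questions were answered in verified form.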

Prerequisites

An AWS account with access to Amazon Bedrock
Access to the following foundation models:
- Anthropic Claude 3 Sonnet (claude-3-sonnet-20240229-v1:0)
- Amazon Titan Text Embeddings v2
AWS CLI configured with appropriate credentials
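One way to sanity-check the model prerequisites is to compare the required model IDs against what Amazon Bedrock lists in your Region. The sketch below assumes the full Bedrock model IDs shown in `REQUIRED_MODEL_IDS` and uses the boto3 `bedrock` client's `list_foundation_models` call. A model appearing in that list does not by itself confirm that access has been granted to your account, so treat this only as a first check.

```python
from typing import Iterable, List, Tuple

# Full model IDs assumed from the prerequisites above; verify them in your Region.
REQUIRED_MODEL_IDS: Tuple[str, ...] = (
    "anthropic.claude-3-sonnet-20240229-v1:0",
    "amazon.titan-embed-text-v2:0",
)


def missing_models(
    available_ids: Iterable[str],
    required_ids: Iterable[str] = REQUIRED_MODEL_IDS,
) -> List[str]:
    """Return the required model IDs that are absent from available_ids."""
    available = set(available_ids)
    return [model_id for model_id in required_ids if model_id not in available]


def list_region_model_ids(region_name: str) -> List[str]:
    """Return the model IDs Bedrock lists in a Region (needs AWS credentials)."""
    import boto3  # imported lazily so missing_models works without boto3

    client = boto3.client("bedrock", region_name=region_name)
    response = client.list_foundation_models()
    return [summary["modelId"] for summary in response["modelSummaries"]]
```

With credentials configured, `missing_models(list_region_model_ids("us-east-1"))` would return an empty list when both models are listed; the Region name here is only an example.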

Getting Started

Clone this repository.

git clone https://github.com/aws-samples/Reducing-Hallucinations-in-LLM-Agents-with-a-Verified-Semantic-Cache.git && cd Reducing-Hallucinations-in-LLM-Agents-with-a-Verified-Semantic-Cache

Deploy the provided AWS CloudFormation template to set up an Amazon SageMaker notebook.

aws cloudformation deploy \
    --template-file ./sagemaker_notebook.yaml \
    --stack-name ReducingHallucinationsDemoStack \
    --capabilities CAPABILITY_NAMED_IAM

Navigate to the Amazon SageMaker AI console (https://console.aws.amazon.com/sagemaker), and click on "Notebooks."
Open "ReducingHallucinationsDemoStack-SageMakerNotebook" as a Jupyter Notebook and follow the instructions in verified_semantic_cache.ipynb. This GitHub repository should already be cloned and available in the Notebook.
When you are finished, delete the AWS CloudFormation stack to avoid unnecessary costs.

aws cloudformation delete-stack --stack-name ReducingHallucinationsDemoStack

Security

See CONTRIBUTING for more information.

License

This library is licensed under the MIT-0 License. See the LICENSE file.
