This repository is a corpus of business policies to support research studies, academic courses, and experiments. It includes synthetic policies and a catalog of links to real public policies across various business domains.
LLMs are said to bring reasoning capabilities. While they have been progressing in this direction for more than two years on specific tasks, the question remains open when it comes to automating decision making over business policies. To go beyond intuition or bias, the idea is to measure any solution (a pure LLM, generated code, a human choice) against a ground-truth dataset.
For synthetic but real-life-inspired business policies, what is the performance of:
- pure LLM decision making, given a single prompt containing the request and the policy (see the sketch after this list),
- a chain of thought involving a sequence or tree of generations,
- code generated by an LLM,
- other means involving other techniques (optimization, genetic algorithms, tensor-based logic inference, etc.)?
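As an illustration of the first approach, here is a minimal sketch of the "pure LLM" baseline: a single prompt carrying the policy text and the request, and one decision back. It assumes the OpenAI Python SDK; the model name, the JSON decision contract, and the function name `decide` are placeholders, not part of this corpus.

```python
# Hedged sketch of the "pure LLM" baseline: one prompt with the policy text
# and the request context, one decision back. The JSON output contract and
# the model name are assumptions chosen for illustration.
import json
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment


def decide(policy_text: str, request_context: dict) -> dict:
    """Ask the model for a single decision, expected back as a JSON object."""
    prompt = (
        "You are a decision engine. Apply the following business policy to the "
        "request and answer only with a JSON object such as "
        '{"decision": "approved" | "rejected", "reason": "..."}.\n\n'
        f"POLICY:\n{policy_text}\n\nREQUEST:\n{json.dumps(request_context)}"
    )
    response = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{"role": "user", "content": prompt}],
        temperature=0,
    )
    # Assumes the model complied with the JSON-only instruction.
    return json.loads(response.choices[0].message.content)
```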
For a panel of business domains and use cases, this project provides data and code to benchmark automated decisions against a business policy expressed in plain text. Each policy is described by:
- a plain-text description specifying the requirements, criteria, and logic to deduce a decision from a given context;
- a Python implementation of the policy, validated by a human and based on an explicit interpretation wherever the policy is ambiguous or incomplete;
- a data generator that invokes the policy implementation on synthetic inputs to produce outcomes (see the sketch after this list);
- a list of decision datasets, ready to use as a baseline to measure the performance of any system (a pure LLM, code generated by an LLM, or anything else) that automates the decision making.
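The data-generation pattern described above can be sketched as follows. The policy logic, field names, and output file are toy placeholders; the real policies in the corpus define their own inputs and decision labels.

```python
# Hedged sketch of the data-generation pattern: run the validated policy
# implementation on synthetic requests to build a reference decision dataset.
# Policy rules, field names, and file layout are illustrative only.
import csv
import random


def decide_loan(applicant: dict) -> str:
    """Toy stand-in for a human-validated policy implementation."""
    if applicant["credit_score"] >= 650 and applicant["debt_ratio"] < 0.4:
        return "approved"
    return "rejected"


def generate_dataset(path: str, n: int = 1000, seed: int = 42) -> None:
    """Sample synthetic inputs, apply the policy code, and persist the pairs."""
    rng = random.Random(seed)
    with open(path, "w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=["credit_score", "debt_ratio", "decision"])
        writer.writeheader()
        for _ in range(n):
            applicant = {
                "credit_score": rng.randint(300, 850),
                "debt_ratio": round(rng.uniform(0.0, 1.0), 2),
            }
            writer.writerow({**applicant, "decision": decide_loan(applicant)})


generate_dataset("loan_policy_decisions.csv")
```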
The list of business policies captured in this corpus:
If you want to quantitatively measure the performance of your policy automation, this project is made for you. Run your own implementation (pure LLM, LLM-generated code, etc.) to produce decisions and compare them with the reference decisions in the available datasets, as sketched below. Please have a look at this section: Benchmark your policy implementation
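A minimal sketch of that comparison step is shown below. It assumes both files are CSVs with a `decision` column and rows in the same order; the actual dataset layout is defined per policy in the corpus.

```python
# Hedged sketch of the benchmarking step: compare your system's decisions with
# the reference decisions shipped in a dataset. File names, the CSV layout,
# and the "decision" column are assumptions for illustration.
import csv


def accuracy(reference_path: str, candidate_path: str) -> float:
    """Fraction of requests for which the candidate matches the reference decision."""
    with open(reference_path, newline="") as ref, open(candidate_path, newline="") as cand:
        ref_rows = list(csv.DictReader(ref))
        cand_rows = list(csv.DictReader(cand))
    matches = sum(1 for r, c in zip(ref_rows, cand_rows) if r["decision"] == c["decision"])
    return matches / len(ref_rows)


print(f"decision accuracy: {accuracy('reference.csv', 'my_system.csv'):.2%}")
```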
As we cook a similar recipe for each policy, the project provides a common framework to support and accelerate the definition and data generation of a policy: Common framework
If you intend to extend the corpus with a new policy, please have a look at this section: Adding a policy