Skip to content

Need help with honesty benchmark against vanilla llama2 #38

@poa010101

Description

@poa010101

Hey lab. I am working on a POC with a customer support AI company. They ask us to provide honesty benchmark to use data to prove we are more honest than the vanilla llama2. Do we have the public data set and the test result? If not, could we use https://people.eecs.berkeley.edu/~normanmu/llm_rules/ to test? It seems the pipeline is different and we can not use as is.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions