Skip to content

Clarifications of the benchmark #5

@guangyaodou

Description

@guangyaodou

Hi,

I am trying to run the benchmark, and have following questions.

  1. Where can we find the code_understanding_directory? This seems to be necessary to evaluate data pipeline.
  2. What are data sources for each query? For example, for the data_sources of legal-tiny.json, I don't see them being passed as part of the inputs during the generation time. I do see that the only time they were used was format_code_understanding_messages in the gpt_interface file, which was called in generate_key_functionalities_for_workload. However, when and how is the generate_key_functionalities_for_workload being used? I couldn't see any function calls that invoke this method.

Thank you for your time.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions