Hi,
I am trying to run the benchmark and have the following questions.
- Where can we find the code_understanding_directory? It seems to be required to evaluate the data pipeline.
- What are the data sources for each query? For example, I don't see the data_sources of legal-tiny.json being passed as part of the inputs at generation time. The only place I see them used is format_code_understanding_messages in the gpt_interface file, which is called from generate_key_functionalities_for_workload. However, when and how is generate_key_functionalities_for_workload used? I couldn't find any calls that invoke it.
Thank you for your time.