Error when running get_result.py #3

@Doris404

When I run python get_result.py --cols 2 --distinct 10 --corr 2 --skew 2 --method [METHOD], the output is wrong, for example:
[Screenshot, 2022-01-11 2:06:42 PM]
I debugged the code and found that cols_2_distinct_10_corr_2_skew_2test.sql.csv is empty, which is probably the cause. However, the earlier commands, python generate_data_sql.py --cols [COLUMNS_NUM] --distinct [DOMAIN_SIZE] --corr [CORRELATION] --skew [SKEWNESS] and python get_truecard.py --version cols_[COLUMNS_NUM]_distinct_[DOMAIN_SIZE]_corr_[CORRELATION]_skew_[SKEWNESS], do not explain how to produce cols_2_distinct_10_corr_2_skew_2test.sql.csv from cols_2_distinct_10_corr_2_skew_2test.sql.

I tried copying the contents of cols_2_distinct_10_corr_2_skew_2test.sql directly into cols_2_distinct_10_corr_2_skew_2test.sql.csv, but that does not work.

I traced the code and found that it fails in prepare_pattern_workload, which reads cols_2_distinct_10_corr_2_skew_2test.sql.csv line by line and, for each line, runs tables = sorted([x.split(' ')[1] for x in line.split('#')[0].split(',')]).
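
To make the expected input concrete, here is a minimal sketch that applies the quoted expression to a few sample lines. The helper name parse_tables and the sample lines are hypothetical, and the '#'-separated layout shown (a comma-separated list of "table alias" pairs in the first field) is only what the expression itself implies; the actual format the repository expects may differ.

```python
def parse_tables(line):
    """Apply the expression quoted from prepare_pattern_workload to one line."""
    return sorted([x.split(' ')[1] for x in line.split('#')[0].split(',')])


# Hypothetical line: the first '#'-separated field holds "table alias" pairs.
ok_line = "title t,movie_info mi#t.id=mi.movie_id#t.production_year,>,2000#1234"
print(parse_tables(ok_line))  # ['mi', 't']

# A raw SQL statement copied from the .sql file does not raise an error,
# but the second space-separated token is not a table alias.
sql_line = "SELECT COUNT(*) FROM t WHERE a=1;"
print(parse_tables(sql_line))  # ['COUNT(*)']

# An empty line has no second token, so the expression raises IndexError.
try:
    parse_tables("")
except IndexError as exc:
    print("empty line fails:", exc)
```

This would be consistent with both symptoms above: an empty .csv gives the function nothing valid to parse, and copying SQL statements into it yields meaningless table names.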

So, how can I generate cols_2_distinct_10_corr_2_skew_2test.sql.csv in the correct format?
Thanks for the help 🙂
