Open
Description
Hi I am getting a preprocessing error (when invoking source preprocess.sh
). I don't get any error when I preprocess the same dataset with code2vec. Appreciate any advice. Here's the error:
Extracting paths from training set...
dir: <dataset dir> was not completed in time
dir: <dataset dir> was not completed in time
dir: <dataset dir> was not completed in time
dir: <dataset dir> was not completed in time
Finished extracting paths from training set
Creating histograms from the training data
subtoken vocab size: 0
node vocab size: 0
target vocab size: 0
File: <dataset_name>.raw.txt
Traceback (most recent call last):
File "preprocess.py", line 115, in <module>
max_contexts=int(args.max_contexts), max_data_contexts=int(args.max_data_contexts))
File "preprocess.py", line 53, in process_file
print('Average total contexts: ' + str(float(sum_total) / total))
ZeroDivisionError: float division by zero
This is the line that is being triggered:
code2seq/JavaExtractor/extract.py
Line 49 in af04b4c
Appreciate any thoughts.
Metadata
Metadata
Assignees
Labels
No labels