batch-pydatajson-analyzer

Little python script to analyze metadata of Argentine data.json site-nodes.

Execute

python metadata.py --file ./test/samples/nodes.csv

Output file: ./test/result/result.csv

TODO list:

/home/ubu-dev-env/development/repos/batch-pydatajson-analyzer/output.py:144: FutureWarning: Behavior when concatenating bool-dtype and numeric-dtype arrays is deprecated; in a future version these will cast to object dtype (instead of coercing bools to numeric values). To retain the old behavior, explicitly cast bool-dtype arrays to numeric dtype. curr_dataframe = pd.concat([curr_dataframe, append_dataframe], axis=0) /home/ubu-dev-env/development/repos/batch-pydatajson-analyzer/output.py:93: FutureWarning: Behavior when concatenating bool-dtype and numeric-dtype arrays is deprecated; in a future version these will cast to object dtype (instead of coercing bools to numeric values). To retain the old behavior, explicitly cast bool-dtype arrays to numeric dtype. return pd.concat([curr_dataframe, append_dataframe], axis=0)
clean unused imports
better comment code
there is an error in distribution types plot: not showing distribution types correctly.
sometimes there is a discrepancy between the data type shown in some of the columns, e.g, Dataset Errors in catalog indicator shows TRUE or 1 sometimes.
DCAT schema validation? test with Tigre data.json

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.vscode		.vscode
_vendor		_vendor
res		res
test		test
.gitignore		.gitignore
README.md		README.md
constants.py		constants.py
metadata.py		metadata.py
output.py		output.py
requirements.txt		requirements.txt
stats.py		stats.py
util.py		util.py
vendorize.toml		vendorize.toml