PyTorch project for classifying cropped word images as Kurdish or Arabic with:
- ViT-only classifier
- Projection-only classifier
- ViT + projection late-fusion classifier
configs/: experiment configssrc/: training, evaluation, models, datasets, and utilitiesdata/: generated split CSVs at runtimeoutputs/: checkpoints, logs, and metrics at runtimeevaluate_annotated_json.py: evaluation script for COCO-style annotated line images
Install dependencies:
pip install -r requirements.txtViT baseline:
python train.py --config configs/vit_baseline.yamlViT + projection fusion:
python train.py --config configs/vit_proj_fusion.yamlpython evaluate_annotated_json.py --config configs/vit_proj_fusion.yaml --checkpoint outputs/vit_proj_fusion/checkpoints/best.pt --json path\\to\\annotations.json --image-dir path\\to\\images --output outputs\\annotated_eval- The provided YAML configs use local Windows dataset paths and should be edited for a new machine.
- Generated CSV splits, checkpoints, and output artifacts are intentionally excluded from Git.