Commit 3545661 (1 parent: 7e09625): initial code commit

33 files changed: +3855, -0 lines

README.md

Lines changed: 84 additions & 0 deletions
# Strikethrough Removal From Handwritten Words Using CycleGANs

[![License](https://img.shields.io/badge/License-MIT-blue.svg?style=flat-square)](https://opensource.org/licenses/MIT)

### [Raphaela Heil](mailto:raphaela.heil@it.uu.se) :envelope:, [Ekta Vats](mailto:ekta.vats@it.uu.se) and [Anders Hast](mailto:anders.hast@it.uu.se)

Code and related resources for the [ICDAR 2021](https://icdar2021.org/) paper **Strikethrough Removal From Handwritten Words Using CycleGANs**.

## Table of Contents

1. [Code](#code)
   1. [Strikethrough Removal](#strikethrough-removal)
   2. [Strikethrough Classification](#strikethrough-classification)
   3. [Strikethrough Identification](#strikethrough-identification)
   4. [Running the Code](#running-the-code)
2. [Data](#data)
3. [Citation](#citation)
4. [Acknowledgements](#acknowledgements)

## Code

Each of the following subdirectories contains the code that was used in the context of this paper, together with its Python requirements and the original configuration(s). Configuration files have to be modified with local paths to input and output directories before running.

Model checkpoints are attached to the release of this repository.

### Strikethrough Removal

- code for training various forms of CycleGANs to remove strikethrough from handwritten words
- the CycleGAN code is based on [https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix](https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix)
```
@inproceedings{CycleGAN2017,
  title={Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks},
  author={Zhu, Jun-Yan and Park, Taesung and Isola, Phillip and Efros, Alexei A},
  booktitle={Computer Vision (ICCV), 2017 IEEE International Conference on},
  year={2017}
}
```
### Strikethrough Classification

- code to train a DenseNet121 to classify a struck-through word image into one of seven types of strikethrough

### Strikethrough Identification

- code to train a DenseNet121 to identify whether a given word image is struck-through or not (i.e. 'clean')

### Running the Code

#### Train

In order to train any of the three models, run:

```
python src/train.py -configfile <path to config file> -config <name of section from config file>
```

If no `configfile` is defined, the script will assume `config.cfg` in the current working directory. If no `config` is defined, the script will assume `DEFAULT`.
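The fallback behaviour described above can be sketched with a small, self-contained `argparse` snippet (a minimal sketch mirroring the flag handling in `configuration.py`; `resolve_config_args` is a hypothetical helper name, while the defaults `config.cfg` and `DEFAULT` come from this README):

```python
import argparse

def resolve_config_args(argv):
    """Mimic the fallback logic: missing flags default to config.cfg / DEFAULT."""
    parser = argparse.ArgumentParser()
    parser.add_argument("-config", required=False, help="section of config-file to use")
    parser.add_argument("-configfile", required=False, help="path to config-file")
    args = vars(parser.parse_args(argv))
    file_name = args["configfile"] if args["configfile"] else "config.cfg"
    file_section = args["config"] if args["config"] else "DEFAULT"
    return file_name, file_section

# no flags given: both defaults apply
print(resolve_config_args([]))                      # ('config.cfg', 'DEFAULT')
# explicit section, default file
print(resolve_config_args(["-config", "PAD_INV"]))  # ('config.cfg', 'PAD_INV')
```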
#### Test

For testing, run:

```
python src/test.py -configfile <path to config file> -data <path to data dir>
```

- `configfile` should point to the config file in an output directory of a train run (or one of the checkpoint config files)
- `data` should point to a directory containing `struck` and `struck_gt` sub-directories, e.g. one of the datasets presented in [Data](#data)
- an additional flag `-save` can be specified to save the cleaned images, otherwise only performance metrics (F<sub>1</sub> score and RMSE) will be logged
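The two logged metrics, F<sub>1</sub> score over binarised pixels and RMSE over raw intensities, can be illustrated for a pair of images as follows (a minimal sketch, not the repository's evaluation code; the 0.5 threshold and the dark-ink-on-light-background convention are assumptions):

```python
import numpy as np

def f1_and_rmse(cleaned, ground_truth, threshold=0.5):
    """F1 over binarised foreground pixels, RMSE over raw intensities in [0, 1]."""
    pred = cleaned < threshold          # assumption: dark ink on light background
    target = ground_truth < threshold
    tp = np.logical_and(pred, target).sum()
    fp = np.logical_and(pred, ~target).sum()
    fn = np.logical_and(~pred, target).sum()
    denom = 2 * tp + fp + fn
    f1 = 2 * tp / denom if denom > 0 else 0.0
    rmse = float(np.sqrt(np.mean((cleaned - ground_truth) ** 2)))
    return f1, rmse

# toy 2x2 "images": one foreground pixel missed by the prediction
a = np.array([[0.0, 1.0], [1.0, 0.0]])  # cleaned output
b = np.array([[0.0, 1.0], [0.0, 0.0]])  # ground truth
f1, rmse = f1_and_rmse(a, b)
print(f1, rmse)  # 0.8 0.5
```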
## Data

- Synthetic strikethrough dataset on Zenodo: [https://doi.org/10.5281/zenodo.4767094](https://doi.org/10.5281/zenodo.4767094)
  - based on the [IAM](https://fki.tic.heia-fr.ch/databases/iam-handwriting-database) database
  - multi-writer
  - generated using [https://doi.org/10.5281/zenodo.4767062](https://doi.org/10.5281/zenodo.4767062)
- Genuine strikethrough dataset on Zenodo: [https://doi.org/10.5281/zenodo.4765062](https://doi.org/10.5281/zenodo.4765062)
  - single-writer
  - blue ballpoint pen
  - clean and struck word images registered based on:
    > J. Öfverstedt, J. Lindblad and N. Sladoje, "Fast and Robust Symmetric Image Registration Based on Distances Combining Intensity and Spatial Information," in IEEE Transactions on Image Processing, vol. 28, no. 7, pp. 3584-3597, July 2019, doi: 10.1109/TIP.2019.2899947.

    ([Paper](https://ieeexplore.ieee.org/document/8643403), [Code](https://github.com/MIDA-group/py_alpha_amd_release))

## Citation

ICDAR 2021:

```
@INPROCEEDINGS{heil2021strikethrough,
  author={Heil, Raphaela and Vats, Ekta and Hast, Anders},
  booktitle={2021 International Conference on Document Analysis and Recognition (ICDAR)},
  title={{Strikethrough Removal from Handwritten Words Using CycleGANs}},
  year={2021},
  pubstate={to appear}}
```

## Acknowledgements

- R. Heil would like to thank [Nicolas Pielawski](https://scholar.google.se/citations?user=MmqXB5oAAAAJ), [Håkan Wieslander](https://scholar.google.se/citations?user=PLJ8O9MAAAAJ), [Johan Öfverstedt](https://scholar.google.se/citations?user=GMminVMAAAAJ) and [Anders Brun](https://scholar.google.se/citations?user=LQ4p1qQAAAAJ) for their helpful comments and fruitful discussions.
- The computations were enabled by resources provided by the Swedish National Infrastructure for Computing ([SNIC](https://snic.se/)) at the High Performance Computing Center North ([HPC2N](https://www.hpc2n.umu.se/)), partially funded by the Swedish Research Council through grant agreement no. 2018-05973.
Lines changed: 24 additions & 0 deletions

```ini
[DEFAULT]
outdir = tmp
trainimgagebasedir = train/struck
testimagedir = validation/struck
imageheight = 128
imagewidth = 512
epochs = 30
batchsize = 128
validationepochinterval = 1
modelsaveepoch = -1
invertimages = True
model = dense
padscale = False
padwidth = 1024
padheight = 256

[PAD_INV]
model = dense
batchsize = 64
invertimages = True
padscale = True
padwidth = 1024
padheight = 256
```
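Note that `configparser` treats `[DEFAULT]` as a fallback for every other section, so `[PAD_INV]` inherits keys such as `outdir` and `epochs` while overriding `batchsize` and `padscale`. A minimal sketch of that behaviour (using a trimmed-down inline config, not the repository's file):

```python
from configparser import ConfigParser

cfg_text = """
[DEFAULT]
outdir = tmp
batchsize = 128
padscale = False

[PAD_INV]
batchsize = 64
padscale = True
"""

parser = ConfigParser()
parser.read_string(cfg_text)
section = parser["PAD_INV"]
print(section.get("outdir"))           # inherited from DEFAULT: tmp
print(section.getint("batchsize"))     # overridden: 64
print(section.getboolean("padscale"))  # overridden: True
```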
Lines changed: 103 additions & 0 deletions

```
# This file may be used to create an environment using:
# $ conda create --name <env> --file <this file>
# platform: linux-64
_libgcc_mutex=0.1=main
blas=1.0=mkl
blosc=1.20.1=hd408876_0
brotli=1.0.9=he6710b0_2
brunsli=0.1=h2531618_0
bzip2=1.0.8=h7b6447c_0
ca-certificates=2021.1.19=h06a4308_0
certifi=2020.12.5=py38h06a4308_0
charls=2.1.0=he6710b0_2
cloudpickle=1.6.0=py_0
cudatoolkit=10.2.89=hfd86e86_1
cycler=0.10.0=py38_0
cytoolz=0.11.0=py38h7b6447c_0
dask-core=2021.1.1=pyhd3eb1b0_0
dbus=1.13.18=hb2f20db_0
decorator=4.4.2=pyhd3eb1b0_0
expat=2.2.10=he6710b0_2
fontconfig=2.13.0=h9420a91_0
freetype=2.10.4=h5ab3b9f_0
giflib=5.1.4=h14c3975_1
glib=2.66.1=h92f7085_0
gst-plugins-base=1.14.0=h8213a91_2
gstreamer=1.14.0=h28cd5cc_2
icu=58.2=he6710b0_3
imagecodecs=2021.1.11=py38h581e88b_1
imageio=2.9.0=py_0
intel-openmp=2020.2=254
joblib=1.0.0=pyhd3eb1b0_0
jpeg=9b=h024ee3a_2
jxrlib=1.1=h7b6447c_2
kiwisolver=1.3.0=py38h2531618_0
lcms2=2.11=h396b838_0
ld_impl_linux-64=2.33.1=h53a641e_7
lerc=2.2.1=h2531618_0
libaec=1.0.4=he6710b0_1
libdeflate=1.7=h27cfd23_5
libedit=3.1.20191231=h14c3975_1
libffi=3.3=he6710b0_2
libgcc-ng=9.1.0=hdf63c60_0
libgfortran-ng=7.3.0=hdf63c60_0
libpng=1.6.37=hbc83047_0
libstdcxx-ng=9.1.0=hdf63c60_0
libtiff=4.1.0=h2733197_1
libuuid=1.0.3=h1bed415_2
libuv=1.40.0=h7b6447c_0
libwebp=1.0.1=h8e7db2f_0
libxcb=1.14=h7b6447c_0
libxml2=2.9.10=hb55368b_3
libzopfli=1.0.3=he6710b0_0
lz4-c=1.9.3=h2531618_0
matplotlib=3.3.2=h06a4308_0
matplotlib-base=3.3.2=py38h817c723_0
mkl=2020.2=256
mkl-service=2.3.0=py38he904b0f_0
mkl_fft=1.2.0=py38h23d657b_0
mkl_random=1.1.1=py38h0573a6f_0
ncurses=6.2=he6710b0_1
networkx=2.5=py_0
ninja=1.10.2=py38hff7bd54_0
numpy=1.19.2=py38h54aff64_0
numpy-base=1.19.2=py38hfa32c7d_0
olefile=0.46=py_0
openjpeg=2.3.0=h05c96fa_1
openssl=1.1.1i=h27cfd23_0
pandas=1.2.1=py38ha9443f7_0
pcre=8.44=he6710b0_0
pillow=8.1.0=py38he98fc37_0
pip=20.3.3=py38h06a4308_0
pyparsing=2.4.7=pyhd3eb1b0_0
pyqt=5.9.2=py38h05f1152_4
python=3.8.5=h7579374_1
python-dateutil=2.8.1=pyhd3eb1b0_0
pytorch=1.7.1=py3.8_cuda10.2.89_cudnn7.6.5_0
pytz=2021.1=pyhd3eb1b0_0
pywavelets=1.1.1=py38h7b6447c_2
pyyaml=5.4.1=py38h27cfd23_1
qt=5.9.7=h5867ecd_1
readline=8.1=h27cfd23_0
scikit-image=0.17.2=py38hdf5156a_0
scikit-learn=0.23.2=py38h0573a6f_0
scipy=1.5.2=py38h0b6359f_0
setproctitle=1.2.2=py38h27cfd23_1004
setuptools=52.0.0=py38h06a4308_0
sip=4.19.13=py38he6710b0_0
six=1.15.0=py38h06a4308_0
snappy=1.1.8=he6710b0_0
sqlite=3.33.0=h62c20be_0
threadpoolctl=2.1.0=pyh5ca1d4c_0
tifffile=2021.1.14=pyhd3eb1b0_1
tk=8.6.10=hbc83047_0
toolz=0.11.1=pyhd3eb1b0_0
torchvision=0.8.2=py38_cu102
tornado=6.1=py38h27cfd23_0
typing_extensions=3.7.4.3=pyh06a4308_0
wheel=0.36.2=pyhd3eb1b0_0
xz=5.2.5=h7b6447c_0
yaml=0.2.5=h7b6447c_0
zfp=0.5.5=h2531618_4
zlib=1.2.11=h7b6447c_3
zstd=1.4.5=h9ceee32_0
```
Lines changed: 6 additions & 0 deletions

```python
from .configuration import ModelName, Configuration, getConfiguration
from .dataset import StrikeThroughType, StruckDataset
from .utils import PadToSize, composeTransformations, getModelByName

__all__ = ["ModelName", "Configuration", "getConfiguration", "StrikeThroughType", "StruckDataset", "PadToSize",
           "composeTransformations", "getModelByName"]
```
Lines changed: 134 additions & 0 deletions

```python
"""
Contains all code related to the configuration of experiments.
"""
import argparse
import random
import time
from configparser import SectionProxy, ConfigParser
from enum import Enum, auto
from pathlib import Path
from typing import Tuple

import torch


class ModelName(Enum):
    """
    Encodes the names of supported models.
    """
    DENSE = auto()
    RESNET = auto()

    @staticmethod
    def getByName(name: str) -> "ModelName":
        """
        Returns the ModelName corresponding to the given string. Returns ModelName.RESNET in case an unknown name
        is provided.

        Parameters
        ----------
        name : str
            string representation that should be converted to a ModelName

        Returns
        -------
        ModelName representation of the provided string, default: ModelName.RESNET
        """
        if name.upper() in [model.name for model in ModelName]:
            return ModelName[name.upper()]
        else:
            return ModelName.RESNET


class Configuration:
    """
    Holds the configuration for the current experiment.
    """

    def __init__(self, parsedConfig: SectionProxy, test: bool = False, fileSection: str = "DEFAULT"):
        self.fileSection = fileSection
        self.outDir = Path(parsedConfig.get('outdir')) / '{}_{}_{}'.format(fileSection, str(int(time.time())),
                                                                           random.randint(0, 100000))
        if not self.outDir.exists() and not test:
            self.outDir.mkdir(parents=True, exist_ok=True)
        if torch.cuda.is_available():
            self.device = 'cuda'
        else:
            self.device = 'cpu'

        self.epochs = parsedConfig.getint('epochs', 100)
        self.learningRate = parsedConfig.getfloat('learning_rate', 0.0002)
        self.betas = self.parseBetas(parsedConfig.get("betas", "0.5,0.999"))

        self.batchSize = parsedConfig.getint('batchsize', 4)
        self.imageHeight = parsedConfig.getint('imageheight', 128)
        self.imageWidth = parsedConfig.getint('imagewidth', 256)
        self.modelSaveEpoch = parsedConfig.getint('modelsaveepoch', 10)
        self.validationEpoch = parsedConfig.getint('validationEpochInterval', 10)
        self.trainImageDir = Path(parsedConfig.get('trainimgagebasedir'))
        self.testImageDir = Path(parsedConfig.get('testimagedir'))
        self.invertImages = parsedConfig.getboolean('invertImages', False)
        self.padScale = parsedConfig.getboolean('padscale', False)
        self.padWidth = parsedConfig.getint('padwidth', 512)
        self.padHeight = parsedConfig.getint('padheight', 256)

        self.modelName = ModelName.getByName(parsedConfig.get("model", "RESNET"))

        if not test:
            configOut = self.outDir / 'config.cfg'
            with configOut.open('w+') as cfile:
                parsedConfig.parser.write(cfile)

    @staticmethod
    def parseBetas(betaString: str) -> Tuple[float, float]:
        """
        Parses a comma-separated string into a tuple of two floats.

        Parameters
        ----------
        betaString : str
            String to be parsed.

        Returns
        -------
        Tuple of floats.

        Raises
        ------
        ValueError
            if fewer than two values are specified
        """
        betas = betaString.split(',')
        if len(betas) < 2:
            raise ValueError("found fewer than two values for betas")
        return float(betas[0]), float(betas[1])


def getConfiguration() -> Configuration:
    """
    Reads the required arguments from the command line and parses the respective configuration file/section.

    Returns
    -------
    parsed :class:`Configuration`
    """
    cmdParser = argparse.ArgumentParser()
    cmdParser.add_argument("-config", required=False, help="section of config-file to use")
    cmdParser.add_argument("-configfile", required=False, help="path to config-file")
    args = vars(cmdParser.parse_args())
    fileSection = 'DEFAULT'
    fileName = 'config.cfg'
    if args["config"]:
        fileSection = args["config"]

    if args['configfile']:
        fileName = args['configfile']
    configParser = ConfigParser()
    configParser.read(fileName)
    parsedConfig = configParser[fileSection]
    sections = configParser.sections()
    for s in sections:
        if s != fileSection:
            configParser.remove_section(s)
    return Configuration(parsedConfig, fileSection=fileSection)
```
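The `ModelName.getByName` lookup above is case-insensitive and silently falls back to `RESNET` for unknown names. This can be exercised with a self-contained reproduction of the enum (reproduced here only because the package itself is not importable standalone):

```python
from enum import Enum, auto

class ModelName(Enum):
    DENSE = auto()
    RESNET = auto()

    @staticmethod
    def getByName(name: str) -> "ModelName":
        # case-insensitive lookup; unknown names fall back to RESNET
        if name.upper() in [model.name for model in ModelName]:
            return ModelName[name.upper()]
        return ModelName.RESNET

print(ModelName.getByName("dense"))    # ModelName.DENSE
print(ModelName.getByName("unknown"))  # ModelName.RESNET
```

This fallback means a misspelled `model` key in the config file trains a ResNet-based model rather than raising an error, which is worth keeping in mind when editing configurations.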
