pl-dcm_txtlocr is a ChRIS ds plugin which locates text in DICOM files and saves it to a specified output directory. It can optionally use GPU acceleration if available.
This plugin processes DICOM files, extracts text present in the pixel data of a DICOM file, and creates output files in the desired format. It is designed to work on local machines or inside a ChRIS pipeline and supports GPU acceleration for faster text extraction.
pl-dcm_txtlocr is a ChRIS plugin, meaning it can run either inside ChRIS or from the command line using container technologies like Apptainer.
Run locally with Apptainer:
apptainer exec docker://fnndsc/pl-dcm_txtlocr dcm_txtlocr [--args values...] input/ output/
To print its available options, run:
```shell
apptainer exec docker://fnndsc/pl-dcm_txtlocr dcm_txtlocr --helpdcm_txtlocr requires two positional arguments: a directory containing
input data, and a directory where to create output data.
First, create the input directory and move input data into it.
mkdir incoming/ outgoing/
mv *.dcm incoming/
apptainer exec docker://fnndsc/pl-dcm_txtlocr:latest dcm_txtlocr incoming/ outgoing/
apptainer exec docker://fnndsc/pl-dcm_txtlocr:latest dcm_txtlocr \
-o text_output \
-f "*.dcm" \
-t "txt" \
-u \
incoming/ outgoing/Instructions for developers.
Build a local container image:
docker build -t localhost/fnndsc/pl-dcm_txtlocr .Mount the source code phi_detector.py into a container to try out changes without rebuild.
docker run --rm -it --userns=host -u $(id -u):$(id -g) \
-v $PWD/phi_detector.py:/usr/local/lib/python3.12/site-packages/dcm_txtlocr.py:ro \
-v $PWD/in:/incoming:ro -v $PWD/out:/outgoing:rw -w /outgoing \
localhost/fnndsc/pl-dcm_txtlocr dcm_txtlocr /incoming /outgoingRun unit tests using pytest.
It's recommended to rebuild the image to ensure that sources are up-to-date.
Use the option --build-arg extras_require=dev to install extra dependencies for testing.
docker build -t localhost/fnndsc/pl-dcm_txtlocr:dev --build-arg extras_require=dev .
docker run --rm -it localhost/fnndsc/pl-dcm_txtlocr:dev pytestSteps for release can be automated by Github Actions. This section is about how to do those steps manually.
Increase the version number in setup.py and commit this file.
Build and push an image tagged by the version. For example, for version 1.2.3:
docker build -t docker.io/fnndsc/pl-dcm_txtlocr:1.2.3 .
docker push docker.io/fnndsc/pl-dcm_txtlocr:1.2.3
Run chris_plugin_info
to produce a JSON description of this plugin, which can be uploaded to ChRIS.
docker run --rm docker.io/fnndsc/pl-dcm_txtlocr:1.2.3 chris_plugin_info -d docker.io/fnndsc/pl-dcm_txtlocr:1.2.3 > chris_plugin_info.jsonIntructions on how to upload the plugin to ChRIS can be found here: https://chrisproject.org/docs/tutorials/upload_plugin