Skip to content
This repository was archived by the owner on Mar 10, 2026. It is now read-only.

[WIP] Adding Differentiable Binarization#2306

Draft
ariG23498 wants to merge 2 commits intokeras-team:OCRfrom
ariG23498:aritra/OCR
Draft

[WIP] Adding Differentiable Binarization#2306
ariG23498 wants to merge 2 commits intokeras-team:OCRfrom
ariG23498:aritra/OCR

Conversation

@ariG23498
Copy link
Contributor

This pull request introduces a preliminary implementation of the paper available at arXiv:1911.08947. The primary focus has been on setting up the dataset, model, and loss functions based on the methodologies outlined in the paper. We are utilizing the ICDAR 2015 dataset for this purpose.

Key Points:

  • Dataset Integration: The ICDAR 2015 dataset has been integrated for training and testing purposes.
  • Model and Losses Setup: Both the model architecture and the loss functions have been defined in accordance with the paper.
  • Current Progress:
    • The implementation is still in its early stages.
    • End-to-end training is not yet established.

Challenges and Immediate Focus:

  • Data Batching: Currently, the data is not batched, which is a significant aspect we need to address for effective training.
  • Loss Function Testing: The loss functions have not been tested thoroughly. This is a critical next step to ensure the model learns as expected.
  • Single Data Point Testing: For immediate testing, we aim to pass a single data point through the model to observe its behavior and output.

Proposers:

sachinprasadhs and others added 2 commits January 18, 2024 15:59
Fixed image.shape issue for tensorflow backend
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants