Hi @navaneeth-nagu 👋,

By default, the only preprocessing we apply is resizing and normalizing the inputs, so no, we don't remove noise :)

from doctr.io import DocumentFile

multi_img_doc = DocumentFile.from_images(["path/to/page1.jpg", "path/to/page2.jpg"])

DocumentFile returns a list of NumPy arrays, so you can do whatever you want with them (for example, any OpenCV operation) before passing the list of arrays to the model/pipeline.
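As a rough sketch of that idea (not something docTR does for you): the snippet below denoises each page before OCR. The helper names `median_filter_3x3`, `preprocess_pages` and `run_ocr` are made up for illustration, and the NumPy-only median filter is just a stand-in for whatever denoising you prefer. It assumes each page is an (H, W, C) uint8 array.

```python
import numpy as np


def median_filter_3x3(img: np.ndarray) -> np.ndarray:
    """Illustrative NumPy-only 3x3 median filter for an (H, W, C) image."""
    # Replicate the border so the output keeps the input shape.
    padded = np.pad(img, ((1, 1), (1, 1), (0, 0)), mode="edge")
    h, w = img.shape[:2]
    # Stack the 9 shifted views of each neighbourhood, then take the per-pixel median.
    windows = np.stack(
        [padded[dy:dy + h, dx:dx + w] for dy in range(3) for dx in range(3)],
        axis=0,
    )
    return np.median(windows, axis=0).astype(img.dtype)


def preprocess_pages(pages, denoise_fn=median_filter_3x3):
    """Apply denoise_fn to every page (hypothetical helper, not part of docTR)."""
    return [denoise_fn(page) for page in pages]


def run_ocr(image_paths):
    """Load, denoise and OCR the images (requires python-doctr to be installed)."""
    # Imported lazily so the helpers above can be used without docTR.
    from doctr.io import DocumentFile
    from doctr.models import ocr_predictor

    pages = DocumentFile.from_images(image_paths)  # list of numpy arrays
    cleaned = preprocess_pages(pages)
    model = ocr_predictor(pretrained=True)
    return model(cleaned)
```

If you have OpenCV installed, you could pass something like `lambda p: cv2.medianBlur(p, 3)` as `denoise_fn` instead of the NumPy filter. Whether denoising actually improves recognition depends on your scans, so it is worth comparing results with and without it.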

Alternatively, you could use an autoencoder to remove noise. I worked on something like this a few years ago (it may be useful as a reference):
https://github.com/felixdittrich92/Document_Scanner#denoising-image

Answer selected by navaneeth-nagu

This discussion was converted from issue #1655 on June 24, 2024 06:50.