[Feature] Add. PdfDoc.saveToTargetPath() for Writing PDF Directly to File Path #87

YoungSeok-Choi · 2025-01-26T08:24:48Z

What?

Currentiy there is only way to save an modified PDF File(PDFDocument.save()) which is creates a entire Copy of PDF size buffer

this PR add a new feature(PDFDocument.saveToTargetPath()) totally same functionality with PDFDocument.save() but reduces memory usage by writing PDF data directly to specific directory path (without copying Unit8Array)

Why?

I created this feature to address high memory usage when working in serverless environments (e.g., AWS Lambda). When processing large PDFs (e.g., 1GB), the current implementation requires at least 2GB of memory, resulting in substantial resource costs.

This PR reduces memory requirements, which is especially important in memory-constrained environments.

For more details, please refer to the related issue I created:

please consider at below issue (i craeted that issue)
[Feature Proposal] PDFWriter .save() with specific file path #71

How?

PDFDocument.save(): Creates a Uint8Array for the entire size of the modified PDF file and writes this array.
PDFDocument.saveToTargetPath(): Writes the modified PDF file directly to a specified path using a Writable stream, avoiding the creation of a large Uint8Array.

Testing?

Verifying the input(outputPath) options is valid.
Ensuring that the results from PDFDocument.save() and PDFDocument.saveToTargetPath() are identical.

New Dependencies?

No Additional Dependency

Screenshots

Anything Else?

Nothing!

Checklist

Sharcoux · 2025-01-27T02:28:54Z

This lib is cross-platform, including web. How can this work?

YoungSeok-Choi · 2025-01-27T05:59:31Z

This lib is cross-platform, including web. How can this work?

Ahh.. right!
i should consider other platforms
i will do about that

thanks for reviewing! 🙏🙏

YoungSeok-Choi · 2025-01-27T06:05:05Z

This lib is cross-platform, including web. How can this work?

By the way
The main concept of PdfDoc.saveToTargetPath() is that any kind of modified PDF
can write the fIle to specific directory Path (formed PDF format) regardless platform

currently my PR Only might work only Node.js Environment

Sharcoux · 2025-02-06T07:24:16Z

Maybe for other environments we could just write an error in the console explaining that this feature is not available on that environment?

YoungSeok-Choi · 2025-02-06T14:30:49Z

Maybe for other environments we could just write an error in the console explaining that this feature is not available on that environment?

That's an idea!

Fist of all, i would try to handle all other environments include Node.js but, if it's hard to cover
then i will figure it out the way to report an error to non-node.js environments.

let me know you by next weekend @Sharcoux

YoungSeok-Choi · 2025-03-18T23:04:29Z

i had some issue of memory usage of this feature.. 😂
beacause of lack of knowledge of stream(node)

give me some time for working on that!

cjam · 2025-04-08T17:07:22Z

Would this PR help with adding streaming support to this library? This library is great, but I've ran into a memory issue in a serverless environment since it doesn't support streams it consumes more memory than my serverless function is allocated. In my case, I'm merging pdfs and would love to be able to stream pdfs into a writeable stream and direct it to disk, allowing this library to merge larger documents while maintaining constant-ish memory usage.

As per the feedback from @Sharcoux (WRT to compatbility), perhaps the Web streams api would be a better a more compatible approach which would allow the stream to be redirected to a file in node, or a blob or something in the browser.

I'd be happy to assist, but might be more effective with a pointer as to what portion of the codebase to focus on / high level approach.

Sharcoux · 2025-04-08T17:43:54Z

It would not. We read the pdf before parsing it. Then you can write it. Streaming won't help much as long as we need the full data to do the parsing.

Sharcoux · 2025-04-08T17:47:22Z

But that PR might still improve memory perfs as highlighted by @YoungSeok-Choi

I agree with the use of web stream api

cjam · 2025-04-08T18:04:24Z

@Sharcoux thanks for the quick reply, is there any issue in the backlog for streaming support? I looked in this repo but didn't find anything, I think the original repo had a few things in there backlog / roadmap. Would be great to land have a discussion about approach / feasability.

Sharcoux · 2025-04-22T08:42:31Z

It's unlikely that we'll work on this, but feel free to open a PR and start discussing stuff. This is a community package. We'll just monitor the PRs and releases, and try to help when we can.

Add. PdfDoc.saveToTargetPath stream write feature

36136ef

github-actions bot added the needs-triage label Jan 26, 2025

YoungSeok-Choi mentioned this pull request Jan 26, 2025

[Feature Proposal] PDFWriter .save() with specific file path #71

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Feature] Add. PdfDoc.saveToTargetPath() for Writing PDF Directly to File Path #87

[Feature] Add. PdfDoc.saveToTargetPath() for Writing PDF Directly to File Path #87

Uh oh!

YoungSeok-Choi commented Jan 26, 2025 •

edited

Loading

Uh oh!

Sharcoux commented Jan 27, 2025

Uh oh!

YoungSeok-Choi commented Jan 27, 2025

Uh oh!

YoungSeok-Choi commented Jan 27, 2025

Uh oh!

Sharcoux commented Feb 6, 2025

Uh oh!

YoungSeok-Choi commented Feb 6, 2025 •

edited

Loading

Uh oh!

YoungSeok-Choi commented Mar 18, 2025

Uh oh!

cjam commented Apr 8, 2025

Uh oh!

Sharcoux commented Apr 8, 2025

Uh oh!

Sharcoux commented Apr 8, 2025

Uh oh!

cjam commented Apr 8, 2025

Uh oh!

Sharcoux commented Apr 22, 2025

Uh oh!

Uh oh!

[Feature] Add. PdfDoc.saveToTargetPath() for Writing PDF Directly to File Path #87

Are you sure you want to change the base?

[Feature] Add. PdfDoc.saveToTargetPath() for Writing PDF Directly to File Path #87

Uh oh!

Conversation

YoungSeok-Choi commented Jan 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What?

Why?

How?

Testing?

New Dependencies?

Screenshots

Suggested Reading?

Anything Else?

Checklist

Uh oh!

Sharcoux commented Jan 27, 2025

Uh oh!

YoungSeok-Choi commented Jan 27, 2025

Uh oh!

YoungSeok-Choi commented Jan 27, 2025

Uh oh!

Sharcoux commented Feb 6, 2025

Uh oh!

YoungSeok-Choi commented Feb 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

YoungSeok-Choi commented Mar 18, 2025

Uh oh!

cjam commented Apr 8, 2025

Uh oh!

Sharcoux commented Apr 8, 2025

Uh oh!

Sharcoux commented Apr 8, 2025

Uh oh!

cjam commented Apr 8, 2025

Uh oh!

Sharcoux commented Apr 22, 2025

Uh oh!

Uh oh!

YoungSeok-Choi commented Jan 26, 2025 •

edited

Loading

YoungSeok-Choi commented Feb 6, 2025 •

edited

Loading