fix txt writing issue#11
Conversation
|
In this issue #4 was a bug that the library torch tried to allocate 184.32 GiB I think that doing the thing you do it can solve the bug. Can you try it please? |
|
@White-Mask-230 I don't think the #4 is related to this issue. I'm training with a customized dataset that has ~1400 images and ~366k initial points. I got the error that "torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 1050588.79 GiB"!!! I'm also anticipating that issue to be addressed. |
Hi, thanks for bringing this up! Could you tell us about your setup (OS / CUDA / Pytorch version)? |
Ok, I think that it could work because it was an optimitation. I try but no. About your pull request for me is correct |
@Snosixtyboo I'm using an Ubuntu 20.04 machine, with cuda 12.1 and pytorch 2.3.0 |
Ok, thanks a lot! We have seen issues with cuda 12.1 on Ubuntu, that SEEM to go away with 12.3 or 12.5. We are trying different things to make it work with 12.1 / 11.8 etc, because there is no good reason why it shouldn't, unless there is a really low-level bug in the build toolchain. But in the meantime, you may try if 12.3 or 12.5 works out for you! Best, |
True, but in my point of view, I think that every big problem starts with a small solution |
Agreed. Thats why ive been looking into this bug for 3 days now. It is super elusive and seems somehow related with internal GPU memory management, depending on different build settings and how the application is loaded. |
|
The GPU memory problem should be solve as described in this issue I think the txt writing issue is not related to the GPU memory issue, and should be addressed by this PR. |
For details, please see #10