OOM in VLM mode #4321
Unanswered
duanxin888
asked this question in
Q&A
Replies: 2 comments 1 reply
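Hi @duanxin888! I'm Dosu and I'm helping the MinerU team. Your hardware (7.5 GB of VRAM, 16 GB of RAM) is indeed prone to OOM in VLM mode, even when processing only a single-page, high-resolution PDF. The main reason is that the model itself takes about 6 GB of VRAM at startup, which leaves very little headroom; when the PDF is rendered to high-resolution images or the content is complex, VRAM and RAM usage grow further. In addition, the VLM model uses a singleton cache: clean_memory can only clear the PyTorch cache and cannot fully release the VRAM held by the model itself. Only restarting the process releases it completely. Optimization suggestions:
Notes:
If the methods above still do not solve the problem, consider trying a lower resolution, processing in batches, or upgrading your hardware.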
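Since the reply above says the model's VRAM is only fully released when the process exits, one way to act on that for batch jobs is to run each file in its own child process. Below is a minimal sketch using only the Python standard library. `run_isolated` and `mineru_cmd` are hypothetical helper names, and the `mineru -p ... -o ... -b vlm-transformers` command line is an assumption about the installed CLI; check `mineru --help` for your version before relying on it.

```python
import subprocess
from typing import Callable, Dict, List, Optional, Sequence


def run_isolated(
    inputs: Sequence[str],
    build_cmd: Callable[[str], List[str]],
    timeout: Optional[float] = None,
) -> Dict[str, int]:
    """Run one child process per input and collect the exit codes.

    When a child exits, the OS and GPU driver reclaim everything it held,
    including VRAM pinned by a cached model, so memory cannot accumulate
    from one file to the next.
    """
    results: Dict[str, int] = {}
    for item in inputs:
        proc = subprocess.run(
            build_cmd(item),
            capture_output=True,  # keep child output out of the parent's console
            text=True,
            timeout=timeout,
        )
        results[item] = proc.returncode
    return results


def mineru_cmd(pdf_path: str) -> List[str]:
    # ASSUMPTION: CLI name, flags, and backend name may differ in your version.
    return ["mineru", "-p", pdf_path, "-o", "output", "-b", "vlm-transformers"]
```

Calling `run_isolated(["a.pdf", "b.pdf"], mineru_cmd)` would then parse the files one at a time, with a nonzero exit code marking a failed file. The trade-off is that the model is reloaded for every file, and the approach does not lower the roughly 6 GB peak of a single run, so it helps with memory building up across files rather than with a single page that already exceeds the available VRAM.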
You could try Colab's free T4 GPU: https://colab.research.google.com/gist/myhloli/a3cb16570ab3cfeadf9d8f0ac91b4fca/mineru_demo.ipynb
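Hello. My GPU currently has 7.5 GB of VRAM and the machine has 16 GB of RAM. When parsing a single-page PDF in VLM mode, I run into OOM. Is there a way to optimize this? The model takes about 6 GB once it has started; can that be reduced?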