如何获得pdf的正文 #482
Replies: 4 comments
-
我已经写了个脚本可以用LLM总结正文:#457。要读取pdf我是这样做的: let pdfAttachment = await item.getBestAttachment();
let pdfPath = await pdfAttachment.getFilePath();
let fileData = await IOUtils.read(pdfPath); 解析pdf文件我是搭建了http服务,将文件发送给服务器做解析。 |
Beta Was this translation helpful? Give feedback.
-
This issue is stale because it has been open for 30 days with no activity. |
Beta Was this translation helpful? Give feedback.
-
qyzhang提供了非常好的想法!由于在zotero内debug可能比较困难,为了分离与Zotero文献交互和实际处理的逻辑,更好的方法是本地运行http服务,zotero把pdf交给服务,服务去获得pdf正文,调用ai,服务返回各个标签给zotero,记录在zotero中。 这样来看,zotero像一个写好的前端。如果您要做后续数据分析,甚至可以存在数据库里或csv里。这样可以轻松提取各文献关键信息并汇总,适用于meta分析或者其他需要对文献批量提同样问题的场景 |
Beta Was this translation helpful? Give feedback.
-
|
Beta Was this translation helpful? Give feedback.
-
Is there an existing issue for this?
Environment
Describe the feature request
我想开发一个script,可以自动提取一片论文(pdf格式)的content,并且用chatgpt生成对应的summary和tags。我想知道要如何在action脚本中获得当前pdf的所有正文内容呢?
Describe the solution you'd like
获得pdf正文内容的api。
Anything else?
No response
Beta Was this translation helpful? Give feedback.
All reactions