Documents with Math formulas #8835

mattwg · 2025-07-15T00:14:22Z

Hi folks - firstly I want to say RAGFlow is an awesome project. After only 24 hours of using it I already have a few ideas for things to contribute, and I am quickly seeing some great results. Thank you!

I work in EdTech, and a lot of the documents we do RAG over are college lecture notes and presentations that contain many math formulas. In some cases RAGFlow accurately identifies the whole formula, but I am also seeing many cases where the bounding-box coordinates seem to be off, and the math that gets detected then makes no sense. I have not experimented with many model providers; I just plugged in gpt-4o for testing. For the PDF parser I am using DeepDoc.

For example, here is a bad result:

You can see that the bounding box was positioned in a way that truncated the formulas.

Ideally I would love a way to increase how often it produces a result like this (a good example):

You can see it identifies the embedded mathematical notation cleanly.

In my application I am using RAG to help an Agent choose images and formulas from the students' notes when explaining concepts, so clean detection of formulas is really important to us.

Any ideas on what I could do to improve the results?

Thanks,
Matt

ZhenhangTung (Collaborator) · 2025-07-21T02:51:18Z

Hi Matt,
Appreciate your feedback. Currently our quality of parsing math formulas is fairly low. The way to increase it is to train a model that is very good at parsing these math formulas.
That's not on our near-term roadmap, sorry about that.
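
As a possible stopgap for the truncated bounding boxes described above, one could pad each detected box before cropping the formula image. This is only a minimal sketch under stated assumptions: `pad_box` is a hypothetical helper and not part of RAGFlow or DeepDoc, and it assumes you can get at the box coordinates as `(x0, y0, x1, y1)` in page units. It will not help when the detector misses the formula entirely.

```python
from typing import Tuple

# Assumed box layout: (x0, y0, x1, y1) in page units, origin at top-left.
Box = Tuple[float, float, float, float]


def pad_box(box: Box, page_width: float, page_height: float,
            margin: float = 0.25) -> Box:
    """Grow a box by `margin` (a fraction of its own width and height)
    on every side, clamped so it never leaves the page."""
    x0, y0, x1, y1 = box
    dx = (x1 - x0) * margin  # horizontal padding
    dy = (y1 - y0) * margin  # vertical padding
    return (
        max(0.0, x0 - dx),
        max(0.0, y0 - dy),
        min(page_width, x1 + dx),
        min(page_height, y1 + dy),
    )


if __name__ == "__main__":
    # Hypothetical formula box on a US Letter page measured in points.
    print(pad_box((100.0, 200.0, 300.0, 240.0), 612.0, 792.0))
```

A larger `margin` lowers the chance of cutting off a formula but pulls in more surrounding text, so the value would need tuning against your own documents.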
