Fix regex bugs with replaced code. Simplified implementation as well. #8

jaxley · 2024-09-24T18:58:21Z

The regex code was not properly matching links with special characters in them. This was causing the path to be a truncated part of the image filename. When this truncated path matched an existing image file, that file would have OCR text inserted into the file. This caused lots of files to have the same (wrong) OCR text.

I changed the way the images were identified to pull from each file's embeds list. No need to use regex to find them. This lets you use the API to locate the file path matching that name using Obsidian's own logic for resolving the file name. There's no need to have an image path parameter and use that to know what is/is not an image. That was a rigid design that didn't work for many configurations of Obsidian and resulted in many images never getting OCR text due to their location.

Once we had a list of the embeds, this allows for directly finding the location of that link in the file using the exact link text. And associating the embed with its corresponding TFile object, we can get its path without re-creating it (incorrectly). This resolves the original bug.

Jason Axley added 4 commits September 24, 2024 11:51

Fix regex bugs with replaced code. Simplified implementation as well.

6008235

Adding regex escape function I forgot to include.

a56cdae

Adding eslint basic config and package lock.

5304254

Fixed bug - embed.link is not the markdown "link" -- embed.original is!

5b67c3f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix regex bugs with replaced code. Simplified implementation as well. #8

Fix regex bugs with replaced code. Simplified implementation as well. #8

jaxley commented Sep 24, 2024

Fix regex bugs with replaced code. Simplified implementation as well. #8

Are you sure you want to change the base?

Fix regex bugs with replaced code. Simplified implementation as well. #8

Conversation

jaxley commented Sep 24, 2024