Skip to content

aard2 slob to stardict conversion is missing entries #697

@NgrNxk

Description

@NgrNxk

I've created the following dictionary:

╰─❯ slob info perrypedia-2025-11-19.slob                                                                                                                                                             ─╯


perrypedia-2025-11-19.slob
--------------------------
         id: 560cf0ea67b1453c94c22f8d9fbb1de9
   encoding: utf-8
compression: lzma2
 blob count: 64311
  ref count: 100451

Stuff like this works just fine:

╰─❯ slob find perrypedia-2025-11-19.slob Arkoniden                                                                                                                                                   ─╯
6094873 text/html;charset=utf-8 Arkoniden
6160385 text/html;charset=utf-8 Arkonidenanzug
73859074 text/html;charset=utf-8 Arkonidenbengel
6160386 text/html;charset=utf-8 Arkonidenliebe
6160384 text/html;charset=utf-8 Arkoniden (PR Neo)
6160387 text/html;charset=utf-8 Arkonidenzoo (PR Neo)

I then converted this aard2 to stardict via pyglossary -v4 perrypedia-2025-11-19.slob pp-sd --read-format=Aard2Slob --write-format=Stardict and although it looks nice from the outside:

╰─❯ sdcv -2l .                                                                                                                                                                                       ─╯
Dictionary's name   Word count
Perrypedia (de)    64223

The search is strange somehow:

╰─❯ sdcv -2x . Arkoniden                                                                                                                                                                             ─╯
Found 3 items, similar to Arkoniden.
0)Perrypedia (de)-->Arkonidizin
1)Perrypedia (de)-->Arkons Ende
2)Perrypedia (de)-->Koniden
Your choice[-1 to abort]: -1

I don't know what's going on here. The issue also shows up in my KOReader in exactly the same form -- it's not only an issue with sdcv.

Metadata

Metadata

Assignees

No one assigned

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions