Skip to content

enhancement: redirections to locutions #2

@rubenperezm

Description

@rubenperezm

There could be 4 different IDs in a page:

  • Definition id (in the article tag)
  • Meaning id (in the li of the first ol within an article)
  • Locution Name id (locutions appear in a h3)
  • Locution Meaning id (in li of a locution's ol)

We are currently working with the first, second and fourth types. The third type is not being extracted since most of the redirections to locutions refer to one of their meanings. However, there are ~700 (~0.5% of the total number of meanings) that refer to the locution name. At this point we consider them exceptions. It is not a big deal since locutions are barely asked in the show, but we probably we could handle them by extracting the h3 IDs and creating a dict to store the relations (key: locution name id, value: locution meaning id) to use it in the next stage.

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions