I need a crawler or something similar! #20

@kolatubosun

Just came across this page that has hundreds of unique Yoruba names. It turns out that WAEC and other admission lists are where you find lots of unique names, because they usually list people's middle names.

In any case, wouldn't it be nice to have something that can tell me which of the names on the page aren't currently in the dictionary? It could take the form of an Excel upload. How it might work: I copy the names, put them in Excel, upload the file, and the machine gives me an output that has ONLY the names not in the dictionary.

The result will inevitably include Hausa, Igbo and Yoruba names, but the Yoruba ones will be the distinct names not yet in the dictionary. The next step is then simply to remove every name that is not Yoruba. I can do this manually, because it would be hard to train a machine to do this kind of task. After that, I can re-upload the distinct Yoruba names via our usual Excel spreadsheet.
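
To make the idea concrete, here is a rough sketch of the comparison step only, not of the upload or the Excel handling. It assumes the pasted names and the dictionary entries are already available as plain lists of strings. The function names `normalize` and `names_not_in_dictionary` and the sample names are made up for illustration, and comparing names with tone marks and case ignored is an assumption about how matching should work, not something the dictionary is known to do.

```python
import unicodedata


def normalize(name: str) -> str:
    """Build a comparison key: trimmed, case-folded, combining marks removed.

    Assumption: pasted lists may lack tone marks and under-dots, so names are
    compared without them. This can merge names that differ only by tone.
    """
    decomposed = unicodedata.normalize("NFD", name.strip())
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.casefold()


def names_not_in_dictionary(candidates, dictionary_names):
    """Return the candidate names whose key is absent from the dictionary.

    Input order is preserved, and repeated candidates are reported once.
    """
    known = {normalize(n) for n in dictionary_names}
    seen = set()
    missing = []
    for raw in candidates:
        key = normalize(raw)
        if not key or key in known or key in seen:
            continue  # blank cell, already in the dictionary, or a repeat
        seen.add(key)
        missing.append(raw.strip())
    return missing


if __name__ == "__main__":
    # Hypothetical sample data, for illustration only.
    dictionary_names = ["Adébáyọ̀", "Olúwaṣeun"]
    pasted = ["Adebayo", "Chukwuemeka", "Ayomide", "ayomide ", "Oluwaseun", "Musa"]
    print(names_not_in_dictionary(pasted, dictionary_names))
    # → ['Chukwuemeka', 'Ayomide', 'Musa']
```

The output still mixes Yoruba and non-Yoruba names, so the manual clean-up step described above stays as it is.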

Doable?
