Skip to content

speeding up search by restricting the number of docs to search #194

Open
@punkish

Description

I built a db of 5000 articles and ran 9 queries against it. The response time varied between 6 and 16 seconds. This is understandably too slow for a real life application. But I notice that not only does the app search over all my 5000 articles, for some of it answers it also draws from outside the domain. I asked in #190 if I could restrict the search to just my data. But now I am thinking it would be even better if, optionally, I could also restrict the search to a subset of my data.

For example, if my (scientific) articles are about butterflies, ants, and spiders, and someone asks a question that is obviously related to ants, the search could be restricted to only the ants-related documents. Of course, this would imply storing some kind of metadata that would allow such sub-setting. Could that be possible?

I can think of doing a db JOIN against my original articles which have all kinds of metadata using the rowid (or some primary key), using a WHERE clause to narrow the basket, and then running the search on just that basket.

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions