Skip to content

Search in GlobalView #45

Open
Open
@Nolski

Description

@Nolski

Overview

GlobalView refers to a single instance of Mediawiki which can query every single project running on a single torque instance. A key feature in GlobalView will be able to run search queries and get results from every project in Torque.

Suggested Solution

I propose that we use the Postgresql Full Text Search functon instead of maintaining a separate database and series of caches as we do now. The benfits I see this approach bringing are:

  • By keeping our search functionality within Postgres we can minimize the number of places that we need to update data when updates are made.
  • Search queries can be made through our ORM which will simplify development with regards to search.
  • We will not need to maintain a search specific database.

An example of the ORM utilizing postgres full text search can be found here

Potential Difficulties

A requirement of search is that our searchable items have a significant amount of unstructured data which only a portion of which must be searchable depending on the permissions of the user performing the search.

A workaround that I propose is that we maintain a column for each permission type. We will then sort the unstructured data into the various permission columns. When we make search queries, depending on the user's permissions, we will query the necessary columns. An example of our searchable table is below:

proposal_name proposal_date ... unstructured_data_confidential unstructured_data_sensitive
Proposal 1 15 July 2020 ... { ... } { ... }
Proposal 2 15 July 2020 ... { ... } { ... }
Proposal 2 15 July 2020 ... { ... } { ... }

note:
I strongly advise that we strive to remove unstructured proposal data wherever possible. Not doing so will make it increasingly difficult to make use of the features relational databased provide to us.

Alternative Solution

I will leave this area sparse for others to fill in and suggest. Our current alternative that I see is we continue to maintain a woosh database with a series of caches for each user type. Continuing to to do this will mean costly rebuilding of search indexes when modifications are made.

Metadata

Metadata

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions