Description
Overview
GlobalView refers to a single instance of Mediawiki which can query every single project running on a single torque instance. A key feature in GlobalView will be able to run search queries and get results from every project in Torque.
Suggested Solution
I propose that we use the Postgresql Full Text Search functon instead of maintaining a separate database and series of caches as we do now. The benfits I see this approach bringing are:
- By keeping our search functionality within Postgres we can minimize the number of places that we need to update data when updates are made.
- Search queries can be made through our ORM which will simplify development with regards to search.
- We will not need to maintain a search specific database.
An example of the ORM utilizing postgres full text search can be found here
Potential Difficulties
A requirement of search is that our searchable items have a significant amount of unstructured data which only a portion of which must be searchable depending on the permissions of the user performing the search.
A workaround that I propose is that we maintain a column for each permission type. We will then sort the unstructured data into the various permission columns. When we make search queries, depending on the user's permissions, we will query the necessary columns. An example of our searchable table is below:
proposal_name | proposal_date | ... | unstructured_data_confidential | unstructured_data_sensitive |
---|---|---|---|---|
Proposal 1 | 15 July 2020 | ... | { ... } | { ... } |
Proposal 2 | 15 July 2020 | ... | { ... } | { ... } |
Proposal 2 | 15 July 2020 | ... | { ... } | { ... } |
note:
I strongly advise that we strive to remove unstructured proposal data wherever possible. Not doing so will make it increasingly difficult to make use of the features relational databased provide to us.
Alternative Solution
I will leave this area sparse for others to fill in and suggest. Our current alternative that I see is we continue to maintain a woosh database with a series of caches for each user type. Continuing to to do this will mean costly rebuilding of search indexes when modifications are made.