Managing releases

Automatic Releases

When it is time to make a release from the Test environment to Acceptance, or from Acceptance to Production, we need to:

  • update the relevant deployment files to point to the relevant image tags
  • create a GitHub release
  • compile the changes into a release notes document
  • deploy the YAML files to Kubernetes

This process can be automated using the release script in the deployment repository. See the README in that repository for more detailed information.

Cleaning data from the database/Elasticsearch/MongoDB in Test or Acceptance environments

We do not want the Test/Acceptance environments to grow unchecked. Once a non-production environment holds more than 5 million items, it is time to clear it out to save costs. Afterwards, re-ingest key datasets.

Clean up the relevant data from the data storage layer. Depending on the data, this could be in three places: the database, the Elasticsearch index, and/or MongoDB. Not all objects are stored in all three storage solutions. For example, DigitalSpecimen data is stored in all three, but MAS information is stored only in the database and MongoDB. Breaking changes in the DigitalSpecimen data model therefore need to be applied in all three solutions, while MAS changes only affect two. Breaking changes do not include additions, as we can always add extra attributes; structural changes to the existing model, however, do require a purge of storage.

Database purge

To purge data from the database we can truncate the table; only in very few instances do we need to fully drop it. We truncate because the data model in the JSONB column has changed and is no longer compatible with the new model. Truncating can be done through your favourite DB IDE with a TRUNCATE command.
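
A minimal sketch, assuming PostgreSQL (implied by the JSONB column) and a hypothetical table named digital_specimen; the actual table names are defined in the deployment repository:

-- Remove all rows but keep the table definition and indexes.
-- RESTART IDENTITY resets sequences owned by the table; CASCADE also
-- truncates tables that reference it via foreign keys.
TRUNCATE TABLE digital_specimen RESTART IDENTITY CASCADE;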

Elasticsearch index purge

A purge of Elasticsearch is only necessary for the DigitalSpecimen, DigitalMediaObject, and Annotation objects. To purge data from Elasticsearch we first port-forward to Kibana:

k port-forward -n elastic service/kibana-kb-http 5601

We then go to https://localhost:5601/login?next=%2Fapp%2Fhome (ignore the cert warning) and log in with the credentials. We can now use Kibana (Stack Management -> Index Management) to delete and recreate the specimen and media indexes.

After the index has been deleted, its mappings need to be recreated. This can be done through the API or the Kibana Dev Tools.
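
If you prefer the Dev Tools console over the UI, deleting and recreating an index is a two-liner. The digital-media-object index from the examples below is used here; the same applies to the specimen index:

DELETE /digital-media-object
PUT /digital-media-object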

First, update the field limit on each index:

PUT /digital-media-object/_settings
{
 "index.mapping.total_fields.limit": 2000
}

The mapping is available in the deployment repository. You may run the schema generation script to make sure the index is up to date with the latest version of OpenDS. When you have the mapping, update the index you just created:

PUT /digital-media-object/_mapping
{
  "properties": {
    // mapping here
  }
}
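
To verify that the mapping was applied, you can read it back in the same console:

GET /digital-media-object/_mapping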

MongoDB collection purge

In MongoDB we store versions of our objects; for almost all objects we publish a new version on every change. To purge all old versions (in the old data model) we can drop the collection. This is the fastest way to remove all the data, but it does require us to recreate the collection and add the indices afterwards (see Setup indices for mongodb). Logging into MongoDB can be done by opening a port-forward (make sure the mongodb tunnel is running):

k port-forward service/mongodb-tunnel 27017

After the port is opened, you can log in through MongoDB Compass, or through the CLI via a pod on the network, as explained in the Installation guide.
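
A minimal mongosh sketch; the collection name and index definition here are illustrative, the real ones are in the setup documentation:

// Drop the collection, removing all old versions at once.
db.digitalSpecimen.drop()

// MongoDB recreates the collection implicitly when the first index is
// created on it; see "Setup indices for mongodb" for the actual indices.
db.digitalSpecimen.createIndex({ "id": 1, "version": 1 }, { unique: true })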

(Database changed) Database update: We can now deploy any database changes. We are not yet using a database migration tool like Flyway or Liquibase, so this needs to be done by hand. Deploy all changes to the database; they should be available as SQL statements in the deployment repository.
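
Purely as an illustration of the kind of statement involved (the table and column are made up; the real statements are in the deployment repository):

-- Example of a hand-run schema change; names are hypothetical.
ALTER TABLE digital_specimen ADD COLUMN last_checked timestamp with time zone;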

(Database changed) Elastic update: Deploy the latest version of the mapping or update the existing one. The statements should be available as commands in the deployment repository.

Update deployment files: Update all the deployment files needed for the release. Keep an eye out for new or changed environment variables, and for new Secrets which need to be added. A PR can be made so that others can review the changes. Be aware that merging the PR will kick off ArgoCD, which starts the actual deployment of these files on k8s. A sketch of the kind of spot to check is shown below.
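
Every name in this Deployment manifest excerpt is illustrative; it only shows where new environment variables and Secret references tend to appear:

# Excerpt from a Kubernetes Deployment manifest (spec.template.spec); names are made up.
containers:
  - name: example-service
    image: example-registry/example-service:1.2.0  # point at the release image tag
    env:
      - name: NEW_FEATURE_FLAG                     # new environment variable this release
        value: "true"
      - name: DB_PASSWORD
        valueFrom:
          secretKeyRef:                            # Secret must exist before the PR is merged
            name: example-db-secret
            key: password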

(Breaking release) Ingest data: When there were breaking changes in the release, we may need to re-ingest the data. This ensures that all the data adheres to the new data model.
