This repository holds a helm chart for geOrchestra. This README file aims to present some of the features and/or implementation choices.
IMPORTANT: Don't create too many versions; test your changes first (using git submodules, for example) and batch several features into a single new version if possible.

- Change the version in the Chart.yaml. Please follow https://semver.org: bump the MINOR version if you are adding a new feature, or the PATCH version if it's a bugfix.
- Write a changelog entry in the CHANGELOG.md.
- Push your changes.
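As an illustration, a release could look like the following minimal sketch, assuming a standard git workflow (the version number is hypothetical, adapt it to your case):

```shell
# Bump the chart version in Chart.yaml (e.g. 1.2.3 -> 1.3.0 for a new feature)
# and document the changes in CHANGELOG.md, then:
git add Chart.yaml CHANGELOG.md
git commit -m "Release chart version 1.3.0"   # hypothetical version number
git push
```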
All the Helm charts are now stored in the GitHub container registry (GHCR).
Recheck the Quick start tutorial below. In summary, you need to change from:
    helm repo update
    helm upgrade --install -f your-values.yaml georchestra georchestra/georchestra

to

    helm upgrade -f your-values.yaml georchestra oci://ghcr.io/georchestra/helm-georchestra/georchestra --version X.X.X
WARNING: Replace X.X.X with the latest version of the Helm chart, found in https://github.com/georchestra/helm-georchestra/blob/main/Chart.yaml#L18
Quick start:

- Install an Ingress Controller if you don't already have one.
  The NGINX Ingress Controller is a good example:

      helm upgrade --install ingress-nginx ingress-nginx \
        --repo https://kubernetes.github.io/ingress-nginx \
        --namespace ingress-nginx --create-namespace

- Execute this command to install the georchestra chart:

      helm install georchestra oci://ghcr.io/georchestra/helm-georchestra/georchestra --version X.X.X --set fqdn=YOURDOMAIN

  Note: for the domain you can use `georchestra-127-0-1-1.traefik.me`, just replace `127-0-1-1` with the IP address of your server.

- Go to https://YOURDOMAIN
For a customized installation:

- Create a new, separate 'values' file (or edit the existing one, which is not recommended).
  Edit parameters like `fqdn`, `database` or `ldap` if needed.
- Install an Ingress Controller if you don't already have one.
  The NGINX Ingress Controller is a good example:

      helm upgrade --install ingress-nginx ingress-nginx \
        --repo https://kubernetes.github.io/ingress-nginx \
        --namespace ingress-nginx --create-namespace

- Execute this command to install the georchestra chart:

      helm install -f your-values.yaml georchestra oci://ghcr.io/georchestra/helm-georchestra/georchestra --version X.X.X

- Go to https://YOURDOMAIN
To upgrade (applies only to a customized installation):

    helm upgrade -f your-values.yaml georchestra oci://ghcr.io/georchestra/helm-georchestra/georchestra --version X.X.X
The helm chart provides the possibility to clone the datadir from a remote repository using git. A secret SSH key can be provided to do so.
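For illustration only, such a configuration could look like the sketch below in your values file; the key names used here are hypothetical, check the chart's values.yaml for the actual parameters:

```yaml
# Hypothetical values sketch: key names are illustrative only,
# refer to the chart's values.yaml for the real parameter names.
georchestra:
  datadir:
    git:
      url: git@github.com:your-org/your-datadir.git   # remote datadir repository (example)
      ref: main                                       # branch or tag to check out (example)
      ssh_secret: georchestra-datadir-ssh-key         # secret holding the SSH private key (example)
```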
On the mapstore2 webapp, we are using the following strategy:

- the `georchestra_datadir` is bootstrapped from a git repository, the same way as for the other geOrchestra components
- we mount the mapstore dynamic datadir onto `/mnt/mapstore2`

An initContainer takes care of overriding the `themes` and `templates` subdirectories of the webapp classpath, if
such directories exist in the geOrchestra data directory.
As a result, it is possible to override the default geOrchestra theme by creating a `cas/themes` and a `cas/templates` subdirectory
in the geOrchestra datadir.
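For example, assuming you have a local checkout of your datadir repository, overriding the theme boils down to adding those directories and pushing them (paths shown for illustration):

```shell
# In a local checkout of your geOrchestra datadir (illustrative):
mkdir -p cas/themes cas/templates
# drop your customized theme and template files in there, then:
git add cas/themes cas/templates
git commit -m "Override default geOrchestra theme"
git push
```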
The healthchecks for the different webapps have been implemented using `livenessProbe`s and `startupProbe`s.
We checked in the logs how long each webapp needed to bootstrap itself, then multiplied that time by 3
to keep a safety margin.
Here is an example with the `analytics` webapp:

    2022-02-21 20:53:49.500:INFO:oejs.AbstractConnector:main: Started ServerConnector@587d1d39{HTTP/1.1, (http/1.1)}{0.0.0.0:8080}
    2022-02-21 20:53:49.500:INFO:oejs.Server:main: Started @2865ms
If we round the `2865ms` up to 3 seconds, multiplying by 3 gives us 9 seconds; let's round it to 10 seconds.
This gives the following yaml specification:
    livenessProbe:
      httpGet:
        path: /analytics/
        port: 8080
      periodSeconds: 10
    startupProbe:
      tcpSocket:
        port: 8080
      failureThreshold: 5
      periodSeconds: 10
In practice, the container gets 5 trials of 10 seconds each to start before being considered as failing. Afterwards, the container
is probed every 10 seconds on its `/analytics/` endpoint during its lifetime, to check whether it is still alive.
If it is not, after 2 more trials (`failureThreshold` being set to 3), the container is considered as failing and is restarted.
CAS takes around 26 seconds to boot. Multiplying by 3 might be a bit overkill, so 60 seconds is used instead. The threshold still allows 5 trials of 60 seconds anyway.
GeoNetwork is a particular case, as the `httpGet` request can take longer than expected depending on the wro4j cache. A custom `timeoutSeconds`
has been set to 5 seconds.
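As an illustration, the relevant part of the probe could look like the sketch below; the path and port are assumptions here, refer to the geonetwork deployment template for the actual definition:

```yaml
# Illustrative sketch: path and port are assumptions,
# see the geonetwork deployment template for the real values.
livenessProbe:
  httpGet:
    path: /geonetwork/
    port: 8080
  periodSeconds: 10
  timeoutSeconds: 5   # custom timeout to account for wro4j cache generation
```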
About the `livenessProbe`s, we kept the default `failureThreshold` of 3, meaning that 3 successive failed attempts are needed before the container is considered as failing.
The `mapstore` docker image makes use of Tomcat instead of Jetty, so we use a `startupProbe` based on `httpGet`,
similar to the one used for the `livenessProbe`, instead of the `tcpSocket` one used on the other, Jetty-based containers.
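A sketch of what this can look like, assuming the `/mapstore/` path and port 8080 (check the mapstore deployment template for the actual values):

```yaml
# Illustrative sketch: path and port are assumptions.
startupProbe:
  httpGet:
    path: /mapstore/
    port: 8080
  failureThreshold: 5
  periodSeconds: 10
```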
The `ldap` deployment object has been configured with only a `livenessProbe`, as we cannot tell whether the ldap database has
already been initialized or not. From some tries with a single pod, it has been estimated that 30 seconds should be
sufficient to populate the database initially.
The `datafeeder` (backend) now requires a valid account passed as a base64-encoded HTTP header (the `sec-user`
header, maybe some others). As a result, testing liveness via an HTTP request is not easy. We could still pass some headers to the request, but since it requires a valid account,
we may as well stick to a simpler `tcpSocket` probe.
The default update strategy in Kubernetes being `RollingUpdate`, we can end up in a situation where new containers are created but never started, because they are waiting for the volumes to be released by the former pods.
As a result, several deployments run with a `.strategy.type` set to `Recreate` instead of `RollingUpdate` (see the snippet after the list below). This is basically the case for
every deployment which makes use of persistent volumes:
- elasticsearch (used by GeoNetwork4)
- geonetwork
- geoserver
- mapstore
- openldap
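In a Deployment manifest, this boils down to the following fragment (standard Kubernetes syntax, shown here for reference):

```yaml
# Deployment spec fragment: recreate pods instead of rolling them,
# so persistent volumes are released before the new pod starts.
spec:
  strategy:
    type: Recreate
```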
The requests and limits for the allocated CPU and RAM are configurable in the `values.yaml`
file for each component.
You can configure them with the available `resources` parameter:
    resources:
      limits:
        cpu: 2000m
        memory: 4Gi
      requests:
        cpu: 1000m
        memory: 2Gi
This config will request 1 CPU and 2Gi of RAM at launch, and limit consumption to 2 CPUs and 4Gi of RAM.
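For instance, to tune a single component from your own values file, the `resources` block is set under that component's key; the nesting under `geoserver` below is an assumption, verify it against the chart's values.yaml:

```yaml
# Illustrative override in your-values.yaml: the nesting under the
# component name is an assumption, check the chart's values.yaml.
geoserver:
  resources:
    requests:
      cpu: 1000m
      memory: 2Gi
    limits:
      cpu: 2000m
      memory: 4Gi
```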
Below are the default configurations; they were determined for a test environment (requests + limits):
|                     | CPU - Requests | CPU - Limits | RAM - Requests | RAM - Limits |
|---------------------|----------------|--------------|----------------|--------------|
| console             | 100m           | unset        | 1024Mi         | 1024Mi       |
| datafeeder          | 100m           | unset        | 512Mi          | 512Mi        |
| datafeeder-frontend | 50m            | unset        | 128Mi          | 128Mi        |
| geonetwork          | 200m           | 2000m        | 1512Mi         | 1512Mi       |
| ogc-api-records     | 100m           | unset        | 1024Mi         | 1024Mi       |
| elasticsearch       | 200m           | 2000m        | 1512Mi         | 1512Mi       |
| kibana              | 100m           | unset        | 1024Mi         | 1024Mi       |
| geoserver           | 100m           | 4000m        | 2048Mi         | 2048Mi       |
| header              | 50m            | unset        | 512Mi          | 512Mi        |
| mapstore            | 100m           | unset        | 1024Mi         | 1024Mi       |
| openldap            | 100m           | unset        | 1024Mi         | 1024Mi       |
| gateway             | 500m           | 4000m        | 1024Mi         | 1024Mi       |
| database (PG)       | 200m           | unset        | 512Mi          | 512Mi        |
| smtp                | 50m            | unset        | 64Mi           | 64Mi         |
In production, you need to configure according to your environment:

- The CPU requests and limits for geonetwork, elasticsearch, geoserver and the gateway. If you are using the built-in database, you may also set some CPU limits for it.
  It is not useful to set CPU limits for the other modules because they are neither frequently accessed nor CPU-intensive.
- The memory requests and limits for all the modules.
Here is a table with recommended values for a production environment with low usage:
|                     | CPU - Requests | CPU - Limits | RAM - Requests | RAM - Limits |
|---------------------|----------------|--------------|----------------|--------------|
| console             | 500m           | unset        | 2Gi            | 2Gi          |
| datafeeder          | 200m           | unset        | 2Gi            | 2Gi          |
| datafeeder-frontend | 100m           | unset        | 256Mi          | 256Mi        |
| geonetwork          | 2000m          | 4000m        | 3Gi            | 3Gi          |
| ogc-api-records     | 100m           | unset        | 2Gi            | 2Gi          |
| elasticsearch       | 1000m          | 2000m        | 4Gi            | 4Gi          |
| kibana              | 500m           | unset        | 2Gi            | 2Gi          |
| geoserver           | 2000m          | 4000m        | 4Gi            | 4Gi          |
| header              | 200m           | unset        | 1Gi            | 1Gi          |
| mapstore            | 1000m          | unset        | 2Gi            | 2Gi          |
| openldap            | 500m           | unset        | 2Gi            | 2Gi          |
| gateway             | 2000m          | 4000m        | 3Gi            | 3Gi          |
| database (PG)       | 2000m          | 4000m        | 2Gi            | 2Gi          |
| smtp                | 200m           | unset        | 128Mi          | 128Mi        |
Feel free to suggest modifications based on your use cases.
These values should work for an average production environment with low usage, but you are strongly advised to learn how to estimate resource consumption based on your data and platform traffic.