Add grpcPodConfig to CatalogSource to prevent TRANSIENT_FAILURE #14911

Open
shyRozen wants to merge 1 commit into red-hat-storage:master from shyRozen:fix-catalogsource-grpc-pod-config

@shyRozen
Contributor

The catalog pod crash-loops because it only gets 50Mi memory by default, which is insufficient for the 78MB configs + 260MB cache. The startup probe kills it before the gRPC server becomes ready.

Adding memoryTarget (512Mi) and extractContent config allows the catalog to load via init containers and gives the process enough memory.
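
As a rough sanity check of the numbers above, the sketch below compares the quoted sizes against the default and proposed memory values and prints the shape of the `grpcPodConfig` stanza. It assumes the whole configs-plus-cache footprint must fit in memory, which is a simplification. The `catalogDir` and `cacheDir` paths are illustrative assumptions, not values taken from this PR's diff.

```python
import json

MIB = 1024 * 1024  # bytes in one mebibyte (the "Mi" suffix)
MB = 1000 * 1000   # bytes in one megabyte (the "MB" suffix)

# Sizes quoted in the PR description.
default_memory = 50 * MIB
memory_target = 512 * MIB
required = 78 * MB + 260 * MB  # configs + cache


def fits(limit_bytes: int, needed_bytes: int) -> bool:
    """Return True when the needed footprint is within the limit."""
    return needed_bytes <= limit_bytes


# Shape of the stanza described above. The two directory paths are
# assumptions for illustration only; use the paths of the actual index image.
grpc_pod_config = {
    "memoryTarget": "512Mi",
    "extractContent": {
        "catalogDir": "/configs",   # assumed path
        "cacheDir": "/tmp/cache",   # assumed path
    },
}

print(fits(default_memory, required))  # False: 338 MB does not fit in 50Mi
print(fits(memory_target, required))   # True: 512Mi is about 537 MB
print(json.dumps({"spec": {"grpcPodConfig": grpc_pod_config}}, indent=2))
```

The printed JSON maps one-to-one onto the YAML keys under `spec.grpcPodConfig` of a CatalogSource.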

@shyRozen shyRozen requested a review from a team as a code owner April 15, 2026 22:34
The catalog pod crash-loops because it only gets 50Mi memory by default,
which is insufficient for the 78MB configs + 260MB cache. The startup
probe kills it before the gRPC server becomes ready.

Adding memoryTarget (512Mi) and extractContent config allows the catalog
to load via init containers and gives the process enough memory.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Signed-off-by: Shay Rozen <shay.rozen@gmail.com>
@shyRozen shyRozen force-pushed the fix-catalogsource-grpc-pod-config branch from a545849 to 1848f9f Compare April 15, 2026 22:37
@clacroix12 clacroix12 added the Needs Deployment Testing label (Needs Deployment and Some Tier1 Testing) Apr 15, 2026
@clacroix12 clacroix12 requested a review from a team April 15, 2026 22:37
Contributor

@clacroix12 clacroix12 left a comment

This LGTM, so I'm approving; however, I would like to see a deployment test using this change to ensure functionality.

Also, do you think this is something we should consider for other catalogs, such as the FDF catalog?

@shyRozen
Contributor Author

> This LGTM, so I'm approving; however, I would like to see a deployment test using this change to ensure functionality.
>
> Also, do you think this is something we should consider for other catalogs, such as the FDF catalog?

@clacroix12 This config is already used by other CatalogSources (certified-operators and community-operators).
I think we can also try it on FDF, although I have never deployed it, so I don't know whether it hits TRANSIENT_FAILURE errors. Let's do a pilot (after the deployment test). I think the transient failures we've gotten used to can become a thing of the past. :)

Contributor

@dahorak dahorak left a comment

I agree with Coady that it would be worth updating all the other CatalogSource templates we have in ocs-ci:

ocs_ci/templates/acm-deployment/acm_catalogsource.yaml
ocs_ci/templates/acm-deployment/mce_catalogsource.yaml
ocs_ci/templates/cnv-deployment/catalog-source.yaml
ocs_ci/templates/fusion-aas/catalogsource.yaml.j2
ocs_ci/templates/fusion/catalog-source.yaml
ocs_ci/templates/fusion/catalog-source.yaml.j2
ocs_ci/templates/hyperconverged-deployment/hyperconverged-catsrc.yaml
ocs_ci/templates/mce-deployment/mce_catsrc.yaml
ocs_ci/templates/ocp-deployment/qe-app-registry-catalog-source.yaml
ocs_ci/templates/ocs-deployment/brew-catalog.yaml
ocs_ci/templates/ocs-deployment/local-storage-optional-operators.yaml
ocs_ci/templates/ocs-deployment/multicluster/submariner_downstream_unreleased_catsrc.yaml
ocs_ci/templates/ocs-deployment/provider-mode/ocs-catalogsource.yaml
ocs_ci/templates/operators/unreleased-operator-catalog.yaml
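
One way to keep a list like this current is a plain-text scan of the templates directory. The sketch below is a hypothetical helper, not part of ocs-ci. It matches on the literal strings `kind: CatalogSource` and `grpcPodConfig` rather than parsing YAML, because the `.j2` templates may not be valid YAML before rendering. A template with unusual spacing or quoting would be missed.

```python
from pathlib import Path
import tempfile


def templates_missing_grpc_pod_config(root: Path) -> list[str]:
    """Return CatalogSource templates under root that never mention grpcPodConfig.

    Hypothetical helper for illustration; uses a text match, not a YAML parse.
    """
    missing = []
    for path in sorted(root.rglob("*")):
        if not path.is_file() or not path.name.endswith((".yaml", ".yaml.j2")):
            continue
        text = path.read_text(encoding="utf-8", errors="replace")
        if "kind: CatalogSource" in text and "grpcPodConfig" not in text:
            missing.append(path.relative_to(root).as_posix())
    return missing


if __name__ == "__main__":
    # Small self-contained demo on a throwaway directory.
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        (root / "old.yaml").write_text("kind: CatalogSource\nspec:\n  sourceType: grpc\n")
        (root / "new.yaml").write_text(
            "kind: CatalogSource\nspec:\n  grpcPodConfig:\n    memoryTarget: 512Mi\n"
        )
        for name in templates_missing_grpc_pod_config(root):
            print(name)
```

Pointing the helper at `ocs_ci/templates` in a checkout should give a starting list to review by hand.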

@openshift-ci

openshift-ci bot commented Apr 16, 2026

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: clacroix12, dahorak, shyRozen

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

Labels

lgtm, Needs Deployment Testing (Needs Deployment and Some Tier1 Testing), size/XS