Added the running rosa spots ability #349

Open
athiruma wants to merge 1 commit into cloud-bulldozer:master from athiruma:add-rosa-spot-ability

@athiruma
Type of change

  • Refactor
  • New feature
  • Bug fix
  • Optimization
  • Documentation Update

Description

Spawn the ROSA worker nodes with the help of machine pools. The default cluster is created first with 3(M+W+I), i.e. 3 master, 3 worker, and 3 infra nodes.
The remaining workers (WorkerNodes - 3) are then added as Spot worker nodes.
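
As a rough sketch of the approach described above (not the code in this PR), the Spot workers could be added as a separate machine pool once the default cluster exists. The helper name, cluster name, pool name, and instance type are hypothetical placeholders. The function only prints the `rosa create machinepool` command, so it can be inspected without a ROSA login.

```shell
#!/usr/bin/env bash
# Sketch only: prints the machine pool command instead of running it.
# spot_machinepool_cmd, the pool name and the instance type are hypothetical.
DEFAULT_WORKER_NODES=3   # workers that come with the default cluster

spot_machinepool_cmd() {
    local cluster_name=$1
    local worker_count=$2
    # Spot workers are whatever is left after the 3 default workers.
    local spot_replicas=$((worker_count - DEFAULT_WORKER_NODES))
    if [ "$spot_replicas" -le 0 ]; then
        echo "no spot workers needed" >&2
        return 1
    fi
    echo rosa create machinepool \
        --cluster "$cluster_name" \
        --name spot-workers \
        --replicas "$spot_replicas" \
        --instance-type m5.2xlarge \
        --use-spot-instances
}

# With 9 requested workers, the pool is sized at 6 replicas.
spot_machinepool_cmd my-rosa-cluster 9
```

Printing the command rather than executing it keeps the sketch safe to run anywhere; dropping the `echo` would turn it into a real call.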

Related Tickets & Documents

  • Related Issue #
  • Closes #

Checklist before requesting a review

  • I have performed a self-review of my code.
  • If it is a core feature, I have added thorough tests.

Testing

  • Please describe the System Under Test.
  • Please provide detailed steps to perform tests related to this code change.
    • Set enable_spot_workers=true in the ROSA install config file.
  • How were the fix/results from this change verified? Please provide relevant screenshots or results.
    • Ran Airflow in a playground created on the sailplane cluster.

@athiruma
Author

@krishvoor ptal. 👋

@athiruma force-pushed the add-rosa-spot-ability branch from c4bdeba to 0279cb3 on August 11, 2023 21:13
if [ "$SPOT_POOL_READY" == "true" ]; then
    # Node count is number of workers pool + 3 infra
    NODES_COUNT=$(($2+3))
else
Member

Which condition needs NODES_COUNT=$2?

Author

@athiruma Aug 13, 2023

Before SPOT_POOL_READY is set to true, I need to check whether the Spot nodes are ready. In that case NODES_COUNT=$2, i.e. the number that was passed to _wait_for_nodes_ready.
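
The two cases discussed in this thread can be sketched as a small standalone helper. This is an illustration under assumptions, not the PR's code: `expected_nodes_count` is a hypothetical name, and the `else` branch (`NODES_COUNT=$2`) is inferred from the comments here because the diff excerpt stops at `else`.

```shell
#!/usr/bin/env bash
# Sketch of the branch under discussion; expected_nodes_count is hypothetical.
expected_nodes_count() {
    local workers=$1
    if [ "${SPOT_POOL_READY:-false}" = "true" ]; then
        # Spot pool is up: worker pool + 3 infra nodes.
        echo $((workers + 3))
    else
        # Spot pool not ready yet: wait only for the count passed in
        # (assumed from the review discussion, not shown in the diff).
        echo "$workers"
    fi
}

SPOT_POOL_READY=false
expected_nodes_count 3   # prints 3
SPOT_POOL_READY=true
expected_nodes_count 9   # prints 12
```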

if [ "$SPOT_POOL_READY" == "true" ]; then
    _wait_for_nodes_ready $1 ${COMPUTE_WORKERS_NUMBER} "node-role.kubernetes.io/worker"
else
    DEFAULT_WORKER_NODES=3
Member

Shouldn't the default always be openshift_worker_count?

Author

Take openshift_worker_count=9 as an example.
The workflow will be:
default cluster = 3(M+W+I)
The default cluster already provides 3 worker nodes, so the remaining 6 nodes would be Spot nodes.
That is why I added DEFAULT_WORKER_NODES=3: to verify that those 3 nodes are in the Ready state before the Spot pool is added.
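
The readiness check described here could look roughly like the polling loop below. It is a hypothetical stand-in for the PR's `_wait_for_nodes_ready`, whose body is not shown in this thread. The function name, the `LIST_NODES_CMD` override, and the retry defaults are all assumptions made for illustration.

```shell
#!/usr/bin/env bash
# Hypothetical stand-in for _wait_for_nodes_ready; not the PR's implementation.
# LIST_NODES_CMD can be overridden so the sketch can be tried without a cluster.
LIST_NODES_CMD=${LIST_NODES_CMD:-"oc get nodes --no-headers -l node-role.kubernetes.io/worker"}

wait_for_ready_nodes() {
    local expected=$1
    local retries=${2:-30}
    local interval=${3:-10}
    local attempt=0
    local ready
    while [ "$attempt" -lt "$retries" ]; do
        # Column 2 of `oc get nodes` is STATUS; count exact "Ready" entries.
        ready=$($LIST_NODES_CMD 2>/dev/null | awk '$2 == "Ready" {n++} END {print n+0}')
        if [ "$ready" -ge "$expected" ]; then
            return 0
        fi
        attempt=$((attempt + 1))
        sleep "$interval"
    done
    return 1
}

# Example call (commented out because it needs a logged-in cluster):
# wait_for_ready_nodes 3    # wait for the 3 default workers
```

Matching the STATUS column exactly means nodes reported as `Ready,SchedulingDisabled` or `NotReady` are not counted.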
