Skip to content

Releases: GoogleCloudPlatform/cluster-toolkit

v1.4.1: Fix Application Specific Tutorials

09 Sep 18:49
b8106eb

Choose a tag to compare

What's Changed

Full Changelog: v1.4.0...v1.4.1

v1.4.0: Advanced networking for Slurm V5, Version Updates, & Bug Fixes

07 Sep 23:29
6940ca6

Choose a tag to compare

Improvements

Version updates

What's Changed

New Contributors

Full Changelog: v1.3.0...v1.4.0

v1.3.0: Application specific tutorials for Gromacs, Openfoam, & WRF

24 Aug 19:24
8525bf2

Choose a tag to compare

Key New Features

  • Application specific tutorials for Gromacs, Openfoam, & WRF that walk through running real workloads.

New Examples

Resource Improvements

  • Slurm V5 controller and login node support enabling public ip addresses.
  • slurm-gcp-v5-*: Remove requirement to set the slurm_cluster_name in slurm-gcp-v5 modules.

What's Changed

Full Changelog: v1.2.1...v1.3.0

v1.2.1: Improved startup time when NFS mounting, Slurm V5 zone preferences, testing improvements, & bug fixes

11 Aug 05:19
238f9b9

Choose a tag to compare

Key New Features

Improvements

  • NFS client installation time on instance startup reduced by 96%.
  • Cloud Batch integration testing and other integration testing improvements.

Version updates

What's Changed

Full Changelog: v1.2.0...v1.2.1

v1.2.0: HTCondor autoscaling, explicitly defined IP ranges in VPC module

25 Jul 21:50
b6f46fe

Choose a tag to compare

Key New Features

  • Autoscaling in HTCondor.
  • Explicitly defined IP ranges in the VPC module.

New Resources

  • htcondor-execute-point: Creates an instance template and Managed Instance Group (MIG) for creating autoscaled compute nodes. Outputs a runner for configuring the autoscaler to scale the MIG.

Improvements

  • Allow explicitly defined IP ranges in the VPC module.
  • wait-for-startup module will wait for startup script completion when VMS are replaced.
  • Add autoscaler to HTCondor modules.
  • Docker support for HTCondor nodes.
  • HTCondor Pool example added to community examples.
  • HTCondor tutorial added.

Deprecations

  • The following variables in the VPC module are deprecated: primary_subnetwork, additional_subnetworks, subnetwork_size. See the VPC README for more information.

What's Changed

New Contributors

Full Changelog: v1.1.0...v1.2.0

v1.1.0: Google Cloud Batch, Slurm V5, Jumbo Frames, and Advanced Networking in Slurm V4

12 Jul 05:22
dffc869

Choose a tag to compare

Key New Features

  • Google Cloud Batch support: read more.
  • Slurm V5 support & example blueprint.
  • Slurm V4 partitions now support advanced networking features such as gVNIC adapters and high egress (Tier 1) bandwidth.
  • Slurm V4 partitions now support placement groups for all Compute Engine machine families that support them (A2, C2, C2D, N2, N2D).
  • VPC module supports jumbo frames for higher bandwidth and lower latency performance.

New Resources

Version updates

What's Changed

Full Changelog: v1.0.0...v1.1.0

v1.0.0: General Availability

27 May 03:02
b0a5f6f

Choose a tag to compare

Key New Features

  • Support for DAOS
  • Shared VPC example
  • Doc updates

Version updates

What's Changed

New Contributors

Full Changelog: v0.7.2-alpha...v1.0.0

v0.7.2-alpha: New features in `vm-instance`, updated documentation

17 May 17:13
cc87651

Choose a tag to compare

Key New Features

  • Spot provisioning and threads_per_core support in VM Instance module
  • Updated and improved documentation

Resource Improvements

  • vm-instance: Spot provisioning support
  • vm-instance: Option to set threads_per_core to enable or disable Simultaneous Multithreading (SMT)
  • vpc: Better support for supplying custom primary subnetwork
  • vpc: Better dependency tracking
  • startup-scripts: Better dependency tracking

Improvements

  • Updated Documentation, improvements to navigation in large README files
  • make install and make install-user for installing the binary globally or locally.
  • Issue template added for reporting bugs in the HPC Toolkit

Bug Fixes

  • Fixed: Terraform state doesn't update when overwriting a blueprint

What's Changed

Full Changelog: v0.7.1-alpha...v0.7.2-alpha

v0.7.1-alpha: Documentation Additions, Updated Defaults, Bug Fixes, and Intel Select Example

10 May 20:50
9eaa870

Choose a tag to compare

Key New Features

  • Improved documentation.
  • Improved defaults on Filestore and Slurm.
  • Additional modules allow specifying project_id independently from the global project_id.
  • Spack install dir updated to avoid conflict with Slurm.
  • Internal schema rename to match changes released in 0.7.0-alpha.

New Examples

What's Changed

New Contributors

Full Changelog: v0.7.0-alpha...v0.7.1-alpha

v0.7.0-alpha: Updated schema and component names, added community folder, new command line options

03 May 21:20
20481b4

Choose a tag to compare

Key New Features

  • Updated HPC Toolkit naming and schema with significant interface changes (read more below)
  • Moved community contributions to community folder
  • Overwrite flag (-w) optionally overwrites existing deployment folder while maintaining terraform state
  • Terraform Backend can be configured from command line (--backend-configs)
  • Recognition of the output of ghpc as a deployment, rather than blueprint: ghpc create now creates a folder with deployment_name instead of blueprint_name

Naming changes

  • Config YAML or Input YAML is now referred to as the HPC Blueprint
  • Resource Groups are now Deployment Groups
  • Blueprint Folder is now Deployment Folder
  • Resources are now HPC Modules
  • simple-instance is now vm-instance - Underlying module is the same

Blueprint YAML Schema Update

  • vars.deployment_name is used by ghpc for creating the deployment folder name, rather than blueprint_name
  • resource_groups is now deployment_groups
  • resources is now modules, and modules are stored in modules/ and community/modules/
  • Sourcing embedded modules starts with modules or community/modules

Example:

deployment_group:  # Was resource_groups:
  modules:  # Was Resources
  - source: modules/... # Was `- source: resources/...`

Improvements

  • Addition of "Community" folder
  • Overwrite option (-w) for creating a deployment in the same directory, retaining the terraform state and keeping a backup of one prior deployment.
  • Improved instructions for deploying after create
  • Support for startup-script with Packer resource
  • Command Line Flag for specifying terraform state backend config (--backend-config)
  • More reliable project ID validation

What's Changed

Full Changelog: v0.6.0-alpha...v0.7.0-alpha