v1.14.0: HTCondor highly available, HCLS blueprint

@mr0re1 released this 07 Mar 21:40
e7c0c24

Key New Features

  • HCLS blueprint now supports running GROMACS on GPUs and includes several new tutorials.
  • Support for highly available HTCondor pools
    • Job queue (SchedD) high availability remains experimental; see README

Module Improvements

  • vpc: new option to enable a firewall rule that allows tunneling of Windows Remote Desktop connections
  • schedmd-slurm-gcp-v5-partition: all deprecated variables have been removed; these have migrated to schedmd-slurm-gcp-v5-node-group
  • htcondor-configure:
    • job history will now include VM instance ID, zone and machine type
    • VMs are now provisioned with minimally-permissioned IDTOKENs for their respective daemons (e.g. ADVERTISE_STARTD)
  • startup-script: installation script for Cloud Ops Agent on Debian platforms will retry when other processes are blocking apt operations
  • htcondor-execute-point: add a simple health check of port 9618 on any machine within the execute point
  • vm-instance: instances can be named using both a prefix and the deployment name
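
The htcondor-execute-point health check above probes port 9618. As a rough illustration of what a TCP check on that port amounts to, here is a minimal Python sketch. It is not the module's implementation: the function name `port_is_open`, the default timeout, and the idea of running the probe by hand are all assumptions made for this example.

```python
import socket

# Port taken from the release note; everything else here is illustrative.
HTCONDOR_PORT = 9618


def port_is_open(host: str, port: int = HTCONDOR_PORT, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection performs the TCP handshake and raises OSError
        # (refused, unreachable, timed out) if the port cannot be reached.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

Calling `port_is_open("10.0.0.5")` (a placeholder address) from a machine that can reach the execute point would report whether anything is accepting connections on port 9618. It says nothing about whether the daemon behind the port is otherwise healthy.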

Improvements

  • Improved error message when a YAML blueprint has syntax errors that prevent it from being loaded
  • Regular updates to Go and Python dependencies to address potential security vulnerabilities
  • Fixed Open Front End (OFE) issue with static content (icons) not displaying properly

What's Changed

  • Add Windows Remote Desktop IAP firewall rule by @tpdownes in #885
  • Bump oauthlib from 3.2.1 to 3.2.2 in /community/front-end/ofe by @dependabot in #886
  • Fix: having the same share name and local mount caused slurm failure by @nick-stroud in #887
  • Implicitly add outputs to modules when they are being used across deployment groups by @tpdownes in #878
  • Reorder validator list to test blueprint correctness first by @heyealex in #889
  • Bump cryptography from 36.0.1 to 39.0.1 in /community/front-end/ofe by @dependabot in #890
  • Adding example sbatch and config for Factor Xa protein by @nick-stroud in #888
  • Update HCLS blueprint examples to run Gromacs w/ GPUs by @nick-stroud in #891
  • Add support for highly available HTCondor Central Managers by @tpdownes in #892
  • Remove deprecations from slurm-gcp v5 partition by @heyealex in #893
  • Bump cryptography from 36.0.1 to 39.0.1 in /community/front-end/ofe by @tpdownes in #896
  • Bump cryptography from 36.0.1 to 39.0.1 in /community/front-end/ofe by @dependabot in #894
  • Ensure Open Front End dependabot updates target develop by @tpdownes in #897
  • Fix: Always generate Batch instance template to avoid known at apply time error by @nick-stroud in #898
  • Update sbatch to copy results to output bucket & minor tweaks by @nick-stroud in #895
  • Add troubleshooting documentation for filestore share name exportfs bug by @nick-stroud in #899
  • Bump github.com/hashicorp/hcl/v2 from 2.16.0 to 2.16.1 by @dependabot in #900
  • Fix addlicense check in weekly image building by @tpdownes in #901
  • Update HTCondor modules by @tpdownes in #902
  • Bump django from 3.2.16 to 3.2.17 in /community/front-end/ofe by @dependabot in #905
  • Bump cryptography from 36.0.1 to 39.0.1 in /community/front-end/ofe by @dependabot in #906
  • Bump oauthlib from 3.2.1 to 3.2.2 in /community/front-end/ofe by @dependabot in #907
  • Bump TFLint version in image to latest by @tpdownes in #903
  • Add instructions to ssh to VM created by vm-instance by @nick-stroud in #880
  • Address terraform_unused_required_providers errors by @tpdownes in #908
  • Manage HTCondor yum repo configuration directly by @tpdownes in #904
  • Bump github.com/aws/aws-sdk-go from 1.33.0 to 1.34.0 by @dependabot in #911
  • Fail integration tests on validation warnings. by @mr0re1 in #910
  • Set minimal scopes for HTCondor IDTOKENs by @tpdownes in #919
  • Fix cmd/root_test.go test runs from linked Git worktrees. by @mr0re1 in #918
  • OFE update 14/02/2023. by @ek-nag in #913
  • Improve error message for yaml parsing failures by @heyealex in #923
  • HTCondor job track machine information by @tpdownes in #924
  • Update develop with release v1.13.0 by @nick-stroud in #928
  • Add explicit output dependencies to HTCondor by @tpdownes in #925
  • Pin terraform google provider to v4.53.1 by @nick-stroud in #929
  • Fix root_test failure on MacOS by @mr0re1 in #932
  • Update htcondor-configure README example snippet by @tpdownes in #935
  • Bump github.com/hashicorp/go-getter from 1.6.2 to 1.7.0 by @dependabot in #930
  • Add retries to cloud ops install by @heyealex in #933
  • Bump django from 3.2.17 to 3.2.18 in /community/front-end/ofe by @dependabot in #922
  • Remove unused error message by @tpdownes in #939
  • Bump google.golang.org/api from 0.109.0 to 0.110.0 by @dependabot in #937
  • Fix typo in hcls instructions command by @nick-stroud in #940
  • Update hcls example to use lysozyme protein instead of factor xa by @nick-stroud in #942
  • HTCondor Job Queue High Availability by @tpdownes in #934
  • Refactor useModule by @tpdownes in #941
  • Bugfix to HTCondor autoscaler script by @tpdownes in #945
  • Update hcls spack builder to use c2 machine by @nick-stroud in #948
  • Enable OS Login by default in HTCondor execute points by @tpdownes in #944
  • Remove dependency on unused module by @mr0re1 in #947
  • Add health check for HTCondor VMs by @tpdownes in #946
  • HCLS tutorial update by @nick-stroud in #950
  • Bump github.com/spf13/afero from 1.9.3 to 1.9.4 by @dependabot in #955
  • Bump github.com/zclconf/go-cty from 1.12.1 to 1.13.0 by @dependabot in #954
  • Allow VM instance name to include prefix and deployment name by @nick-stroud in #949
  • Use HTCondor Python bindings in autoscaler by @tpdownes in #951
  • Update hcls Lysozyme example to include visualization instructions by @nick-stroud in #958
  • Unify validatorConfig check logic, reduce code duplication by @mr0re1 in #952
  • Add warnings about tutorial cost and left behind resources by @nick-stroud in #957
  • Lysozyme example update by @vjb-collab in #956
  • Do not populate subnetwork_project as it is populated from subnetwork_self_link by @nick-stroud in #959
  • Remove warning about schema changes by @mr0re1 in #960
  • Re-order tasks in HTCondor configuration runner by @tpdownes in #961
  • Mask sleep/suspend targets on chrome-remote-desktop to prevent shutdown by @nick-stroud in #966
  • Version update by @mr0re1 in #988
  • Release v1.14.0 by @mr0re1 in #989

New Contributors

Full Changelog: v1.13.0...v1.14.0