v1.14.0: HTCondor highly available, HCLS blueprint
Key New Features
- HCLS blueprint supports running GROMACS on GPUs and has added several tutorials.
- Support for highly available HTCondor pools
- Job queue (SchedD) high availability remains experimental see README
Module Improvements
vpc: new option to enable firewall rule that allows tunneling of Windows Remote Desktop connectionsschedmd-slurm-gcp-v5-partition: all deprecated variables have been removed; these have migrated toschedmd-slurm-gcp-v5-node-grouphtcondor-configure:- job history will now include VM instance ID, zone and machine type
- VMs are now provisioned with minimally-permissioned IDTOKENs for their respective daemons (e.g. ADVERTISE_STARTD)
startup-script: installation script for Cloud Ops Agent on Debian platforms will retry when other processes are blocking apt operationshtcondor-execute-point: add a simple health check of port 9618 on any machine within the execute pointvm-instance: vm-instance can be named using both a prefix and the deployment name
Improvements
- Improved error message when YAML blueprint has syntax errors preventing it from being loaded
- Regular updates to Go and Python dependencies to address potential security vulnerabilities
- Fixed Open Front End (OFE) issue with static content (icons) not displaying properly
What's Changed
- Add Windows Remote Desktop IAP firewall rule by @tpdownes in #885
- Bump oauthlib from 3.2.1 to 3.2.2 in /community/front-end/ofe by @dependabot in #886
- Fix: having the same share name and local mount caused slurm failure by @nick-stroud in #887
- Implicitly add outputs to modules when they are being used across deployment groups by @tpdownes in #878
- Reorder validator list to test blueprint correctness first by @heyealex in #889
- Bump cryptography from 36.0.1 to 39.0.1 in /community/front-end/ofe by @dependabot in #890
- Adding example sbatch and config for Factor Xa protein by @nick-stroud in #888
- Update HCLS blueprint examples to run Gromacs w/ GPUs by @nick-stroud in #891
- Add support for highly available HTCondor Central Managers by @tpdownes in #892
- Remove deprecations from slurm-gcp v5 partition by @heyealex in #893
- Bump cryptography from 36.0.1 to 39.0.1 in /community/front-end/ofe by @tpdownes in #896
- Bump cryptography from 36.0.1 to 39.0.1 in /community/front-end/ofe by @dependabot in #894
- Ensure Open Front End dependabot updates target develop by @tpdownes in #897
- Fix: Always generate Batch instance template to avoid known at apply time error by @nick-stroud in #898
- Update sbatch to copy results to output bucket & minor tweaks by @nick-stroud in #895
- Add troubleshooting documentation for filestore share name exportfs bug by @nick-stroud in #899
- Bump github.com/hashicorp/hcl/v2 from 2.16.0 to 2.16.1 by @dependabot in #900
- Fix addlicense check in weekly image building by @tpdownes in #901
- Update HTCondor modules by @tpdownes in #902
- Bump django from 3.2.16 to 3.2.17 in /community/front-end/ofe by @dependabot in #905
- Bump cryptography from 36.0.1 to 39.0.1 in /community/front-end/ofe by @dependabot in #906
- Bump oauthlib from 3.2.1 to 3.2.2 in /community/front-end/ofe by @dependabot in #907
- Bump TFLint version in image to latest by @tpdownes in #903
- Add instructions to ssh to VM created by vm-instance by @nick-stroud in #880
- Address terraform_unused_required_providers errors by @tpdownes in #908
- Manage HTCondor yum repo configuration directly by @tpdownes in #904
- Bump github.com/aws/aws-sdk-go from 1.33.0 to 1.34.0 by @dependabot in #911
- Fail integration tests on validation warnings. by @mr0re1 in #910
- Set minimal scopes for HTCondor IDTOKENs by @tpdownes in #919
- Fix
cmd/root_test.gotest runs from linked Git worktrees. by @mr0re1 in #918 - OFE update 14/02/2023. by @ek-nag in #913
- Improve error message for yaml parsing failures by @heyealex in #923
- HTCondor job track machine information by @tpdownes in #924
- Update develop with release v1.13.0 by @nick-stroud in #928
- Add explicit output dependencies to HTCondor by @tpdownes in #925
- Pin terraform google provider to v4.53.1 by @nick-stroud in #929
- Fix root_test failure on MacOS by @mr0re1 in #932
- Update htcondor-configure README example snippet by @tpdownes in #935
- Bump github.com/hashicorp/go-getter from 1.6.2 to 1.7.0 by @dependabot in #930
- Add retries to cloud ops install by @heyealex in #933
- Bump django from 3.2.17 to 3.2.18 in /community/front-end/ofe by @dependabot in #922
- Remove unused error message by @tpdownes in #939
- Bump google.golang.org/api from 0.109.0 to 0.110.0 by @dependabot in #937
- Fix typo in hcls instructions command by @nick-stroud in #940
- Update hcls example to use lysozyme protein instead of factor xa by @nick-stroud in #942
- HTCondor Job Queue High Availability by @tpdownes in #934
- Refactor useModule by @tpdownes in #941
- Bugfix to HTCondor autoscaler script by @tpdownes in #945
- Update hcls spack builder to use c2 machine by @nick-stroud in #948
- Enable OS Login by default in HTCondor execute points by @tpdownes in #944
- Remove dependency on unused module by @mr0re1 in #947
- Add health check for HTCondor VMs by @tpdownes in #946
- HCLS tutorial update by @nick-stroud in #950
- Bump github.com/spf13/afero from 1.9.3 to 1.9.4 by @dependabot in #955
- Bump github.com/zclconf/go-cty from 1.12.1 to 1.13.0 by @dependabot in #954
- Allow VM instance name to include prefix and deployment name by @nick-stroud in #949
- Use HTCondor Python bindings in autoscaler by @tpdownes in #951
- Update hcls Lysozyme example to include visualization instructions by @nick-stroud in #958
- Unify validatorConfig check logic, reduce code duplication by @mr0re1 in #952
- Add warnings about tutorial cost and left behind resources by @nick-stroud in #957
- Lysozyme example update by @vjb-collab in #956
- Do not populate subnetwork_project as it is populated from subnetwork_self_link by @nick-stroud in #959
- Remove warning about schema changes by @mr0re1 in #960
- Re-order tasks in HTCondor configuration runner by @tpdownes in #961
- Mask sleep/suspend targets on chrome-remote-desktop to prevent shutdown by @nick-stroud in #966
- Version update by @mr0re1 in #988
- Release v1.14.0 by @mr0re1 in #989
New Contributors
- @vjb-collab made their first contribution in #956
Full Changelog: v1.13.0...v1.14.0