-
Notifications
You must be signed in to change notification settings - Fork 95
Issue/Question: HTTP 500 errors logged in job.err when running a suite under an Active Directory account on RHEL #4492
Description
Describe the bug
I'm currently testing Cylc on a RedHat 8.1 virtual machine hosted in Azure.
I've installed Cylc 7.9.5, and have successfully submitted an example "Hello world" suite from the VM itself. All of the logs and outputs show that it ran successfully.
This works for any of the local accounts on the VM.
The VM is also Active Directory joined using Realm. The issue I'm having is that when running the same suite from the VM using an AD account I'm seeing the following errors in job.err -
2021-11-01T12:29:20Z WARNING - Message send failed, try 1 of 7: Cannot connect: http://RHELAK01-vm:43081/put_messages: HTTP Error 500: Internal Server Error
This error is repeated 7 times, every 30 seconds. Running cylc scan returns nothing and the suite gets stuck in a hung state until I delete the contact file.
Despite the errors, the suite has run successfully and job.out is showing the expected output.
Release version(s) and/or repository branch(es) affected?
Same behaviour has been seen on Cylc 7.9.5 and 7.9.2, and RHEL 8.1 and 8.2.
Steps to reproduce the bug
- Deploy a RHEL VM,
- Join VM to an Active Directory domain using Realm
- Install Cylc
- Login to the VM as an AD user and create the
~/.cylc/global.rcfile - Create a test suite.rc file and register it i.e.
cylc register mysuite /tut/suite.rc - Run the suite i.e.
cylc run mysuite --host=localhost
Expected behavior
- Running
cylc scanshould show the currently running suite - the
job.errfile should not be showing HTTP 500 errors - The suite should gracefully finish and not require the contact file deleting
Additional context
Troubleshooting steps completed so far include -
- disabled the VM's firewall
- disabled SELinux
- ensured the AD user's home folder is on the local VM
/home - confirmed
~/.cylc/global.rcmatches across all home folders
As far as I can tell there's no difference in configuration between the local VM accounts and the AD accounts, they're members of the same local groups etc.
This may not be a bug per se but are there any known compatibility issues when submitting a job as an AD user? Or additional configuration required?