Issue/Question: HTTP 500 errors logged in job.err when running a suite under an Active Directory account on RHEL #4492

@andrew-kelleher

Describe the bug
I'm currently testing Cylc on a Red Hat Enterprise Linux (RHEL) 8.1 virtual machine hosted in Azure.

I've installed Cylc 7.9.5, and have successfully submitted an example "Hello world" suite from the VM itself. All of the logs and outputs show that it ran successfully.

This works for any of the local accounts on the VM.

The VM is also joined to an Active Directory (AD) domain using Realm. The issue is that when I run the same suite from the VM using an AD account, I see the following error in job.err:

2021-11-01T12:29:20Z WARNING - Message send failed, try 1 of 7: Cannot connect: http://RHELAK01-vm:43081/put_messages: HTTP Error 500: Internal Server Error

This error is repeated 7 times, at 30-second intervals. Running cylc scan returns nothing, and the suite hangs until I delete the contact file.

Despite the errors, the suite runs successfully and job.out shows the expected output.
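
To confirm how many message send attempts failed for a given job, the warnings in job.err can be counted. The following is a minimal sketch, not something that was run as part of this report: `count_send_failures` is a hypothetical helper name, and the log file path is passed in as an argument rather than assumed. The example writes the warning quoted above to a temporary file purely to show the call.

```shell
#!/usr/bin/env bash
# Count "Message send failed" warnings in a job.err file.
# count_send_failures is a hypothetical helper, not part of Cylc.
count_send_failures() {
    # grep -c prints the number of matching lines (0 if there are none);
    # "|| true" hides grep's non-zero exit status when nothing matches.
    grep -c 'Message send failed' "$1" || true
}

# Example: write the single warning quoted in this report to a temp file.
tmp="$(mktemp)"
echo '2021-11-01T12:29:20Z WARNING - Message send failed, try 1 of 7: Cannot connect: http://RHELAK01-vm:43081/put_messages: HTTP Error 500: Internal Server Error' > "$tmp"
count_send_failures "$tmp"
rm -f "$tmp"
```

Against a real job.err from an affected run, a count of 7 would match the behaviour described above.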

Release version(s) and/or repository branch(es) affected?
The same behaviour has been seen with Cylc 7.9.5 and 7.9.2, on both RHEL 8.1 and 8.2.

Steps to reproduce the bug

  1. Deploy a RHEL VM
  2. Join the VM to an Active Directory domain using Realm
  3. Install Cylc
  4. Log in to the VM as an AD user and create the ~/.cylc/global.rc file
  5. Create a test suite.rc file and register it, e.g. cylc register mysuite /tut/suite.rc
  6. Run the suite, e.g. cylc run mysuite --host=localhost

Expected behavior

  1. Running cylc scan should show the currently running suite
  2. The job.err file should not contain HTTP 500 errors
  3. The suite should finish gracefully, without the contact file having to be deleted

Additional context
Troubleshooting steps completed so far:

  • disabled the VM's firewall
  • disabled SELinux
  • ensured the AD user's home folder is on the local VM /home
  • confirmed ~/.cylc/global.rc matches across all home folders

As far as I can tell, there's no difference in configuration between the local VM accounts and the AD accounts; for example, they're members of the same local groups.
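
One way to check this more systematically is to print the same account and host details as each user and diff the results. The following is a minimal sketch, not a diagnosis of the problem: `account_summary` is a hypothetical helper name, and the choice of fields (user name form, group list, passwd entry, host name resolution) is only an assumption about what might differ for an AD account.

```shell
#!/usr/bin/env bash
# Print account and host details that might plausibly differ between a
# local account and an AD account. account_summary is a hypothetical
# helper; run it once as each user and compare the output.
account_summary() {
    local user
    user="$(id -un)"
    echo "user=${user}"
    echo "uid=$(id -u)"
    echo "groups=$(id -Gn)"
    echo "home=${HOME}"
    # passwd entry as resolved through NSS (local files, or SSSD for AD)
    echo "passwd=$(getent passwd "${user}" 2>/dev/null || echo 'not found')"
    echo "hostname=$(hostname)"
    # Fully qualified name; falls back to a marker if it cannot be resolved
    echo "fqdn=$(hostname -f 2>/dev/null || echo 'unresolved')"
}

account_summary
```

For example, running `account_summary > /tmp/local.txt` as a local user and `account_summary > /tmp/ad.txt` as an AD user (both file names are illustrative), then `diff /tmp/local.txt /tmp/ad.txt`, would show whether the user name format or host name resolution differs between the two.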

This may not be a bug per se, but are there any known compatibility issues when submitting a job as an AD user, or is any additional configuration required?


Labels: bug ("Something is wrong :(")
