-
Notifications
You must be signed in to change notification settings - Fork 3
Slurm nodes installation under Ubuntu16.04 #2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Changes from all commits
a48702a
6632b26
f2c7ef1
b5893f8
0e8c1f2
8e54231
0b8945e
3510b4e
7e84382
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
This file was deleted.
This file was deleted.
This file was deleted.
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1 @@ | ||
| deb http://archive.ubuntu.com/ubuntu xenial main restricted universe multiverse |
This file was deleted.
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -9,4 +9,4 @@ | |
| # who wants to be able to SSH in as root via public-key on Biomedia servers. | ||
| # disable SSH for anybody but root | ||
| +:root:ALL | ||
| -:ALL EXCEPT (csg) dr jpassera bglocker:ALL | ||
| -:ALL EXCEPT (csg) (biomedia) dr jpassera bglocker jgao:ALL | ||
|
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. regular users shouldn't have SSH access to the cluster nodes, hence the previous config |
||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -16,7 +16,8 @@ CgroupReleaseAgentDir=/var/spool/slurm-llnl/cgroup | |
| ConstrainCores=yes | ||
| TaskAffinity=yes | ||
| #ConstrainRAMSpace=no | ||
| ### not used yet | ||
| #ConstrainDevices=no | ||
| #AllowedDevicesFile=/etc/slurm-llnl/cgroup_allowed_devices_file.conf | ||
|
|
||
| ConstrainSwapSpace=yes | ||
| AllowedSwapSpace=10.0 | ||
| # Not well supported until Slurm v14.11.4 https://groups.google.com/d/msg/slurm-devel/oKAUed7AETs/Eb6thh9Lc0YJ | ||
| #ConstrainDevices=yes | ||
|
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Should that be enabled and not commented out then? |
||
| #AllowedDevicesFile=/etc/slurm-llnl/cgroup_allowed_devices_file.conf | ||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,2 +1,4 @@ | ||
| Name=gpu File=/dev/nvidia0 | ||
| Name=gpu File=/dev/nvidia1 | ||
| Name=gpu File=/dev/nvidia2 | ||
| Name=gpu File=/dev/nvidia3 |
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -6,7 +6,7 @@ ArchiveSuspend=no | |
| #ArchiveScript=/usr/sbin/slurm.dbd.archive | ||
| #AuthInfo=/var/run/munge/munge.socket.2 | ||
| AuthType=auth/munge | ||
| DbdHost={{ pillar['slurm']['controller'] }} | ||
| DbdHost=biomedia03 | ||
|
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. no hardcoded values please => use Pillar |
||
| DbdPort=6819 | ||
| DebugLevel=info | ||
| PurgeEventAfter=1month | ||
|
|
@@ -16,10 +16,10 @@ PurgeSuspendAfter=1month | |
| LogFile=/var/log/slurm-llnl/slurmdbd.log | ||
| PidFile=/var/run/slurm-llnl/slurmdbd.pid | ||
| SlurmUser=slurm | ||
| #StorageHost={{ pillar['slurm']['controller'] }} | ||
| #StorageHost=biomedia03 | ||
| StorageHost=localhost | ||
| StorageType=accounting_storage/mysql | ||
| StoragePort=3306 | ||
| StorageLoc={{ pillar['slurm']['db']['name'] }} | ||
| StorageUser={{ pillar['slurm']['db']['user'] }} | ||
| StoragePass={{ pillar['slurm']['db']['password'] }} | ||
| StorageLoc=slurmdb | ||
| StorageUser=slurm | ||
| StoragePass=1BUy4eVv7X | ||
|
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. password in cleartext in the commit history... |
||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,3 @@ | ||
| get slurmd ver: | ||
| cmd.run: | ||
| - name: dpkg -s slurmd |grep "^Version:" > /tmp/local_slurm_ver.txt |
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,6 +1,8 @@ | ||
| Name=gpu File=/dev/nvidia0 | ||
| Name=gpu File=/dev/nvidia1 | ||
| Name=gpu File=/dev/nvidia2 | ||
| Name=gpu File=/dev/nvidia3 | ||
| Name=gpu File=/dev/nvidia4 | ||
| Name=gpu File=/dev/nvidia5 | ||
| Name=gpu File=/dev/nvidia6 | ||
| Name=gpu File=/dev/nvidia7 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe move that lower in an "Instructions" subsection instead of replacing what the formula contains?