Есть две машины: одна head, другая node в slurm.
HEAD:
● slurmctld.service - Slurm controller daemon
Loaded: loaded (/usr/lib/systemd/system/slurmctld.service; enabled; vendor preset: disabled)
Active: active (running) since Sat 2020-04-25 13:48:30 EEST; 11min ago
Process: 726 ExecStart=/usr/bin/slurmctld $SLURMCTLD_OPTIONS (code=exited, status=0/SUCCESS)
Main PID: 738 (slurmctld)
Tasks: 7 (limit: 2361)
Memory: 8.2M
CGroup: /system.slice/slurmctld.service
└─738 /usr/bin/slurmctld
Apr 25 13:48:29 ASUS-X52DE systemd[1]: Starting Slurm controller daemon...
Apr 25 13:48:30 ASUS-X52DE systemd[1]: slurmctld.service: Supervising process 738 which is not our child. We'll most likely not notice when it exits.
Apr 25 13:48:30 ASUS-X52DE systemd[1]: Started Slurm controller daemon.
NODE:
● slurmd.service - Slurm node daemon
Loaded: loaded (/lib/systemd/system/slurmd.service; enabled; vendor preset: enabled)
Active: active (running) since Sat 2020-04-25 13:55:43 EEST; 1s ago
Docs: man:slurmd(8)
Process: 1356 ExecStart=/usr/sbin/slurmd $SLURMD_OPTIONS (code=exited, status=0/SUCCESS)
Main PID: 1358 (slurmd)
Tasks: 2
Memory: 1.5M
CGroup: /system.slice/slurmd.service
└─1358 /usr/sbin/slurmd
апр 25 13:55:43 bravo-cloud systemd[1]: Starting Slurm node daemon...
апр 25 13:55:43 bravo-cloud systemd[1]: slurmd.service: Can't open PID file /run/slurmd.pid (yet?) after start: Operation not permitted
апр 25 13:55:43 bravo-cloud systemd[1]: Started Slurm node daemon.
Впринципе оно запустилось, но такие ошибки остались. Здесь скорее проблема не в slurm, а systemd.
Давал права
chmod 777 /run/slurmd.pid
Ещё вопрос — как проверить работоспособность slurm?