forked from stanford-rc/slurm-spank-lua
-
Notifications
You must be signed in to change notification settings - Fork 1
/
Copy pathspank-lua.8
432 lines (374 loc) · 14.2 KB
/
spank-lua.8
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
.TH "spank-lua" "8" "2012-02-18" "spank-lua" "lua bindings for Slurm spank framework"
.SH "NAME"
\fBspank-lua\fR \- Lua bindings for Slurm in a SPANK plugin
.SH "DESCRIPTION"
This manual page describes the \fBspank-lua\fR lua bindings for
Slurm, which allow lua scripts to be embedded in Slurm job launch
code via the \fBspank\fR(8) interface.
The \fBlua.so\fR spank plugin for Slurm allows lua scripts to take the
place of compiled C shared objects in the Slurm \fBspank\fR(8) framework.
All the functionality of the C SPANK API is exported to lua via this plugin,
which loads one or more scripts and executes lua functions during the appropriate
Slurm phase (as described in the \fBspank\fR(8) manpage).
This plugin provides Slurm administrators with the ability to easily
expose new options to the Slurm user commands \fBsrun\fR(1),
\fBsalloc\fR(1), and \fBsbatch\fR(1) commands, modify the Slurm job launch
environment, and execute custom code from within the context of Slurm
all with the power of a fast interpreted language. For one example of
\fBspank-lua\fR usage scenario, see the \fIEXAMPLES\fR section below.
.SH "CONFIGURATION"
The lua SPANK plugin is configured like other SPANK plugins by adding
a line to the plugstack.conf like:
.nf
required lua.so [\fIOPTIONS\fR] [[\fIGLOB\fR] [\fISCRIPT OPTIONS\fR]...
.fi
where \fIGLOB\fR is a path to a lua script to load or a glob which
may load multiple lua scripts, \fIOPTIONS\fR is a list of options for
\fBspank-lua\fR itself, and \fISCRIPT OPTIONS\fR is a space-separated list
of extra options to pass to all lua scripts loaded by this line. These
options are passed to lua scripts in the \fBargs\fR member of the
\fBspank\fR handle. For example:
.nf
required lua.so /etc/slurm/lua.d/* verbose=1
.fi
Would load all files in /etc/slurm/lua.d and would pass an
\fBargs\fR table in the \fBspank\fR handle passed to lua scripts
such that
.nf
\fBspank.args\fR = { "verbose=1" }
.fi
At this time, the only supported \fIOPTIONS\fR for \fBspank-lua\fR are
\fBfailonerror\fR, which enabled fatal errors for script loading and
parsing errors, instead of just skipping the current lua script.
.SH "SPANK LUA API"
As with compiled spank plugins, spank lua scripts may be called
from one or more of many contexts during the allocation, launch,
and exit of a Slurm job. Which of these contexts a spank-lua
script is called from depends on which of the callbacks listed
in the \fBspank\fR(8) manpage are defined by the loaded script.
These callbacks are listed here by name for reference, but please
see \fBspank\fR(8) manpage for more information on the context
of each hook listed:
.TP 8
.B slurm_spank_init
Called at initialization in all contexts (except slurmd).
.TP
.B slurm_spank_slurmd_init
Called at slurmd initialization.
.TP
.B slurm_spank_job_prolog
Called just before job prolog.
.TP
.B slurm_spank_init_post_opt
Called post option processing in all contexts.
.TP
.B slurm_spank_local_user_init
Called in local context post-job-allocation but before job launch.
.TP
.B slurm_spank_user_init
Called after privileges have been temporarily dropped on remote
nodes (called once per node).
.TP
.B slurm_spank_task_init_privileged
Called for each task just after fork, but before elevated privileges
are dropped permanently.
.TP
.B slurm_spank_task_init
Called once for each task just before execve(2).
.TP
.B slurm_spank_task_post_fork
Called for each task from parent process after fork(2) is complete.
Due to the fact that \fBslurmd\fR does not exec any tasks until all
tasks have completed fork(2), this call is guaranteed to run before
the user task is executed. (remote context only)
.TP
.B slurm_spank_task_exit
Called for each task as its exit status is collected by Slurm.
(remote context only)
.TP
.B slurm_spank_job_epilog
Called just before the job epilog.
.TP
.B slurm_spank_slurmd_exit
Called in slurmd just before the daemon exits.
.TP
.B slurm_spank_exit
Called once just before \fBslurmstepd\fR exits in remote context.
In local context, called before \fBsrun\fR exits.
.LP
One or more of these hooks may exist in each lua script
loaded by the \fBspank-lua\fR plugin. Each hook is run at the
appropriate time, and is passed a single argument, the \fBspank\fR
handle which is described below.
.LP
These scripts are allowed to return a single error code back
to Slurm. A value of -1 indicates failure (See \fBSPANK.FAILURE\fR
below), and anything else is considered to indicate success. No
return value is accepted is considered to indicate success.
.SH "SPANK-LUA HANDLE"
.LP
Each time one of the spank functions is called from a lua
script, the spank-lua plugin sets up a \fBspank\fR table to pass
as the first and only argument to that function. This table
serves as the spank handle for the duration of the call, and
exports several methods that may be used by the script, including
.TP 8
.B context
The value of \fBspank.context\fR indicates the current context in which
the spank plugin is running, and may have the string value
"\fIlocal\fR", "\fIallocator\fR", or "\fIremote\fR".
.TP
.B args
If any extra arguments were included in the \fBplugstack.conf\fR entry
for \fBlua.so\fR, these are collected and included in \fBspank.args\fR
array. (Remember that lua arrays are indexed starting at 1, not 0).
.TP
.B slurm_version
Version of Slurm as a string, e.g. "\fB2.1.0\fR".
.TP
.BI get_item " (ITEM)"
Return the spank item indicated by \fIITEM\fR, where
ITEM is the string representation of the spank_item_t enumerated type
as named in slurm/spank.h. (For example, the Slurm job stepid
would be obtained via
.nf
local stepid = spank:get_item ("\fBS_JOB_STEPID\fR")
.fi
For items that return multiple values, spank-lua will return a table,
for example, spank:get_item ("S_JOB_ENV") returns a lua table of
the job environment with t[name] = val for each environment entry
\fBname\fR=\fBval\fR.
For some specific items, \fBget_item\fR will return additional
values following the raw item value. Currently, the only item
for which this is true is \fBS_TASK_EXIT_STATUS\fR:
.nf
\fIstatus\fR, \fIexitcode\fR, \fItermsig\fR, \fIcore\fR =
spank:get_item ("\fBS_TASK_EXIT_STATUS\fR")
.fi
Where \fIstatus\fR is the raw exit status as reported by \fBwait\fR(2),
\fIexitcode\fR is the exit code as returned by \fBWEXITSTATUS\fR(\fIstatus\fR)
(or \fInil\fR if not \fBWIFEXITED\fR(\fIstatus\fR)), and \fItermsig\fR and
\fIcore\fR hold the values of \fBWTERMSIG\fR(\fIstatus\fR) and
\fBWCOREDUMP\fR(\fIstatus\fR) respectively, or \fInil\fR if not
\fBWIFSIGNALED\fR(\fIstatus\fR). For example, to print information
about the exiting task, you might run:
.nf
local fmt = string.format
local msg
local _, exitcode, termsig, core =
spank:get_item ("S_TASK_EXIT_STATUS")
if exitcode then
msg = fmt ("Exited with code %d\\n", exitcode)
elseif termsig then
local s = ""
if core then s = " (Core dumped)" end
msg = fmt ("Killed by signal %d%s\\n", termsig, s)
end
print (msg)
.fi
.TP
.BI getenv " (name)"
Return the value of environment variable \fIname\fR from the job's
environment. As with regular \fBspank\fR plugins, this call is only
necessary in remote context, since the job's environment in local
context can be obtained via os.getenv(). For example:
.nf
local ldpath = spank:getenv ("LD_LIBRARY_PATH")
.fi
.TP
.BI setenv " (name, value, [overwrite])"
Set the environment variable \fIname\fR to \fIvalue\fR in the job's
environment, overwriting the old value of \fI name\fR if the optional
boolean parameter \fIoverwrite\fR is true. For example:
.nf
if not spank:setenv ("LD_LIBRARY_PATH", ldpath, 1) then
print ("Failed to set LD_LIBRARY_PATH")
end
.fi
.TP
.BI unsetenv " (name)"
Unset the environment variable \fIname\fR from the job's environment
if it exists.
.TP
.BI job_control_setenv " (name, value, [overwrite])"
Like \fBsetenv\fR, but for the \fIjob control\fR environment, which
is passed to the Slurm control scripts like epilog, prolog, SlurmctldProlog,
and SlurmctldEpilog. Variables set in this way have the string SPANK_
prepended to them before they are set in the environment of these
scripts.
.TP
.BI job_control_getenv " (name)"
Same as \fBgetenv\fR, but for the job control environment.
.TP
.BI job_control_unsetenv " (name)"
Same as \fBunsetenv\fR, but for the job control environment.
.TP
.BI register_option " (spankopt)"
Registers a single option \fIspankopt\fR to Slurm. See \fISPANK OPTIONS\fR
below for the format of the \fIspankopt\fR parameter. This method is
only valid from the \fBslurm_spank_init\fR hook in local and allocator
context.
.TP
.BI getopt " (spankopt) "
Checks for supplied option \fIspankopt\fR in the current context. This
is an alternative to using a callback function and global variable in
normal spank callbacks. This function can only be used after options have
been processed (i.e. after \fBslurm_spank_init\fR in most contexts). In
\fBjob_script\fR context, \fBspank:getopt\fR is the \fIonly\fR supported
method to check for supplied options (that is, from \fBslurm_spank_job_prolog\fR
and \fBslurm_spank_job_epilog\fR).
.LP
In addition to the \fBspank\fR handle passed to spank-lua functions,
spank-lua scripts also have access to utility functions and values in
a global \fBSPANK\fR table. The members of this table include:
.TP 8
.B SPANK.log_*
Utility functions for logging messages using Slurm's log facility.
Included log functions are \fBlog_error\fB, \fBlog_info\fR, \fBlog_verbose\fR,
\fBlog_debug\fR. These functions all take a lua format string and a
variable number of arguments, for example:
.nf
SPANK.log_error ("%s: %s", myname, errormsg)
.fi
.TP
.B SPANK.SUCCESS
Return value to indicate a successful return from a spank callback. That is,
lua functions should return \fBSPANK.SUCCESS\fR on successful completion.
.TP
.B SPANK.FAILURE
Return value indicating failure of a spank-lua function.
.LP
.SH "SPANK OPTIONS"
Exporting options to \fBsrun\fR(1), \fRsalloc\fR(1) and \fBsbatch\fR(1)
may be accomplished with spank-lua in a similar manner as a normal \fBspank\fR
plugin. Each option to be exported is set in a lua table such as:
.nf
spankopt = {
name = STRING,
usage = STRING,
val = NUM,
cb = STRING,
has_arg = BOOLEAN,
arginfo = STRING,
}
.fi
Where the meaning of each member of the table has the same meaning as the
struct \fBspank_option\fR members described in \fBspank\fR(8) manpage. That
is:
.TP 8
.B name
is the name of the option. That is the option will be specified by users
as --\fBname\fR. This is a required parameter.
.TP
.B usage
is a short description of the option suitable for \-\-help output. This
is a required parameter.
.TP
.B val
A plugin\-local value which is returned to the option callback function.
This is a required parameter.
.TP
.B cb
Is the name of a global function to use as the callback function when
an option is invoked by the user. The option callback is invoked in both
local/allocator and remote contexts, and must take three arguments like:
.nf
function cb (val, optarg, remote)
.fi
Where val is the \fBspankopt.val\fR used when registering the option,
\fBoptarg\fR is the argument (if any) passed to the option, and
\fRremote\fR is a boolean indicating whether the option callback is
being made in local/allocator or remote context.
.LP
Options may be registered by spank-lua scripts either by use of
the spank:register_option() method from \fBslurm_spank_init\fR, or
by exporting a global \fBspank_options\fR table. The \fBspank_options\fR
table must be a list of spank option tables as described above,
for example:
.nf
spank_options = {
{
name = "test",
usage = "A test option for spank-lua",
val = 1,
cb = "option_handler",
}
}
.fi
When \fBspank:getopt\fR is supported, the callback function \fIcb\fR is
optional, since the \fBgetopt\fR function may be used as an alternative.
.SH EXAMPLE
The following example \fBspank-lua\fR script exports an environment
varable to the Slurm prolog and epilog to control the current value
of memory overcommit on the nodes of the job. Users can enable this
optional behavior by using the new commandline option --no-memory-overcommit.
.nf
.sp
-- Global spank_options table:
--
\fBspank_options\fR = {
{
name = "no-memory-overcommit",
usage = "Disable memory overcommit on nodes of Slurm job",
cb = "opt_handler"
}
}
.sp
got_option = false
function opt_handler (v, arg, remote) got_option = true end
.sp
function \fBslurm_spank_init_post_opt\fR (spank)
--
-- Return success if we're not in local context, or the user
-- did not specify --no-memory-overcommit.
--
if \fBspank.context\fR == "remote" or not got_option then
return \fISPANK.SUCCESS\fR
end
\fBSPANK.log_info\fR ("slurm_spank_init_post_opt")
--
-- Set SPANK_NO_OVERCOMMIT in the "job control" environment
--
local rc, msg = \fBspank:job_control_setenv\fR ("NO_OVERCOMMIT", 1, 1)
--
-- Like other lua functions, spank methods return nil and an
-- error string on failure.
--
if rc == nil then
\fBSPANK.log_error\fR ("Failed to propagate NO_OVERCOMMIT: %s", msg)
return \fISPANK.FAILURE\fR
end
return \fISPANK.SUCCESS\fR
end
.sp
.fi
.LP
The corresponding section of the Slurm prolog might then look like:
.nf
if test -n "$SPANK_NO_OVERCOMMIT"; then
echo 2 > /proc/sys/vm/overcommit_memory
fi
.fi
(Corresponding code to reset the overcommit_memory value in the epilog
should also be included in any full-featured solution)
.LP
The \fBsrun\fR command would now present a new option to the user:
.nf
$ srun --help
...
Options provided by plugins:
--no-memory-overcommit Disable memory overcommit on nodes of Slurm job
.fi
.SH COPYRIGHT
Copyright (C) 2009 Lawrence Livermore National Security, LLC.
Produced at Lawrence Livermore National Laboratory. UCRL-CODE-235358
This is free software; you can redistribute it and/or modify it under the
terms of the GNU General Public License as published by the Free Software
Foundation.
.SH "SEE ALSO"
.BR spank (8),
.BR srun (1),
.BR salloc (1),
.BR sbatch (1),
.PP
\fBhttp://slurm-spank-plugins.googlecode.com/\fR