(Click to open topic with navigation)
The following is a list of some key bugs fixed in Moab HPC Suite – Basic Edition. Following each issue description is an associated issue number in parentheses.
Resolved issues are aggregated and grouped by the release version in which they were resolved.
7.2.9/
Moab Authentication Bypass issue. This has been fixed. (CVE-2014-5300, MOAB-7100, MOAB-7524)
Moab was not decrementing GRES correctly within a Cray environment. Fixed a GRES bug found in Cray environments. (MOAB-7213)
GRESTOJOBATTR entry in moab.cfg breaks MAXGRES set in IDCFG. Fixed bug where MAXGRES ignored when reading IDCFG information from file and GRESTOJOBATTR set in moab.cfg. (MOAB-7078)
FLAGS=SharedMem and MEMWEIGHT did not work together. Fixed bug where MEMWEIGHT not applied when FLAGS=SharedMem was configured. (MOAB-7033)
TORQUE was leaving behind error and out files when a job was preempted or requeued. Fixed bug where OU files were being left in spool when job was preempted or requeued. (TRQ-2732)
Viewpoint was not unsetting Node Allocation Priority Policy. Fixed bug where Viewpoint wasn't correctly unsetting Node Allocation Priority Policy. (VEW-5721)
momctl -q clearmsg didn't seem to clear error messages permanently. Fixed bug where `momctl -q clearmsg` didn't properly clear error messages. (TRQ-2828)
Moab XML was adding incorrect information. Fixed bug where incorrect data was being added to job XML. (MOAB-7108)
max_user_queuable limit reached, however, there were no jobs in the queue. Fixed bug where jobs rejected due to max_user_queuable limit reached, yet no jobs in the queue. (TRQ-2795)
Jobs were going to wrong partition when partition(s) down. Fixed bug where jobs were being placed in wrong partition when one or more requested partitions were down. (MOAB-7103)
Preemptor Jobs were not getting started correctly. Fixed bug where preemptor jobs were not getting started correctly. (MOAB-7205)
qsub did not process arguments correctly when a submitfilter is used. Fixed bug where qsub did not process args correctly when using a submit filter. (TRQ-2646)
Reported cput was incorrect. Fixed bug where reported cput was incorrect. (TRQ-2759)
Jobs were not being rejected even when the jobs violate fairshare tree policy. Fixed bug where some jobs were not rejected even though they violated fairshare policy. (MOAB-7077)
Large environment variables (>1024 chars) crashed interactive jobs. Fixed bug where very large environment variables caused failures in interactive jobs. (MOAB-7021)
Moab was losing limit set with changeparam. Fixed bug with changeparam and multidimensional policies. (MOAB-7234)
mschedctl -m config "IDCFG[accounts] REFRESHPERIOD=minute|hour|infinity" did not change setting in Moab. Fixed bug with mschedctl -m config related to IDCFG and REFRESHPERIOD. (MOAB-6887)
Moab reservation corruption was blocking jobs. Fixed bug with reservations across moab restarts. (MOAB-7140)
qsub was passing arguments to submit filter all together with interactive jobs. Fixed bug with submit filter arguments not being parsed during interactive jobs. (TRQ-2707)
TORQUE appeared to be truncating environment variables when the node list is larger (such as procs=1952). Fixed bug with truncated job environment variables in cases where node lists are very long. (MOAB-7136)
Build bug reported with MIC libraries. Fixed build bug related to newer Intel MIC libraries installing in different locations. (TRQ-2653)
A sample mom.layout file might look like
nodes=0 gpu=0
nodes=1 gpu=1
This only works if you use nvml. The nvidia-smi command is not supported.
Crash on start up when reading empty array file. Fixed start up bug related to empty job array (.AR) files. (TRQ-2787)
Long startup time reported with 7.2.8. Improved performance during startup. (MOAB-7243)
qstat wouldn't parse anything after a bad job ID. Improved qstat behavior in cases where bad job IDs were referenced in the command. (TRQ-2410)
multireq jobs take hours to start. Fix bug where multi-req jobs were slow to start in certain cases. (MOAB-6824).
SRCFG with ROLLBACKOFFSET not creating reservations. Fixed bug where SRCFG with ROLLBACKOFFSET was not creating expected reservations (MOAB-7148)
Node going down briefly caused node conflict with reservations. Fixed bug related to standing reservation conflicts over a node reported as Down. (MOAB-7378)
Reservation was not being created when some resources are available. Fixed an issue with standing reservations that occasionally weren't created when they should be. (MOAB-7384)
Separate headers for multiple jobs IDs provided to qstat. Fixed output format bug in cases where multiple job IDs are passed into qstat. (TRQ-2411)
MWS should allow job script inside the POST body. Job scripts can now be included in the POST body when submitting jobs via MWS. (WS-2112)
7.2.8/
Moab had several memory leaks. Moab no longer has these memory leaks, use of uninit variable, and double free. (MOAB-6811)