Moab Workload Manager

mdiag -n

(Moab Node Diagnostics)

Synopsis

mdiag -n [-t partitionid] [-A creds] [-v] [--format=xml] [nodeid]

Overview

The mdiag -n command provides detailed information about the state of nodes Moab is currently tracking. This command also performs a large number of sanity and state checks. The node configuration and status information as well as the results of the various checks are presented by this command.

Arguments

Flag Argument Description
{user|group|account|qos|class|job}:<OBJECTID> report if each node is accessible by requested job or credential
<partitionid> report only nodes from specified partition
- show verbose output (do not truncate columns and add columns for additional node attributes)

Output

This command presents detailed node information in whitespace delineated fields.

The output of this command can be extensive and the values for a number of fields may be truncated. If truncated, the '-v flag can be used to display full field content.
Column Format
<NODE NAME
<NODE STATE
<AVAILABLE PROCS>:<CONFIGURED PROCS>
<AVAILABLE MEMORY>:<CONFIGURED MEMORY>
<AVAILABLE DISK>:<CONFIGURED DISK>
<AVAILABLE SWAP>:<CONFIGURED SWAP>
<RELATIVE MACHINE SPEED>
<NODE OPERATING SYSTEM>
<NODE HARDWARE ARCHITECTURE>
<PARTITION NODE IS ASSIGNED TO>
<CURRENT 1 MINUTE BSD LOAD>
<NUMBER OF RESERVATIONS ON NODE>
<CLASS NAME><CLASS INSTANCES AVAILABLE>:<CLASS INSTANCES CONFIGURED>...
<NETWORK NAME>...
<NODE FEATURE>...

Example 1

> mdiag -n

compute node summary
Name                    State   Procs      Memory         Opsys
 
opt-001                  Busy    0:2      2048:2048        SuSE
opt-002                  Busy    0:2      2048:2048        SuSE
opt-003                  Busy    0:2      2048:2048        SuSE
opt-004                  Busy    0:2      2048:2048        SuSE
opt-005                  Busy    0:2      2048:2048        SuSE
opt-006                  Busy    0:2      2048:2048        SuSE
WARNING:   swap is low on node opt-006
opt-007                  Busy    0:2      2048:2048        SuSE
opt-008                  Busy    0:2      2048:2048        SuSE
opt-009                  Busy    0:2      2048:2048        SuSE
opt-010                  Busy    0:2      2048:2048        SuSE
opt-011                  Busy    0:2      2048:2048        SuSE
opt-012                  Busy    0:2      2048:2048        SuSE
opt-013                  Busy    0:2      2048:2048        SuSE
opt-014                  Busy    0:2      2048:2048        SuSE
opt-015                  Busy    0:2      2048:2048        SuSE
opt-016                  Busy    0:2      2048:2048        SuSE
x86-001                  Busy    0:1       512:512       Redhat
x86-002                  Busy    0:1       512:512       Redhat
x86-003                  Busy    0:1       512:512       Redhat
x86-004                  Busy    0:1       512:512       Redhat
x86-005                  Idle    1:1       512:512       Redhat
x86-006                  Idle    1:1       512:512       Redhat
x86-007                  Idle    1:1       512:512       Redhat
x86-008                  Busy    0:1       512:512       Redhat
x86-009                  Down    1:1       512:512       Redhat
x86-010                  Busy    0:1       512:512       Redhat
x86-011                  Busy    0:1       512:512       Redhat
x86-012                  Busy    0:1       512:512       Redhat
x86-013                  Busy    0:1       512:512       Redhat
x86-014                  Busy    0:1       512:512       Redhat
x86-015                  Busy    0:1       512:512       Redhat
x86-016                  Busy    0:1       512:512       Redhat
P690-001                 Busy    0:1     16384:16384        AIX
P690-002                 Busy    0:1     16384:16384        AIX
P690-003                 Busy    0:1     16384:16384        AIX
P690-004                 Busy    0:1     16384:16384        AIX
P690-005                 Busy    0:1     16384:16384        AIX
P690-006                 Busy    0:1     16384:16384        AIX
P690-007                 Idle    1:1     16384:16384        AIX
P690-008                 Idle    1:1     16384:16384        AIX
WARNING:   node P690-008 is missing ethernet adapter
P690-009                 Busy    0:1     16384:16384        AIX
P690-010                 Busy    0:1     16384:16384        AIX
P690-011                 Busy    0:1     16384:16384        AIX
P690-012                 Busy    0:1     16384:16384        AIX
P690-013                 Busy    0:1     16384:16384        AIX
P690-014                 Busy    0:1     16384:16384        AIX
P690-015                 Busy    0:1     16384:16384        AIX
P690-016                 Busy    0:1     16384:16384        AIX
-----                     ---    6:64   745472:745472     -----
 
Total Nodes: 36  (Active: 30  Idle: 5  Down: 1)

Note: Warning messages are interspersed with the node configuration information with all warnings preceded by the keyword 'WARNING'.

XML Output

If XML output is requested (via the --format=xml argument), XML based node information will be written to STDOUT in the following format:

mdiag -n --format=xml

<Data>
  <node> <ATTR>="<VAL>" ... </node>
  ...
</Data>

In addition to the attributes listed below, mdiag -n's node element XML has children that describe a node's messages (Messages XML element).

XML Attributes

Name Description
Available generic resources
Special allocated resources (like vlans)
The node's processor architecture.
Classes available on the node
Time when the node will no longer be availble (used in Utility centers)
Time when the node will be available (used in Utility centers)
Classes configured on the node
generic resources on the node
If true, a node's state and usage is tracked over time.
A list of comma separated custom features describing a node.
A list of comma separated consumable resources associated with a node.
How many hops the node took to reach this Moab (used in hierarchical grids)
Node has been deleted
Node is dynamic (used in Utility centers)
The list of jobs currently running on a node.
current load as reported by the resource manager
load weight used when calculating node priority
See Node Policies for details.
See Node Policies for details.
See Node Policies for details.
See Node Policies for details.
See Node Policies for details.
The ability to specify which networks are available to a given node is limited to only a few resource manager. Using the NETWORK attribute, administrators can establish this node to network connection directly through the scheduler. The NODECFG parameter allows this list to be specified in a comma delimited list.
The unique identifier for a node.
The state of a node.
A node's operating system.
Operating systems the node can run
URL for changing the operating system
Credential type and name of owner
The partition a node belongs to. See Node Location for details.
The state of the node's power. Either ON or OFF.
The fixed node priority relative to other nodes.
A node's processsor speed information specified in MHz.
The rack associated with a node's physical location.
The total available disk on a node.
The total available memory available on a node.<
The total number of processors available on a node.
The total available swap on a node.
The total configured memory on a node.
The total configured processors on a node.
The total configured swap on a node.
Number of reservations on the node
List of reservations on the node
Deprecated (use GRES)
A comma separated list of resource managers who have access to a node.
The number of slots or size units consumed by the node.
The first slot in the rack associated with the node's physical location.
A node's relative speed.
speed weight used to calculate node's priority
Time node was active
Time node's state was modified
Time node has been monitored
Time node has been up
The number of tasks on a node.

See Also