(Click to open topic with navigation)
Synopsis
checknode options nodeID
ALL
Overview
This command shows detailed state information and statistics for nodes that run jobs.
The following information is returned by this command:
Name | Description |
---|---|
Disk | Disk space available |
Memory | Memory available |
Swap | Swap space available |
State | Node state |
Opsys | Operating system |
Arch | Architecture |
Adapters | Network adapters available |
Features | Features available |
Classes | Classes available |
StateTime | Time node has been in current state in HH:MM:SS notation |
Downtime | Displayed only if downtime is scheduled |
Load | CPU Load (Berkley one-minute load average) |
TotalTime | Total time node has been detected since statistics initialization expressed in HH:MM:SS notation |
UpTime | Total time node has been in an available (Non-Down) state since statistics initialization expressed in HH:MM:SS notation (percent of time up: UpTime/TotalTime) |
ActiveTime | Total time node has been busy (allocated to active jobs) since statistics initialization expressed in HH:MM:SS notation (percent of time busy: BusyTime/TotalTime) |
EffNodeAccessPolicy | Configured effective node access policy |
After displaying this information, some analysis is performed and any unusual conditions are reported.
Access
By default, this command can be run by any Moab Administrator (see ADMINCFG).
Parameters
Name | Description |
---|---|
NODE | Node name you want to check. Moab uses regular expressions to return any node that contains the provided argument. For example, if you ran checknode node1, Moab would return information about node1, node10, node100, etc. If you want to limit the results to node1 only, you would run checknode "^node1$". |
Flags
Name | Description |
---|---|
ALL | Returns checknode output on all nodes in the cluster. |
-h | Help for this command. |
-v | Returns verbose output. |
--xml | Output in XML format. Same as mdiag -n --xml. |
Example 4-4: checknode
> checknode P690-032 node P690-032 State: Busy (in current state for 11:31:10) Configured Resources: PROCS: 1 MEM: 16G SWAP: 2000M DISK: 500G Utilized Resources: PROCS: 1 Dedicated Resources: PROCS: 1 Opsys: AIX Arch: P690 Speed: 1.00 CPULoad: 1.000 Network: InfiniBand,Myrinet Features: Myrinet Attributes: [Batch] Classes: [batch] Total Time: 5:23:28:36 Up: 5:23:28:36 (100.00%) Active: 5:19:44:22 (97.40%) Reservations: Job '13678'(x1) 10:16:12:22 -> 12:16:12:22 (2:00:00:00) Job '13186'(x1) -11:31:10 -> 1:12:28:50 (2:00:00:00) Jobs: 13186
Example 4-5: checknode ALL
> checknode ALL node ahe State: Idle (in current state for 00:00:30) Configured Resources: PROCS: 12 MEM: 8004M SWAP: 26G DISK: 1M Utilized Resources: PROCS: 1 SWAP: 4106M Dedicated Resources: --- MTBF(longterm): INFINITY MTBF(24h): INFINITY Opsys: linux Arch: --- Speed: 1.00 CPULoad: 1.400 Flags: rmdetected Classes: [batch] RM[ahe]*: TYPE=PBS EffNodeAccessPolicy: SHARED Total Time: 00:01:44 Up: 00:01:44 (100.00%) Active: 00:00:00 (0.00%) Reservations: --- node ahe-ubuntu32 State: Running (in current state for 00:00:05) Configured Resources: PROCS: 12 MEM: 2013M SWAP: 3405M DISK: 1M Utilized Resources: PROCS: 6 SWAP: 55M Dedicated Resources: PROCS: 6 MTBF(longterm): INFINITY MTBF(24h): INFINITY Opsys: linux Arch: --- Speed: 1.00 CPULoad: 2.000 Flags: rmdetected Classes: [batch] RM[ahe]*: TYPE=PBS EffNodeAccessPolicy: SHARED Total Time: 00:01:44 Up: 00:01:44 (100.00%) Active: 00:00:02 (1.92%) Reservations: 6x2 Job:Running -00:00:07 -> 00:01:53 (00:02:00) 7x2 Job:Running -00:00:06 -> 00:01:54 (00:02:00) 8x2 Job:Running -00:00:05 -> 00:01:55 (00:02:00) Jobs: 6,7,8 node ahe-ubuntu64 State: Busy (in current state for 00:00:06) Configured Resources: PROCS: 12 MEM: 2008M SWAP: 3317M DISK: 1M Utilized Resources: PROCS: 12 SWAP: 359M Dedicated Resources: PROCS: 12 MTBF(longterm): INFINITY MTBF(24h): INFINITY Opsys: linux Arch: --- Speed: 1.00 CPULoad: 0.000 Flags: rmdetected Classes: [batch] RM[ahe]*: TYPE=PBS EffNodeAccessPolicy: SHARED Total Time: 00:01:44 Up: 00:01:44 (100.00%) Active: 00:00:55 (52.88%) Reservations: 0x2 Job:Running -00:01:10 -> 00:00:50 (00:02:00) 1x2 Job:Running -00:00:20 -> 00:01:40 (00:02:00) 2x2 Job:Running -00:00:20 -> 00:01:40 (00:02:00) 3x2 Job:Running -00:00:17 -> 00:01:43 (00:02:00) 4x2 Job:Running -00:00:13 -> 00:01:47 (00:02:00) 5x2 Job:Running -00:00:07 -> 00:01:53 (00:02:00) Jobs: 0,1,2,3,4,5 ALERT: node is in state Busy but load is low (0.000)
Related topics