You are here: 12 Troubleshooting and System Maintenance > Diagnostic Scripts

12.8 Diagnostic Scripts

Moab Workload Manager provides diagnostic scripts that can help aid in monitoring the state of the scheduler, resource managers, and other important components of the cluster software stack. These scripts can also be used to help diagnose issues that may need to be resolved with the help of Adaptive Computing support staff. This section introduces available diagnostic scripts.

12.8.1 support-diag.py

The support-diag.py script has a two-fold purpose. First, it can be used by a Moab trigger or cron job to create a regular snapshot of the state of Moab. The script captures the output of several Moab diagnostic commands (such as showq, mdiag -n, and mdiag -S), gathers configuration/log files, and records pertinent operating system information. This data is then compressed in a time-stamped tarball for easy long-term storage.

Second, the script provides Adaptive Computing support personnel with a complete package of information that can be used to help diagnose configuration issues or system bugs. After capturing the state of Moab, the resulting tarball can be sent to your Adaptive Computing support contact for further diagnosis.

The script asks you for the trouble ticket number, -t <TICKET#>, or -n. If you chose to enter -t <TICKET#> the script uploads your support diagnostic output to Adaptive Computing Customer Support. The upload and ticket number request can be prevented using the -n option.

12.8.1.A Synopsis

support-diag.py [<options>]

12.8.1.B Arguments

Argument Description
-h, --help Show this help message and exit.
-q, --diag-torque-off, --without-torque Disable Torque diagnostics.
-p TMPDIR, --tmp-dir=TMPDIR Use a different tmp directory to store output.
-n, --no-upload Do not upload to Adaptive Computing.
-t TICKET# Support ticket number.
-f, --full-mode Gather additional logs, stats and, moab.db files.
-u TIMEOUT, --moab-timeout=TIMEOUT Define Moab command timeout (Default 300 seconds).
-d, --debug-mode support-diag print debug variables.
-o, --offline-mode Gather offline logging only.
-r, --ftp Use ftp instead of scp.
-V, --version Print version information.

12.8.2 support.diag.pl

This script is deprecated with the 8.0 release. Use the support-diag.py script instead.

The support.diag.pl script has a two-fold purpose. First, it can be used by a Moab trigger or cron job to create a regular snapshot of the state of Moab. The script captures the output of several Moab diagnostic commands (such as showq, mdiag -n, and mdiag -S), gathers configuration/log files, and records pertinent operating system information. This data is then compressed in a time-stamped tarball for easy long-term storage.

The second purpose of the support.diag.pl script is to provide Adaptive Computing support personnel with a complete package of information that can be used to help diagnose configuration issues or system bugs. After capturing the state of Moab, the resulting tarball could be sent to your Adaptive Computing support contact for further diagnosis.

The support.diag.pl will ask you for the trouble ticket number then guide you through the process of uploading the data to Adaptive Computing Customer Support. The uploading and ticket number request may be prevented using the --no-upload and --support-ticket=<SUPPORT_TICKET_ID> flags detailed in the Arguments table that follows.

12.8.2.A Synopsis

support.diag.pl [--include-log-lines=<NUM>] [--diag-torque]

12.8.2.B Arguments

Argument Description
--include-log-lines=<NUM> Instead of including the entire moab.log file, only the last <NUM> lines are captured in the diagnostics.
--diag-torque Diagnostic commands pertinent to the Torque resource manager are included.
--no-upload Prevents the system from asking the user if they want to upload the tarball to Adaptive Computing Customer Support.
--support-ticket=<SUPPORT_TICKET_ID> Prevents the system from asking the user for a support ticket number.

© 2016 Adaptive Computing