4.239 Compute Nodes

Use the Adaptive Computing Torque package system to create self-extracting tarballs which can be distributed and installed on compute nodes. The Torque package are customizable. See the INSTALL file for additional options and features.

If you installed Torque using the RPMs, you must install and configure your nodes manually by modifying the /var/spool/torque/mom_priv/config file of each one. This file is identical for all compute nodes and can be created on the head node and distributed in parallel to all systems.

[root]# vi /var/spool/torque/mom_priv/config
			
$pbsserver     headnode    # hostname running pbs server     
$logevent      225         # bitmap of which events to log
 
[root]# service pbs_mom restart

To create Torque packages

  1. Configure and make as normal, and then run make packages.
  2. > make packages

    Building ./torque-package-clients-linux-x86_64.sh ...

    Building ./torque-package-mom-linux-x86_64.sh ...

    Building ./torque-package-server-linux-x86_64.sh ...

    Building ./torque-package-gui-linux-x86_64.sh ...

    Building ./torque-package-devel-linux-x86_64.sh ...

    Done.

     

    The package files are self-extracting packages that can be copied and executed on your production machines. Use --help for options.

  3. Copy the desired packages to a shared location.
  4. > cp torque-package-mom-linux-x86_64.sh /shared/storage/

    > cp torque-package-clients-linux-x86_64.sh /shared/storage/

  5. Install the Torque packages on the compute nodes.
  6. Adaptive Computing recommends that you use a remote shell, such as SSH, to install Torque packages on remote systems. Set up shared SSH keys if you do not want to supply a password for each host.

    The only required package for the compute node is mom-linux. Additional packages are recommended so you can use client commands and submit jobs from compute nodes.

    The following is an example of how to copy and install mom-linux in a distributed fashion.

    > for i in node01 node02 node03 node04 ; do scp torque-package-mom-linux-x86_64.sh ${i}:/tmp/. ; done

    > for i in node01 node02 node03 node04 ; do scp torque-package-clients-linux-x86_64.sh ${i}:/tmp/. ; done

    > for i in node01 node02 node03 node04 ; do ssh ${i} /tmp/torque-package-mom-linux-x86_64.sh --install ; done

    > for i in node01 node02 node03 node04 ; do ssh ${i} /tmp/torque-package-clients-linux-x86_64.sh --install ; done

    > for i in node01 node02 node03 node04 ; do ssh ${i} ldconfig ; done

Alternatively, you can use a tool like xCAT instead of dsh.

To use a tool like xCAT

  1. Copy the Torque package to the nodes.
  2. > prcp torque-package-linux-x86_64.sh noderange:/destinationdirectory/

  3. Install the Torque package.
  4. > psh noderange /tmp/torque-package-linux-x86_64.sh --install

Although optional, it is possible to use the Torque server as a compute node and install a pbs_mom with the pbs_server daemon.

Related Topics 

© 2017 Adaptive Computing