12.1 NUMA-Aware Systems

This topic is the central reference for NUMA-aware systems. It provides basic information and links to the NUMA-aware topics found throughout the documentation.

Support for NUMA-aware systems is available only with Torque Resource Manager 6.0 and later and Moab Workload Manager 9.0 and later.

In this topic:

12.1.1 About NUMA-Aware Systems
12.1.2 Installation and Configuration
12.1.3 Job Resource Requests
12.1.4 Job Monitoring
12.1.5 Moab/Torque NUMA Configuration

12.1.1 About NUMA-Aware Systems

A NUMA (non-uniform memory access) architecture is a hardware design that separates its cores into multiple clusters. Each cluster has its own local memory region, but cores in one cluster can still access all memory in the system. If a core needs memory outside its own region, accessing that (remote) memory takes longer. For performance-critical applications, avoiding access to memory in other clusters is essential.

Torque uses cgroups to better manage CPU and memory accounting, memory enforcement, cpuset management, and the binding of jobs to devices such as MICs and GPUs. Torque tries to place jobs that request GPUs or MICs on the NUMA nodes closest to the GPU or MIC device to be used.

PCIe devices are similar to cores in that these devices will be closer to the memory of one NUMA node than another. Examples of PCIe devices are GPUs, NICs, disks, etc.

The resources of a processor chip form a hierarchy. The largest unit is a socket. A socket can contain one or more NUMA nodes, each with its own cores and memory. A NUMA node contains a set of cores, their threads, and the memory that is local to that NUMA node. A core may have zero or more threads.

A socket refers to the physical location where a processor package plugs into a motherboard. The processor that plugs into the motherboard is also known as a socket. The socket can contain one or more NUMA nodes.
A core is an individual execution unit within a processor that can independently execute a software execution thread and maintains its execution state separate from the execution state of any other cores within a processor.
A thread refers to a hardware-based thread execution capability. For example, the Intel Xeon 7560 has eight cores, each of which has hardware that can effectively execute two software execution threads simultaneously, yielding 16 threads.
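
The hierarchy described above can be modeled as a small data structure. This is only an illustrative sketch: the class names and the memory size are hypothetical and are not part of Torque. The example assumes all eight cores sit in a single NUMA node and reproduces the thread count given above (eight cores with two hardware threads each).

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Core:
    # Number of hardware threads this core can run simultaneously.
    threads: int = 1


@dataclass
class NumaNode:
    # Cores local to this NUMA node and the size of its local memory region.
    cores: List[Core] = field(default_factory=list)
    local_memory_gb: int = 0


@dataclass
class Socket:
    # A socket contains one or more NUMA nodes.
    numa_nodes: List[NumaNode] = field(default_factory=list)

    def core_count(self) -> int:
        return sum(len(node.cores) for node in self.numa_nodes)

    def thread_count(self) -> int:
        return sum(core.threads for node in self.numa_nodes for core in node.cores)


# Eight cores with two hardware threads each, assumed to share one NUMA node.
# The memory size is a made-up value for illustration.
example_socket = Socket(
    numa_nodes=[NumaNode(cores=[Core(threads=2) for _ in range(8)], local_memory_gb=32)]
)
print(example_socket.core_count(), example_socket.thread_count())  # prints: 8 16
```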

The following image is a simple depiction of a NUMA-aware architecture. In this example, the system has two NUMA nodes with four cores per NUMA node. The cores in each NUMA node have access to their own memory region but they can also access the memory region of the other NUMA node through the inter-connect.

If the cores in NUMA node 0 need to fetch memory from NUMA node 1, the access incurs greater latency.
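
On a Linux host, the layout described above can usually be inspected through the kernel's sysfs tree. The following sketch is not part of Torque. It assumes the standard Linux files `/sys/devices/system/node/nodeN/cpulist` and `/sys/devices/system/node/nodeN/distance` are present, and the function names are hypothetical. A larger distance value indicates a more expensive (remote) memory access.

```python
from pathlib import Path
from typing import Dict, List

# Standard Linux sysfs location for NUMA nodes (assumed to exist on the host).
SYSFS_NODE_DIR = Path("/sys/devices/system/node")


def parse_cpulist(text: str) -> List[int]:
    """Expand a kernel CPU list such as '0-3,8-11' into individual CPU ids."""
    cpus: List[int] = []
    for part in text.strip().split(","):
        if not part:
            continue  # a memory-only node has an empty CPU list
        if "-" in part:
            first, last = part.split("-")
            cpus.extend(range(int(first), int(last) + 1))
        else:
            cpus.append(int(part))
    return cpus


def parse_distances(text: str) -> List[int]:
    """Parse one row of the node distance table, for example '10 21'."""
    return [int(value) for value in text.split()]


def read_topology(node_dir: Path = SYSFS_NODE_DIR) -> Dict[int, dict]:
    """Return {node id: {'cpus': [...], 'distances': [...]}}, or {} if unavailable."""
    topology: Dict[int, dict] = {}
    if not node_dir.is_dir():
        return topology
    for entry in node_dir.glob("node[0-9]*"):
        node_id = int(entry.name[len("node"):])
        topology[node_id] = {
            "cpus": parse_cpulist((entry / "cpulist").read_text()),
            "distances": parse_distances((entry / "distance").read_text()),
        }
    return topology
```

Calling `read_topology()` on a two-node system like the one depicted would return one entry per NUMA node, each listing its local cores and its relative distance to every node.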

12.1.2 Installation and Configuration

After Torque is installed, you must perform additional configuration steps to enable NUMA-aware support.

See:

2.23 Torque NUMA-Aware Configuration

12.1.3 Job Resource Requests

NUMA-aware resources can be requested at job submission time using the qsub/msub -L parameter. In addition, the req_information_max and req_information_min queue attributes let you specify the maximum and minimum resource limits allowed for jobs submitted to a queue.
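
As a rough illustration of how a -L request might be assembled in a submission script, consider the sketch below. The helper name `build_numa_request` is hypothetical, and the keys `tasks`, `lprocs`, `place`, and `memory` with colon separators are assumptions about the -L syntax. Verify the exact keys and value formats against the -L reference for your Torque version before use.

```python
from typing import List, Optional


def build_numa_request(tasks: int, lprocs: int,
                       place: Optional[str] = None,
                       memory: Optional[str] = None) -> List[str]:
    """Build the '-L' argument pair for a submission command (assumed syntax)."""
    if tasks < 1 or lprocs < 1:
        raise ValueError("tasks and lprocs must be positive")
    fields = [f"tasks={tasks}", f"lprocs={lprocs}"]
    if place:
        fields.append(f"place={place}")    # assumed key, e.g. place=numanode
    if memory:
        fields.append(f"memory={memory}")  # assumed key, e.g. memory=8gb
    return ["-L", ":".join(fields)]


# Compose a command line; 'job.sh' is a placeholder script name.
args = ["qsub"] + build_numa_request(2, 4, place="numanode", memory="8gb") + ["job.sh"]
print(" ".join(args))
```

Building the argument as a list rather than one string keeps it safe to pass directly to `subprocess.run` without shell quoting.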

See:

12.1.4 Job Monitoring

On NUMA-aware systems, job resources are tracked per task. qstat -f produces a new category of information that begins with the "req_information" keyword. Following each "req_information" keyword is another keyword giving information about how the job was allocated. When the job has completed, the output also includes the per-task resident memory used and the per-task CPU time used.

See:

3.13.1 Monitoring NUMA Job Task Placement
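
The per-task entries can be pulled out of qstat -f output with a few lines of scripting. The sketch below is an assumption-laden example: it assumes each entry appears on one line as `key = value`, it does not handle values that wrap onto continuation lines, and the sample lines are invented for illustration rather than captured from a real job.

```python
from typing import Dict, Iterable


def extract_req_information(lines: Iterable[str]) -> Dict[str, str]:
    """Collect 'req_information...' entries from qstat -f style text."""
    info: Dict[str, str] = {}
    for line in lines:
        stripped = line.strip()
        if not stripped.startswith("req_information"):
            continue
        key, sep, value = stripped.partition(" = ")
        if sep:  # skip lines that do not follow the assumed 'key = value' form
            info[key] = value.strip()
    return info


# Hypothetical sample text; real output may use different keys and values.
sample = """\
Job Id: 42.example-host
    Job_Name = demo
    req_information.task_count.0 = 2
    req_information.lprocs.0 = 1
""".splitlines()

for key, value in extract_req_information(sample).items():
    print(key, "->", value)
```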

12.1.5 Moab/Torque NUMA Configuration

Moab does not require special configuration to support NUMA-aware systems. However, there are a few Moab-specific details that are helpful to know and understand.

See:

Using NUMA with Moab in the Moab Workload Manager Administrator Guide