There are several key considerations in getting a batch system to scale.
Proper Resource Manager Configuration - TORQUE
Direct Node Communication - NODEPOLLFREQUENCY
Aggregating Scheduling Cycles - JOBAGGREGATIONTIME