TORQUE* Resource Manager's user community, in cooperation with Cluster Resources, contributed several fixes in the last month to create TORQUE 1.2.0p5.
One important fix stops the problem of qstat hanging while pbs_server is waiting for an update from an offline MOM. The problem occurred when a MOM in the "down, job-exclusive" state was marked offline using the "pbsnodes -o" command.
A patch submitted by the University of Maine fixes a communication error during job cancellation. When Cluster Resources tested the patch, they found it increased the performance of qdel in certain circumstances.
On behalf of the user community we would also like to thank all those not mentioned above who helped find and fix problems and test new patches. To learn more about TORQUE visit: http://www.clusterresources.com/products/torque/ . To join TORQUE's user community go to: http://www.clusterresources.com/mailing.shtml .
* This product includes software developed by NASA Ames Research Center, Lawrence Livermore National Laboratory, and Veridian Information Solutions, Inc. Visit www.OpenPBS.org for OpenPBS software support, products, and information. TORQUE is neither endorsed by nor affiliated with Altair Grid Solutions, Inc.
About Cluster Resources:
Cluster Resources, Inc.â„¢ is a leading provider of workload and resource management software and services for cluster, grid, hosting center and utility-based computing environments. Cluster Resources' high-performance computing solutions enable administrators to control and optimize parallel and serial computing resources. Its professional Moab product line provides HPC sites with the most advanced workload management, scheduling and policy control (e.g., advanced reservations, backfill, checkpoint, preemption, fairshare, prioritization, etc.). Moab is compatible with batch resource managers such as Platform's LSF, Altair's PBS Pro, and IBM's LoadLeveler, and open source tools including TORQUE, OpenPBS and others. Moab runs on Linux, Unix, and Mac OS X environments, and is also accessible from Windows. For more information call (801) 873-3400 or visit http://www.clusterresources.com .