In the summer of 2006, a quiet revolution was taking place within the world of supercomputing, driven not by flashy hardware but by a piece of software named TORQUE. This distributed resource manager was designed to orchestrate the chaotic symphony of batch jobs and distributed compute nodes that power the most powerful computers on Earth. Before TORQUE, managing a cluster of thousands of processors was a manual nightmare, often resulting in wasted energy and idle time. The software offered a centralized brain to control, schedule, and administer these massive systems, ensuring that every cycle of computation was put to use. It was a tool built for the invisible labor of science, allowing researchers to submit jobs and have them executed efficiently across a network of machines without human intervention. The story of TORQUE is the story of the invisible infrastructure that makes modern scientific discovery possible, turning raw computing power into actionable knowledge.
From PBS To Power
The origins of TORQUE lie in the Portable Batch System, a legacy software that had long served the scientific community. The TORQUE community did not start from scratch but instead expanded upon the capabilities of this existing foundation to improve scalability and fault tolerance. This evolution was driven by a coalition of high-performance computing entities, including the National Center for Supercomputing Applications, the Ohio Supercomputer Center, and the University of Southern California. These organizations, along with the US Department of Energy and national laboratories like Sandia and PNNL, worked to transform a simple batch system into a robust resource manager capable of handling the demands of the TeraGrid. The transition from PBS to TORQUE was not merely a code update but a strategic shift to meet the growing needs of the scientific community. By integrating with schedulers like the non-commercial Maui Cluster Scheduler or the commercial Moab Workload Manager, TORQUE gained enhanced functionality that allowed for deeper optimization in complex cluster environments. This integration marked a turning point where the software moved from being a simple job queue to a comprehensive management system.The License Dilemma
A significant shift occurred in the software landscape in June 2018 when TORQUE ceased to be considered open-source software due to licensing issues. For years, the project had utilized the OpenPBS version 2.3 license, which allowed for broad distribution and modification. However, the software was eventually categorized as non-free software according to the Debian Free Software Guidelines, a classification that fundamentally changed its status within the community. This decision was not made lightly but reflected the complex legal and ethical considerations surrounding software ownership and distribution. The change meant that the software could no longer be freely modified and distributed under the terms that the community had come to expect. This shift highlighted the fragility of open-source projects when faced with evolving legal standards and the need for sustainable development models. The transition from open-source to non-free status marked a new chapter in the history of the software, forcing users and developers to adapt to a new reality where the code was no longer entirely free to the public.