knielson

TORQUE Protocol 2 – PBS Batch Protocol

This entry is part 3 of 4 in the series TORQUE Protocols

This entry is part 3 of 4 in the series TORQUE ProtocolsIf you started reading TORQUE Protocols 101  you may be wondering why I am starting with Protocol 2 and not Protocol 1. The answer is that the numbers assigned to each protocol in the TORQUE header files was done long before I started looking […]

TORQUE Protocols 101: trqauthd

This entry is part 2 of 4 in the series TORQUE Protocols

This entry is part 2 of 4 in the series TORQUE ProtocolsI decided to make trqauthd the second addition to my TORQUE Protocols 101 series because a knowledge of how trqauthd works is very important to understanding how to debug TORQUE issues. I was actually well into what was originally going to be this month’s […]

TORQUE Protocols 101

This entry is part 1 of 4 in the series TORQUE Protocols

This entry is part 1 of 4 in the series TORQUE ProtocolsOne definition for protocol put forth by Miriam-Webster is a system of rules that explain the correct conduct and procedures to be followed in formal situations When it comes to network communications I think that this definition pretty much covers it. Sure there are […]

Advanced TORQUE Training: Mother Superior

In the last Advanced TORQUE Training we discussed that the responsibility of the TORQUE resource manager is to allocate the nodes requested for a job and then launch the users job script. In a multi-node job pbs_server sends the job information to only one of the nodes in the job and that node is called […]

Advanced TORQUE Training: Job Life Cycle

As I have worked on customer issues over the years I often hear administrators muse that it would be nice to have some advanced TORQUE training to help them to better understand what is happening on their cluster when things are not working as expected. This post will be the first in a series of […]