Appendices > Appendix M: BLCR acceptance tests > Test 2 - Persistence of checkpoint images

M.3 Test 2 - Persistence of checkpoint images

Introduction

This test determines if the checkpoint files remain in the default directory after the job is removed from the TORQUE queue.

Note that this behavior was requested by a customer but in fact may not be the right thing to do as it leaves the checkpoint files on the execution node. These will gradually build up over time on the node being limited only by disk space. The right thing would seem to be that the checkpoint files are copied to the users home directory after the job is purged from the execution node.

Test steps

Assuming the steps of Test 1 (see Test 1 - Basic operation), delete the job and then wait until the job leaves the queue after the completed job hold time. Then look at the contents of the default checkpoint directory to see if the files are still there.

> qsub -c enabled test.sh

999.xxx.yyy

> qhold 999

> qdel 999

> sleep 100

> qstat

>

> find /var/spool/torque/checkpoint

... files ...

Possible failures

The files are not there, did Test 1 actually pass?

Successful results

The files are there.

Related topics