Identifying job on a node
Revision as of 09:17, 14 July 2008 by import>Em427
Sometimes a node has more than one job running and you need to kill only one of them. How do you find which one it is? Here is an explanation given by Catherine:
Get the PIDs of the candidate processes and examine their open files, which are listed in /proc/<pid>/fd on the given node. Run 'ls -l' on that directory for each PID. The listing shows the files the process has open, so you can see which process holds a file that you know belongs to the job in question (for example, its output file).
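The check described above could be scripted roughly as follows. This is only a sketch, not part of Catherine's explanation: the function name find_pids_with_file and the PROC_ROOT variable are hypothetical, and it assumes you already know the full path of a file that the job has open. It reads the symbolic links in each fd directory with readlink instead of reading the 'ls -l' output by eye.

```shell
# Hypothetical helper: print each candidate PID that has TARGET open.
# Usage: find_pids_with_file TARGET PID [PID...]
# PROC_ROOT is an assumed override (useful for testing); it defaults to /proc.
find_pids_with_file() {
    target=$1
    shift
    for pid in "$@"; do
        for fd in "${PROC_ROOT:-/proc}/$pid/fd"/*; do
            # Each entry in fd/ is a symlink pointing at the open file.
            link=$(readlink "$fd" 2>/dev/null) || continue
            if [ "$link" = "$target" ]; then
                echo "$pid"
                break   # one match is enough for this PID
            fi
        done
    done
    return 0
}
```

Called as find_pids_with_file followed by the file path and the candidate PIDs, it prints only the PIDs whose fd directory contains a link to that exact path. Reading /proc/<pid>/fd normally requires being the owner of the process or root, so PIDs you cannot inspect are silently skipped.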