Identifying a job on a node

From CUC3
Revision as of 09:17, 14 July 2008 by import>Em427

Sometimes you have more than one job running on a node and need to kill only one of them. How do you find which one it is? Here is an explanation given by Catherine:

 Get the PIDs of the candidate processes and examine their open files, which are listed in /proc/<pid>/fd on the node.
 Do an 'ls -l' on that directory for each PID. This shows the files the process has open, so you can see which
 process holds a file you know belongs to the job you want to kill (for example, its output file).
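
The procedure above can be sketched as a small shell function. This is only an illustration, not part of Catherine's explanation: the function name find_job_pid is hypothetical, it assumes a Linux node with /proc mounted, and it assumes you already know the full path of a file that only the job in question has open. You can normally only inspect the file descriptors of your own processes.

```shell
# Hypothetical helper: print each candidate PID that has the given file open.
# Usage: find_job_pid /full/path/to/known/file PID [PID ...]
find_job_pid() {
    target=$1
    shift
    for pid in "$@"; do
        # Every entry in /proc/<pid>/fd is a symlink to a file the process has open.
        for fd in /proc/"$pid"/fd/*; do
            # readlink fails quietly if the process has exited or is not ours.
            if [ "$(readlink "$fd" 2>/dev/null)" = "$target" ]; then
                echo "$pid"
                break
            fi
        done
    done
}
```

Run it on the node itself with the known file followed by the candidate PIDs, then kill the PID it prints. The target must be the path exactly as the kernel reports it in 'ls -l /proc/<pid>/fd', i.e. absolute and with symlinks resolved.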