Difference between revisions of "New mek-quake"

From CUC3
import>Cen1001

Revision as of 15:50, 11 April 2006

New mek-quake

This is a large cluster system to be used by the Wales and Vendruscolo groups. There are some decisions to be made about setting it up.

On clust (the other Wales group cluster) the nodes are dual-CPU, but this is deliberately hidden from the queueing system: the smallest unit you can get is a single node. This was done to avoid the situation we get on nimbus, a similar machine, where parallel jobs are sometimes assigned one task on certain nodes instead of two. This seems to happen despite the queues being configured to ask for two tasks per node on parallel jobs, but it is always possible for users to reset this, so I do not know
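
The difference between the two allocation policies can be illustrated with a small model. This is only a sketch under stated assumptions: it is not the configuration of clust or nimbus, and the names `Node`, `allocate_by_cpu` and `allocate_by_node` are hypothetical. It assumes a scheduler that either hands out individual CPUs (so a half-busy node can receive a single task) or only hands out completely idle nodes.

```python
from dataclasses import dataclass

CPUS_PER_NODE = 2  # dual-CPU nodes, as described above


@dataclass
class Node:
    name: str
    free_cpus: int = CPUS_PER_NODE


def allocate_by_cpu(nodes, tasks):
    """Hypothetical CPU-level policy: any free CPU may be used."""
    if sum(n.free_cpus for n in nodes) < tasks:
        raise RuntimeError("not enough free CPUs")
    placement = {}
    for node in nodes:
        take = min(node.free_cpus, tasks)
        if take:
            placement[node.name] = take
            node.free_cpus -= take
            tasks -= take
    return placement


def allocate_by_node(nodes, tasks):
    """Hypothetical node-level policy: only completely idle nodes are used."""
    needed = -(-tasks // CPUS_PER_NODE)  # ceiling division
    idle = [n for n in nodes if n.free_cpus == CPUS_PER_NODE]
    if len(idle) < needed:
        raise RuntimeError("not enough idle nodes")
    placement = {}
    for node in idle[:needed]:
        placement[node.name] = min(CPUS_PER_NODE, tasks)
        tasks -= placement[node.name]
        node.free_cpus = 0  # the whole node is reserved for this job
    return placement


def make_nodes():
    # n1 already has one CPU busy (for example with a serial job).
    return [Node("n1", free_cpus=1), Node("n2"), Node("n3")]


if __name__ == "__main__":
    print(allocate_by_cpu(make_nodes(), 4))
    print(allocate_by_node(make_nodes(), 4))
```

In this model a four-task job under the CPU-level policy is spread over three nodes, with one task each on two of them, which is the uneven placement described for nimbus. Under the node-level policy the same job gets two whole nodes with two tasks each, at the cost of leaving the spare CPU on the half-busy node unused.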