IRIDIA cluster installing cluster software

This page contains information on installing clustering software such as the Sun Grid Engine and parallel programming middleware such as PVM and MPI.

Sun Grid Engine

The Sun Grid Engine can be installed either from a backup or from scratch. The instructions below cover installation from a backup.

Copy the backed-up tar.bz2 archive to majorana and extract it into /usr/local/sge. Then add the following line to /etc/profile:

. /usr/local/sge/default/common/settings.sh

and in /etc/csh.cshrc:

source /usr/local/sge/default/common/settings.csh
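
The two additions above can be made safe to repeat, so that rerunning the installation does not duplicate the lines. The following is only a minimal sketch: append_once is a hypothetical helper name, not part of SGE or of the diskless-image tools.

```shell
# Hypothetical helper: append LINE to FILE unless an identical line is already there.
append_once() {
    file=$1
    line=$2
    # -q: quiet, -x: match the whole line, -F: fixed string (the leading "." is not a regex)
    grep -qxF -- "$line" "$file" 2>/dev/null || printf '%s\n' "$line" >> "$file"
}

# Example (as root):
# append_once /etc/profile   '. /usr/local/sge/default/common/settings.sh'
# append_once /etc/csh.cshrc 'source /usr/local/sge/default/common/settings.csh'
```

Calling the helper a second time with the same arguments leaves the file unchanged.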

(All modifications are made both on the master and on the clients. TODO: Halva, which modifications does "all" refer to here? I know it should be easy, but I am still slightly confused.)

Add the following line to /etc/services:

sge_commd	536/tcp	# Sun Grid Engine
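
To confirm that the entry is in place on a given machine, a check along these lines may help. It is a sketch only: has_sge_commd is a hypothetical helper name, and the file argument exists so the check can also be pointed at the copy of /etc/services inside the diskless image.

```shell
# Hypothetical helper: succeed if the services file maps sge_commd to 536/tcp.
has_sge_commd() {
    services_file=${1:-/etc/services}
    # Strip trailing comments, then compare the first two whitespace-separated fields.
    awk '{ sub(/#.*/, "") }
         $1 == "sge_commd" && $2 == "536/tcp" { found = 1 }
         END { exit !found }' "$services_file"
}

# Example:
# has_sge_commd && echo "sge_commd is registered"
```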


Export the Sun Grid Engine directory. Modify

/etc/diskless-image/config.sh 

in order to add the correct entry to /etc/fstab on each client node. Search for the block that starts with "cat > /etc/fstab << EOF" and add the line below to it (TODO: the original instructions mention two lines, but I only see one!):

192.168.100.101:/usr/local/sge	/usr/local/sge/	nfs	rw,defaults	0	1

Add the same line to:

/usr/lib/diskless-image/template/etc/fstab

as well.

Rerun dpkg-reconfigure diskless-image-simple, then create the directory:

/usr/local/sge

in the diskless image.

Then run:

update-host-directories

Now on majorana:

cd /etc
ln -s . rc.d    # makes /etc/rc.d an alias for /etc
cd /usr/local/sge/
./install_qmaster

On the diskless image: copy rcsge to /etc/init.d/ and then run:

update-rc.d rcsge defaults 95
update-host-directories

On polyphemus: add an entry to /etc/fstab to mount r01:/usr/local/sge, then run:

cp /usr/local/sge/default/common/rcsge /etc/init.d; update-rc.d rcsge defaults 95
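
The fstab entry for polyphemus is not spelled out on this page. As a sketch only, it could be generated as follows. sge_fstab_entry is a hypothetical helper name, and the mount options and trailing "0 1" fields are an assumption modelled on the client-node entry, so check them before use.

```shell
# Hypothetical helper: print an fstab entry that mounts SERVER:/usr/local/sge.
# Assumption: the options (rw,defaults) and the "0 1" fields follow the client-node entry.
sge_fstab_entry() {
    server=$1
    printf '%s:/usr/local/sge\t/usr/local/sge\tnfs\trw,defaults\t0\t1\n' "$server"
}

# Example (as root, on polyphemus):
# sge_fstab_entry r01 >> /etc/fstab
```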

Note: SGE is on majorana, but the scheduler is still on polyphemus!