Difference: DLPool (1 vs. 7)

Revision 723 Feb 2010 - Main.DaveCable

Line: 1 to 1
 
META TOPICPARENT name="WebHome"

Daresbury Laboratory Condor Pool

Line: 56 to 56
 
  1. In /home/condor/condor-7.2.1 run "condor_install --prefix=/opt/condor-7.2.1"
  2. You may have to install libstdc++ or a compatibility layer if you dont already have it. On Gentoo "emerge libstdc++"
  3. cd to /opt and create a soft link by doing "ln -s condor-7.2.1 condor". This will be used in subsequent steps and facilitates easy updates. This is also done by the condor_setup_script mentioned below.
Changed:
<
<
  1. Edit /opt/condor/etc/condor_config to allow appropriate hosts e.g. "HOSTALLOW_READ = *" and "HOSTALLOW_WRITE = *" is suitable for a submit node inside a firewall. Everything else is over-ridden in condor_config.local (see below).
>
>
  1. Edit /opt/condor/etc/condor_config to allow appropriate hosts e.g. "HOSTALLOW_READ = *" and "HOSTALLOW_WRITE = *" is suitable for a submit node inside a firewall (note: in 7.4.1 it was found that these settings have been changed to ALLOW_READ and ALLOW_WRITE, but the syntax is the same). Everything else is over-ridden in condor_config.local (see below).
 
  1. You will need to create the init script. On Gentoo this is /etc/init.d/condor and is of SysV style. A sample can be found in /opt/condor/etc/examples/condor.boot which can be copied over and edited. It is however better to use John Kewley's modified script which has start, stop and restart options. It can be found here.
  2. You need a directory called /opt/condor/local.$(HOSTNAME). This should contain condor_config.local of which this is an example modified for tardis.dl.ac.uk on the DL pool.
  3. If you do not have an /etc/profile.d/condor.sh you need to create one, here is an example. Note, this should have been created by the condor_install command.

Revision 623 Jul 2009 - Main.RobAllan

Line: 1 to 1
 
META TOPICPARENT name="WebHome"

Daresbury Laboratory Condor Pool

Line: 50 to 50
  Further instructions on UW Madison Web site for 7.2.1.
Deleted:
<
<
  1. As root, download appropriate distro from Condor Web site into /usr/local/src. If the O/S is not matched, try the generic glibc32 tarball. On Gentoo the RHEL3 dynamic version seems to work. Here is the site for Stable release v7.2.1
  2. Unpack the tarball. This will create something like /usr/local/src/condor-7.2.1
 
  1. Setup up the condor user, on DL systems this is uid 14168 in group condor which is gid 14168. I've created a script to do this.
Changed:
<
<
  1. In /usr/local/src/condor-7.2.1 run "condor_install --prefix=/opt/condor-7.2.1"
>
>
  1. As root, download appropriate distro from Condor Web site into /home/condor or /usr/local/src. If the O/S is not matched, try the generic glibc32 tarball. On Gentoo the RHEL3 dynamic version seems to work. Here is the site for Stable release v7.2.1
  2. Unpack the tarball. This will create something like /home/condor/condor-7.2.1
  3. In /home/condor/condor-7.2.1 run "condor_install --prefix=/opt/condor-7.2.1"
 
  1. You may have to install libstdc++ or a compatibility layer if you dont already have it. On Gentoo "emerge libstdc++"
  2. cd to /opt and create a soft link by doing "ln -s condor-7.2.1 condor". This will be used in subsequent steps and facilitates easy updates. This is also done by the condor_setup_script mentioned below.
  3. Edit /opt/condor/etc/condor_config to allow appropriate hosts e.g. "HOSTALLOW_READ = *" and "HOSTALLOW_WRITE = *" is suitable for a submit node inside a firewall. Everything else is over-ridden in condor_config.local (see below).
Line: 99 to 99
  For more examples see http://www.cs.wisc.edu/condor/quick-start.html
Changed:
<
<

Information about Globus jobmanager-condor.pm

>
>

Information about Globus jobmanager-condor

cp /home/globus/Growl/packages/gt4.2.1-all-source-installer//source-trees/gram/jobmanager/setup/condor/condor.in /usr/local/globus-4.2.1/lib/perl/Globus/GRAM/JobManager/condor.pm

or wherever you have the installed job managers. Edit the BEGIN initialiser as follows:

# RJA done this config by hand
    $condorbin = '/opt/condor/bin';
    $condor_submit      = "$condorbin/condor_submit";
    $condor_rm          = "$condorbin/condor_rm";
    $condor_config      = '/opt/condor/etc/condor_config';

You now need to add some files:

globus*/etc/grid-services/jobmanager-condor

For additional information on this see http://www.cs.wisc.edu/condor/condor_globus.html

You can set the ephemeral port range used by the Condor jobmanager to the same as Globus by setting LOWPORT=64000 and HIGHPORT=65256 in the local config file.

Revision 522 Jul 2009 - Main.RobAllan

Line: 1 to 1
 
META TOPICPARENT name="WebHome"

Daresbury Laboratory Condor Pool

Line: 33 to 33
  Total 14 0 0 14 0 0 0
Changed:
<
<
Forthcoming changes include:
>
>
Recent changes include:
 
Added:
>
>
  • Move RMCS server to rmcs.dl.ac.uk in Grid DMZ
  • Insall Globus on rmcs
  • Add dl1.nw-grid.ac.uk head node

Forthcoming changes:

  • Install jobmanager-condor on dl1 and rmcs servers
 
  • Add machines in A27A.
  • Add machines in A27. 2x machines to add plus desktop systems
  • Add machines in A33. New desktop PC plus existing PC.
Line: 49 to 56
 
  1. In /usr/local/src/condor-7.2.1 run "condor_install --prefix=/opt/condor-7.2.1"
  2. You may have to install libstdc++ or a compatibility layer if you dont already have it. On Gentoo "emerge libstdc++"
  3. cd to /opt and create a soft link by doing "ln -s condor-7.2.1 condor". This will be used in subsequent steps and facilitates easy updates. This is also done by the condor_setup_script mentioned below.
Changed:
<
<
  1. Edit /opt/condor/etc/condof_config to allow appropriate hosts e.g. "HOSTALLOW_READ = *" and "HOSTALLOW_WRITE = *" is suitable for a submit node inside a firewall. Everything else is over-ridden in condor_config.local (see below).
>
>
  1. Edit /opt/condor/etc/condor_config to allow appropriate hosts e.g. "HOSTALLOW_READ = *" and "HOSTALLOW_WRITE = *" is suitable for a submit node inside a firewall. Everything else is over-ridden in condor_config.local (see below).
 
  1. You will need to create the init script. On Gentoo this is /etc/init.d/condor and is of SysV style. A sample can be found in /opt/condor/etc/examples/condor.boot which can be copied over and edited. It is however better to use John Kewley's modified script which has start, stop and restart options. It can be found here.
  2. You need a directory called /opt/condor/local.$(HOSTNAME). This should contain condor_config.local of which this is an example modified for tardis.dl.ac.uk on the DL pool.
  3. If you do not have an /etc/profile.d/condor.sh you need to create one, here is an example. Note, this should have been created by the condor_install command.
Line: 91 to 98
 Note: executable must be executable!

For more examples see http://www.cs.wisc.edu/condor/quick-start.html

Added:
>
>

Information about Globus jobmanager-condor.pm

Revision 412 Mar 2009 - Main.RobAllan

Line: 1 to 1
 
META TOPICPARENT name="WebHome"

Daresbury Laboratory Condor Pool

Line: 56 to 56
 
  1. "chown -R condor:condor /opt/condor/local.$(HOSTNAME) /opt/condor/etc/condor_config". I've created a script to copy the examples and do this part of the setup and it also does the soft link mentioned above.
  2. Do "/etc/init.d/condor start" and sit back. You can access the Web site at http://tardis.dl.ac.uk/Condor to check the status or do "condor_status -pool condor-main.dl.ac.uk"
  3. Don't forget to put ntpd on the machine too for any distributed computing activity.
Added:
>
>

Using and Testing

Need to source /opt/condor/condor.sh in user's environment somewhere. On some machines we set it in /etc/profile.

Some useful condor commands include:

condor_submit -- submits a job

condor_q -- see your own jobs

condor_q -g -- see all jobs

condor_rm -- kill a job

condor_q -better-analyze -- find out why a job isn't running

A sample job submission script which includes file transfer stuff and submits a linux job from a linux node is:

universe = vanilla
should_transfer_files = YES
when_to_transfer_output = ON_EXIT
requirements = ( Arch=="X86_64") && ( OpSys=="LINUX" )
executable = hostname.sh
output = host.out
error = host.err
log = host.log
notification = Error
queue

Note: executable must be executable!

For more examples see http://www.cs.wisc.edu/condor/quick-start.html

Revision 304 Mar 2009 - Main.RobAllan

Line: 1 to 1
 
META TOPICPARENT name="WebHome"

Daresbury Laboratory Condor Pool

This static information was last updated 3/3/2009. This pool is currently maintained by Rob Allan and Dave Cable.

Changed:
<
<
condor_status -pool condor-main.dl.ac.uk
>
>
/opt/condor/bin/condor_status
  Name OpSys? Arch State Activity LoadAv? Mem ActvtyTime?
Changed:
<
<
animal.dl.ac.uk LINUX INTEL Unclaimed Idle 0.040 2048 0+00:00:04 slot1@rjavig. LINUX INTEL Unclaimed Idle 0.000 1643 3+18:13:29 slot2@rjavig. LINUX INTEL Unclaimed Idle 0.000 1643 3+18:13:30 slot1@tardis.dl LINUX X86_64 Unclaimed Idle 1.000 1024 3+18:13:51 slot2@tardis.dl LINUX X86_64 Unclaimed Idle 5.050 1024 22+10:33:32 slot1@rjaport.dl LINUX INTEL Unclaimed Idle 1.000 1024 3+18:13:51 slot2@rjaport.dl LINUX INTEL Unclaimed Idle 5.050 1024 22+10:33:32 rjavig6.dl.ac WINNT51 INTEL Unclaimed Idle 1.120 2030 3+20:53:55
>
>
animal.dl.ac.uk LINUX INTEL Unclaimed Idle 0.000 2048 0+02:30:04 beaker.dl.ac.uk LINUX INTEL Unclaimed Idle 0.000 2048 0+01:05:04 bunsen.dl.ac.uk LINUX INTEL Unclaimed Idle 0.000 2048 0+01:15:04 fozzy.dl.ac.uk LINUX INTEL Unclaimed Idle 0.000 2048 0+01:10:04 kermit.dl.ac.uk LINUX INTEL Unclaimed Idle 0.000 2048 0+01:05:04 slot1@rjaport.dl.ac LINUX INTEL Unclaimed Idle 0.000 1643 0+01:10:04 slot2@rjaport.dl.ac LINUX INTEL Unclaimed Idle 0.000 1643 0+23:15:28 slot1@rjavig.dl.ac LINUX INTEL Unclaimed Idle 0.000 1643 0+01:10:04 slot2@rjavig.dl.ac LINUX INTEL Unclaimed Idle 0.000 1643 0+23:15:28 statler.dl.ac.uk LINUX INTEL Unclaimed Idle 0.240 2048 0+00:00:04 waldorf.dl.ac.uk LINUX INTEL Unclaimed Idle 0.010 2048 0+00:00:04 slot1@tardis.dl.ac LINUX X86_64 Unclaimed Idle 1.000 1024 0+02:40:04 slot2@tardis.dl.ac LINUX X86_64 Unclaimed Idle 3.420 1024 1+02:40:49 rjavig6.dl.ac.uk WINNT51 INTEL Unclaimed Idle 0.020 2030 3+22:01:52
  Total Owner Claimed Unclaimed Matched Preempting Backfill
Changed:
<
<
INTEL/LINUX 5 0 0 5 0 0 0
>
>
INTEL/LINUX 11 0 0 11 0 0 0
  INTEL/WINNT51 1 0 0 1 0 0 0 X86_64/LINUX 2 0 0 2 0 0 0
Changed:
<
<
Total 8 0 0 8 0 0 0
>
>
Total 14 0 0 14 0 0 0
 

Forthcoming changes include:

Line: 32 to 38
 
  • Add machines in A27A.
  • Add machines in A27. 2x machines to add plus desktop systems
  • Add machines in A33. New desktop PC plus existing PC.
Deleted:
<
<
  • Re-configure and add Muppets. This is an 8-node IBM Xeon cluster which is around 5 years old.
 

Installation Procedure on Linux Systems

Further instructions on UW Madison Web site for 7.2.1.

  1. As root, download appropriate distro from Condor Web site into /usr/local/src. If the O/S is not matched, try the generic glibc32 tarball. On Gentoo the RHEL3 dynamic version seems to work. Here is the site for Stable release v7.2.1
Changed:
<
<
  1. Unpack the tarball. This will create something like /usr/local/src/condor-7.0.1
  2. Setup up the condor user, on DL systems this is uid 14168 in group condor which is gid 14168
  3. In /usr/local/src/condor-7.0.1 run "condor_install --prefix /opt/condor-7.0.1"
>
>
  1. Unpack the tarball. This will create something like /usr/local/src/condor-7.2.1
  2. Setup up the condor user, on DL systems this is uid 14168 in group condor which is gid 14168. I've created a script to do this.
  3. In /usr/local/src/condor-7.2.1 run "condor_install --prefix=/opt/condor-7.2.1"
 
  1. You may have to install libstdc++ or a compatibility layer if you dont already have it. On Gentoo "emerge libstdc++"
Changed:
<
<
  1. cd to /opt and create a soft link by doing "ln -s condor-7.0.1 condor". This will be used in subsequent steps and facilitates easy updates.
  2. Edit the required settings in /opt/condor/etc/condor_config
  3. Also edit /opt/condor/etc/condof_config to allow appropriate hosts e.g. "HOSTALLOW_READ = *" and "HOSTALLOW_WRITE = *" is suitable for a submit node inside a firewall
>
>
  1. cd to /opt and create a soft link by doing "ln -s condor-7.2.1 condor". This will be used in subsequent steps and facilitates easy updates. This is also done by the condor_setup_script mentioned below.
  2. Edit /opt/condor/etc/condof_config to allow appropriate hosts e.g. "HOSTALLOW_READ = *" and "HOSTALLOW_WRITE = *" is suitable for a submit node inside a firewall. Everything else is over-ridden in condor_config.local (see below).
 
  1. You will need to create the init script. On Gentoo this is /etc/init.d/condor and is of SysV style. A sample can be found in /opt/condor/etc/examples/condor.boot which can be copied over and edited. It is however better to use John Kewley's modified script which has start, stop and restart options. It can be found here.
  2. You need a directory called /opt/condor/local.$(HOSTNAME). This should contain condor_config.local of which this is an example modified for tardis.dl.ac.uk on the DL pool.
Changed:
<
<
  1. If you do not have an /etc/profile.d/condor.sh you need to create one, here is an example
  2. "chown -R condor:condor /opt/condor/local.$(HOSTNAME) /opt/condor/etc/condor_config"
>
>
  1. If you do not have an /etc/profile.d/condor.sh you need to create one, here is an example. Note, this should have been created by the condor_install command.
  2. "chown -R condor:condor /opt/condor/local.$(HOSTNAME) /opt/condor/etc/condor_config". I've created a script to copy the examples and do this part of the setup and it also does the soft link mentioned above.
 
  1. Do "/etc/init.d/condor start" and sit back. You can access the Web site at http://tardis.dl.ac.uk/Condor to check the status or do "condor_status -pool condor-main.dl.ac.uk"
  2. Don't forget to put ntpd on the machine too for any distributed computing activity.

Revision 203 Mar 2009 - Main.RobAllan

Line: 1 to 1
 
META TOPICPARENT name="WebHome"

Daresbury Laboratory Condor Pool

Line: 9 to 9
  Name OpSys? Arch State Activity LoadAv? Mem ActvtyTime?
Added:
>
>
animal.dl.ac.uk LINUX INTEL Unclaimed Idle 0.040 2048 0+00:00:04
 slot1@rjavig. LINUX INTEL Unclaimed Idle 0.000 1643 3+18:13:29 slot2@rjavig. LINUX INTEL Unclaimed Idle 0.000 1643 3+18:13:30
Changed:
<
<
vm1@tardis.dl LINUX X86_64 Unclaimed Idle 1.000 1024 3+18:13:51 vm2@tardis.dl LINUX X86_64 Unclaimed Idle 5.050 1024 22+10:33:32 vm1@rjaport.dl LINUX INTEL Unclaimed Idle 1.000 1024 3+18:13:51 vm2@rjaport.dl LINUX INTEL Unclaimed Idle 5.050 1024 22+10:33:32
>
>
slot1@tardis.dl LINUX X86_64 Unclaimed Idle 1.000 1024 3+18:13:51 slot2@tardis.dl LINUX X86_64 Unclaimed Idle 5.050 1024 22+10:33:32 slot1@rjaport.dl LINUX INTEL Unclaimed Idle 1.000 1024 3+18:13:51 slot2@rjaport.dl LINUX INTEL Unclaimed Idle 5.050 1024 22+10:33:32
 rjavig6.dl.ac WINNT51 INTEL Unclaimed Idle 1.120 2030 3+20:53:55

Total Owner Claimed Unclaimed Matched Preempting Backfill

Changed:
<
<
INTEL/LINUX 4 0 0 4 0 0 0
>
>
INTEL/LINUX 5 0 0 5 0 0 0
  INTEL/WINNT51 1 0 0 1 0 0 0 X86_64/LINUX 2 0 0 2 0 0 0
Changed:
<
<
Total 7 0 0 7 0 0 0
>
>
Total 8 0 0 8 0 0 0
 

Forthcoming changes include:

Line: 41 to 42
 
  1. Unpack the tarball. This will create something like /usr/local/src/condor-7.0.1
  2. Setup up the condor user, on DL systems this is uid 14168 in group condor which is gid 14168
  3. In /usr/local/src/condor-7.0.1 run "condor_install --prefix /opt/condor-7.0.1"
Added:
>
>
  1. You may have to install libstdc++ or a compatibility layer if you dont already have it. On Gentoo "emerge libstdc++"
 
  1. cd to /opt and create a soft link by doing "ln -s condor-7.0.1 condor". This will be used in subsequent steps and facilitates easy updates.
  2. Edit the required settings in /opt/condor/etc/condor_config
  3. Also edit /opt/condor/etc/condof_config to allow appropriate hosts e.g. "HOSTALLOW_READ = *" and "HOSTALLOW_WRITE = *" is suitable for a submit node inside a firewall
  4. You will need to create the init script. On Gentoo this is /etc/init.d/condor and is of SysV style. A sample can be found in /opt/condor/etc/examples/condor.boot which can be copied over and edited. It is however better to use John Kewley's modified script which has start, stop and restart options. It can be found here.
  5. You need a directory called /opt/condor/local.$(HOSTNAME). This should contain condor_config.local of which this is an example modified for tardis.dl.ac.uk on the DL pool.
Changed:
<
<
  1. "chown -R condor:condor /opt/local.$(HOSTNAME) /etc/condor_config"
>
>
  1. If you do not have an /etc/profile.d/condor.sh you need to create one, here is an example
  2. "chown -R condor:condor /opt/condor/local.$(HOSTNAME) /opt/condor/etc/condor_config"
 
  1. Do "/etc/init.d/condor start" and sit back. You can access the Web site at http://tardis.dl.ac.uk/Condor to check the status or do "condor_status -pool condor-main.dl.ac.uk"
Added:
>
>
  1. Don't forget to put ntpd on the machine too for any distributed computing activity.

Revision 103 Mar 2009 - Main.RobAllan

Line: 1 to 1
Added:
>
>
META TOPICPARENT name="WebHome"

Daresbury Laboratory Condor Pool

This static information was last updated 3/3/2009. This pool is currently maintained by Rob Allan and Dave Cable.

condor_status -pool condor-main.dl.ac.uk

Name          OpSys       Arch   State      Activity   LoadAv Mem   ActvtyTime

slot1@rjavig. LINUX       INTEL  Unclaimed  Idle       0.000  1643  3+18:13:29
slot2@rjavig. LINUX       INTEL  Unclaimed  Idle       0.000  1643  3+18:13:30
vm1@tardis.dl LINUX       X86_64 Unclaimed  Idle       1.000  1024  3+18:13:51
vm2@tardis.dl LINUX       X86_64 Unclaimed  Idle       5.050  1024 22+10:33:32
vm1@rjaport.dl LINUX       INTEL Unclaimed  Idle       1.000  1024  3+18:13:51
vm2@rjaport.dl LINUX       INTEL Unclaimed  Idle       5.050  1024 22+10:33:32
rjavig6.dl.ac WINNT51     INTEL  Unclaimed  Idle       1.120  2030  3+20:53:55

                     Total Owner Claimed Unclaimed Matched Preempting Backfill

         INTEL/LINUX     4     0       0         4       0          0        0
       INTEL/WINNT51     1     0       0         1       0          0        0
        X86_64/LINUX     2     0       0         2       0          0        0

               Total     7     0       0         7       0          0        0

Forthcoming changes include:

  • Add machines in A27A.
  • Add machines in A27. 2x machines to add plus desktop systems
  • Add machines in A33. New desktop PC plus existing PC.
  • Re-configure and add Muppets. This is an 8-node IBM Xeon cluster which is around 5 years old.

Installation Procedure on Linux Systems

Further instructions on UW Madison Web site for 7.2.1.

  1. As root, download appropriate distro from Condor Web site into /usr/local/src. If the O/S is not matched, try the generic glibc32 tarball. On Gentoo the RHEL3 dynamic version seems to work. Here is the site for Stable release v7.2.1
  2. Unpack the tarball. This will create something like /usr/local/src/condor-7.0.1
  3. Setup up the condor user, on DL systems this is uid 14168 in group condor which is gid 14168
  4. In /usr/local/src/condor-7.0.1 run "condor_install --prefix /opt/condor-7.0.1"
  5. cd to /opt and create a soft link by doing "ln -s condor-7.0.1 condor". This will be used in subsequent steps and facilitates easy updates.
  6. Edit the required settings in /opt/condor/etc/condor_config
  7. Also edit /opt/condor/etc/condof_config to allow appropriate hosts e.g. "HOSTALLOW_READ = *" and "HOSTALLOW_WRITE = *" is suitable for a submit node inside a firewall
  8. You will need to create the init script. On Gentoo this is /etc/init.d/condor and is of SysV style. A sample can be found in /opt/condor/etc/examples/condor.boot which can be copied over and edited. It is however better to use John Kewley's modified script which has start, stop and restart options. It can be found here.
  9. You need a directory called /opt/condor/local.$(HOSTNAME). This should contain condor_config.local of which this is an example modified for tardis.dl.ac.uk on the DL pool.
  10. "chown -R condor:condor /opt/local.$(HOSTNAME) /etc/condor_config"
  11. Do "/etc/init.d/condor start" and sit back. You can access the Web site at http://tardis.dl.ac.uk/Condor to check the status or do "condor_status -pool condor-main.dl.ac.uk"
 
This site is powered by the TWiki collaboration platformCopyright © by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback