GlideinWMS The Glidein-based Workflow Management System

User Pool

User Pool Installation

1. Description

The User Pool will be the HTCondor Central Manager for the glidein pool, i.e. it will run the HTCondor Collector and Negotiator daemons. All glideins submitted by the Factories connected to this User Pool will report to it and be available to match with the jobs submitted to the Schedds (User submit nodes) connected to it. These daemons define the glidein pool; if this node dies, the pool dies with it.

2. Hardware requirements

CPUs Memory Disk
1 - 2 1+GB 5+GB

This machine needs one or two fast CPUs and a moderate amount of memory (1GB should be enough for most tasks; really big pools may need more).

It must have reliable network connectivity and must be on the public internet, with no firewalls; all worker nodes will be continuously sending UDP packets to the Collector.

The machine must be very stable; if the Collector dies, the glidein pool dies with it (There are Condor techniques to minimize this damage, but you should still try to choose the stablest machine you can afford.)

The disk needed is just for Condor binaries and log files (5GB should be enough)

3. Needed software

See the prerequisites page for a list of software requirements.

4. Installation instructions

The User Pool can be installed either as root or as a non privileged user. Either case, make sure that the user has access to the needed GSI credentials. There is no real advantage to install as root, so non-privileged installation is recommended if installed separately.

See below for a description of the ini file attributes:

Attribute Example Description Comments
install_type tarball or rpm If this is a VOFrontend RPM installation and you are doing a '--configure', then rpm should be specified.
If this is a stand-alone User Collector install, only tarball installations are supported.
Valid values: tarball, rpm.
hostname usercollector.domain.name hostname for User Collector.
username collector (non-root account) UNIX user account that this services will run under. DO NOT use "root". For security purposes, this value should always be a non-root user.
service_name userpool Used as the 'nickname' for the GSI DN in the condor_mapfile of other services.
condor_location /path/to/condor-userpool Directory in which the condor software will be installed. IMPORTANT: The User Collector can share the same instance of Condor as the Frontend. The condor_location must not be a subdirectory of the Frontend's install_location or logs_dir. They may share the same parent, however.
collector_port 9618 (Condor default) Defines the Condor Collector port. Optional: default is 9618 (Condor default)
If multiple glidein services are installed on the same node, this should be unique for each service.
number_of_secondary_collectors 5 The desired number of secondary collectors to be used. Optional: default is 0 (zero)
A rough estimate is to use one collector per 100 glideins with a hard limit on 200 glideins per collector.
x509_cert_dir /path/to/certificates-location The directory where the CA certificates are maintained. The installer will validate for the precesence of *.0 and *.r0 files. If the CAs are installed from the VDT distribution, this will be the VDT_LOCATION/globus/TRUSTED_CA directory.
x509_cert /path-to-cert-location/cert.pm The location of the certificate file being used. This file must be owned by the user installing (starting/stopping) this service. Permissions should be 644 or 600.
x509_key /path-to-cert-location/key.pm The location of the certificate key file being used and associated with the certtificate defined by the x509_cert option above. This file must be owned by the user installing (starting/stopping) this service. Permissions should be 600 or 400.
x509_gsi_dn dn-subject-of-x509_cert-using-openssl This is the identity of the certificate used by this service to contact the other Condor based glideinWMS services. This is the subject of the certificate (x509_cert option).
openssl x509 -subject -noout -in [x509_cert]
It is used to populate the condor_config file GSI_DAEMON_NAME and condor_mapfile entries of this and the other glideinWMS services as needed.
condor_tarball /path/to/condor/tarballs/condor-7.5.0-linux-x86-rhel3-dynamic.tar.gz Location of the condor tarball. The installation script will perform the installation of condor using this tarball. It must be a zipped tarball with a *.tg.tz name.
condor_admin_email whomever@email.com The email address to get Condor notifications in the event of a problem. Used in the condor_config.local only.
vdt_location /path/to/glidein/vdt The location of the OSG/VDT client software. The installer looks for the existence of 2 files to verify if this is a valid OSG/VDT client installation:
1. setup.sh
2. existence of a voms-proxy-init executable.

glideinwms_location /path/to/glideinWMS Directory of the glideinWMS software. Since this is a Condor service only, this software is only used during the installation process.

For example configuration files, see here.

The script to manage the glidein service components is:

glideinWMS/install/manage_glideins

It is recommended to run the validation step before installing. This will help you debug any issues so the install will finish.

./manage_glideins --validate usercollector --ini /path/to/glideinWMS.ini

To install the user pool collector:

./manage_glideins --install usercollector --ini /path/to/glideinWMS.ini

5. To Start/Stop Pool Collector

Setup the environment

source /path/to/condor/location/condor.sh

To start Condor run:

/path/to/condor/location/condor start

You should see three processes run as user condor: condor_master, condor_collector and condor_negotiator.

The log files can be found in

/path/to/condor/location/condor_local/log/

To stop Condor run:

/path/to/condor/location/condor stop