Documentation

 Documentation for IGI Site Managers

Topics

INFN Grid (IG) middleware release

  • Guidelines are documented on the wiki.

gLite middleware release

Specific IG middleware components

  • StoRM is the recommended middleware for Storage Resource Management in IGI. StoRM implements the SRM v.2.2 standard and can be used for the management of both disk and tape resources. StoRM development is supported by the INFN Grid project. [Read more]
  • DGAS is the Distributed Grid Accounting System of choice in IGI. DGAS provides the functionality needed to implement a complete stand-alone infrastructure for computing and storage accounting of national Grids and Grid Virtual Organizations. It provides sensors for many different Local Resource Management Systems and repositories for persistent storage of usage records at site level or national/regional level, and it comes with HLRmon, a solution for user-friendly display of accounting data. DGAS and HLRmon are supported by the INFN Grid and EGEE projects. [Read more]
  • GLUE Schema Specification, version 1.3 Final, Jan 16 2007
  • GridFTP: GT 4.2.1 GridFTP System Administrator's Guide

MPI Configuration

Guidelines are documented here.

Site availability and reliability

Certified sites that are part of the production infrastructure are requested to provide a minimum monthly Availability (70%) and a minimum monthly Reliability (75%). Availability and Reliability are computed monthly through the Service Availability Monitoring framework. Monthly statistics are published at the following URL. Sites that do not meet these minimum performance figures are requested to report on the problems encountered to the European Grid Operations Coordination bodies.
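As a rough illustration of how a site manager might check a month against these thresholds, the sketch below assumes the commonly used definitions: availability is the time up divided by the time for which the status is known, and reliability additionally excludes scheduled downtime from the denominator. The exact algorithm used by the Service Availability Monitoring framework is not described here, so treat the formulas, the function names, and the input figures as assumptions.

```python
# Minimum monthly targets quoted in the text above.
MIN_AVAILABILITY = 0.70
MIN_RELIABILITY = 0.75


def availability(up_hours, total_hours, unknown_hours=0.0):
    """Assumed definition: time up / time with a known status."""
    known = total_hours - unknown_hours
    return up_hours / known if known > 0 else 0.0


def reliability(up_hours, total_hours, scheduled_down_hours=0.0, unknown_hours=0.0):
    """Assumed definition: as availability, but scheduled downtime is excluded."""
    expected = total_hours - unknown_hours - scheduled_down_hours
    return up_hours / expected if expected > 0 else 0.0


def meets_targets(up_hours, total_hours, scheduled_down_hours=0.0, unknown_hours=0.0):
    """Return True if both monthly minimums are reached."""
    a = availability(up_hours, total_hours, unknown_hours)
    r = reliability(up_hours, total_hours, scheduled_down_hours, unknown_hours)
    return a >= MIN_AVAILABILITY and r >= MIN_RELIABILITY


if __name__ == "__main__":
    # Hypothetical 30-day month (720 h): 540 h up, 36 h of scheduled downtime.
    print(availability(540, 720))
    print(reliability(540, 720, scheduled_down_hours=36))
    print(meets_targets(540, 720, scheduled_down_hours=36))
```

Under these assumed definitions, scheduled downtime lowers availability but not reliability, so a site can miss the availability target while still meeting the reliability one.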

EGI documentation

  • Manuals: The EGI Operations Manuals are technical prescriptive documents that provide guidelines on how to accomplish a given task. These documents are periodically reviewed and need to be followed by all partners (as opposed to best-practice documents, which provide optional guidelines). Contact: operational-documentation-manuals[at]mailman.egi.eu
  • Best Practices: The EGI Operations Best Practices are technical documents providing reference information on how technical operational tasks can be addressed. A partner can adopt a best practice if it deems it suitable; the guidelines provided are optional. Nevertheless, a best practice is periodically reviewed and, if relevant to grid middleware deployment, it is preliminarily approved by the relevant Technology Provider. A Best Practice is contributed on a voluntary basis by EGI partners to share knowledge and experience. Contact: operational-documentation-best-practices[at]mailman.egi.eu
  • Procedures: EGI Operational Procedures are prescriptive documents that describe a step-by-step process requiring action from two or more partners. The purpose of a procedure is to define the related workflow. Procedures are approved by the OMB and are periodically reviewed. The instructions provided by a procedure are mandatory. Applicable areas:
    • Ticket management
    • Operations Center Management
    • Resource Centre Management
    • Availability and monitoring
    • Security Incident Handling
    • Vulnerability Issue Handling
  • FAQs: Frequently Asked Questions are question/answer documents providing information on specific technical areas. A FAQ provides a high-level overview of the issue and a collection of links for further investigation. Contact: operational-documentation[at]mailman.egi.eu
  • Training guides: A collection of HowTos and Training Guides relevant to Operations.

EGEE-III operational documentation (OLD)

Tools for the site manager

  • a comprehensive list of operational tools for site management (broadcast tool, downtime management, SAM portal, ticketing system etc.)
  • demo on the WMS Monitor tool
  • LRMSinfo: a tool for computing the declared installed capacity of a given site and more (number of declared and online slots, number of online cores, type of processors installed in the farm, and other information)