HPC DevOps Engineer

Updated: about 1 month ago
Location: Ireland,
Deadline: 29 Jun 2021

Applications are invited from suitably qualified candidates for a full-time fixed term position as a Research Associate with the Irish Centre for High End Computing (ICHEC) at the National University of Ireland, Galway. This position is available from June 2021 for 15 months and based at our offices in Dublin or Galway.

Irish Centre for High-End Computing (ICHEC)

ICHEC is Ireland’s national centre for High-Performance Computing (HPC) providing digital infrastructure capabilities and expertise through R&D engagements and skills development programmes to academia, industry and public sector organisations.
With a highly ambitious leading-edge Strategy for 2021-2025, ICHEC provides infrastructure services and expertise in HPC and data intelligence to develop efficient platforms, solutions and services based on technologies including AI, high performance data analytics, Earth Observation, quantum computing and cybersecurity across a number of sectors including environmental sciences, healthcare, agriculture, energy, financial services and ICT.
ICHEC works in close partnership with a number of national and international researchers, enterprises and public authorities for joint R&D, skills development, and provisioning HPC and data services to accelerate their digital transformation and green transition.
For more details reach out to our Centre Infrastructure Manager, Mr. Niall Wilson or visit www.ichec.ie .

Job Description:

The HPC DevOps Engineer will be an integral member of the Infrastructure Programme at ICHEC which focuses on research and development and maintaining a diverse set of services and platforms across a number of computer architectures, storage systems and public cloud services.
The Infrastructure Programme works in tandem with all other Activities at the Centre including AI & Edge Computing, Big Data & Analytics, Environmental Sciences, Performance Engineering, Quantum Computing, and Training & Education.

Duties: The successful candidate will

  • be responsible for developing the technical expertise and working on projects across a number of domains including data science, machine learning, HPC systems and applications;
  • leverage good understanding and experience in the area of HPC platform and system software, along with tools for HPC and data-driven solutions development, for their efficient deployment on different compute/data platforms and environments;
  • have the opportunity to work in research projects across all Activities to develop technical solutions with national and international partners in academia, industry and public sector;
  • assist the Centre in identifying and seeking new opportunities, partnerships and projects;
  • disseminate the outcomes of the projects through reports, research publications, press releases and presentations at events;
  • prepare and deliver training courses to academic researchers, industry and public sector audiences.
    This is a role for a highly motivated problem-solver, with a creative and analytical mind, who is excited to work hands-on for solutions research and development as well as deployment on existing and emerging HPC technologies and systems.

Qualifications/Skills required:

Essential Requirements:
  • A Bachelor's Degree with experience of at least 4 years in Computer Science/Engineering, or a related discipline.
  • Significant experience with the Linux OS environment and with developing and deploying applications on HPC cluster and cloud computing platforms (such as OpenStack, AWS, Microsoft Azure) using CI/CD workflow tools.
  • Experience with configuration management tools (Saltstack, Ansible, Chef, or similar).
  • Experience deploying monitoring and alerting tools for services and applications (Nagios, Grafana, Graphite, Prometheus, or similar).
  • Experience with virtualisation and containerisation of software stacks and applications (such as Kubernetes, Docker, Singularity, or similar).
  • Working experience with languages such as C, C++, Python, R, Ruby, or similar.
  • Ability to work in a multi-disciplinary team with academic, research and industry partners, with excellent communication and organisational skills.

  • Desirable Requirements:
  • Experience managing and deploying applications on Microsoft Windows Server platform.
  • Building, optimising, testing and deploying machine learning and data science solutions on HPC cluster and/or cloud computing platforms.
  • Working with HPC compute and storage systems, lower-level software and system administration tools and batch schedulers.
  • Exposure to Agile/PRINCE2 project management frameworks.
  • Salary: €44,659 to €50,031 per annum pro rata for shorter and/or part-time contracts (public sector pay policy rules pertaining to new entrants will apply).

    Start date: Position is available from June 2021 for 15 months, renewable based on funding availability.

    Further information on research and working at NUI Galway is available on Research at NUI Galway Researchers at NUI Galway are encouraged to avail of a range of training and development opportunities designed to support their personal career development plans. NUI Galway provides continuing professional development supports for all researchers seeking to build their own career pathways either within or beyond academia. Researchers are encouraged to engage with our Researcher Development Centre (RDC) upon commencing employment - see www.nuigalway.ie/rdc for further information.

    For information on moving to Ireland please see www.euraxess.ie

    Further information about ICHEC is available at www.ichec.ie
    Informal enquiries concerning the post may be made to recruit@ichec.ie

    To Apply:

    Applications to include a 1-page covering letter summarising your suitability and motivation for the role, CV (max. 3 pages), and the contact details of three referees should be sent, via e-mail (in a single PDF only) to recruit@ichec.ie
    Please put reference number NUIG RES 105-21 in subject line of e-mail application.

    View or Apply

    Similar Positions