Cloud Site Reliability Engineer

Job Summary

  • Dublin
  • Contract
  • BBBH764604
  • Aug 13, 2020
  • Competitive
Job Description

Cloud Site Reliability Engineer - Telecoms - Dublin

Job Purpose:

Our vision for our future Cloud SRE will see them focusing on several Internet, OpenStack, network and public cloud components. To further support our customers in Europe, we are expanding our footprint in Ireland. This is an opportunity for you to build and apply knowledge of the Internet, ultra-large-scale distributed systems, AI, image processing and streaming media, and eventually to become an expert in these areas.

Are you a highly analytical thinker and do you enjoy working in a fast-paced environment?

You will need to be a confident communicator, as this role will see you working with geo-dispersed, cross-functional teams across different disciplines. If you like working with the latest cutting-edge technology and enjoy fast-paced, challenging work, then you should read on…

Key Responsibilities:

  • Operate the cloud infrastructure and engineer improvements to make operation more efficient through automation and monitoring. Take responsibility for availability and SLOs, and respond to incidents.
  • Collaborate with our team of SRE architects on public cloud operation and maintenance, and make regular business trips to other cities and countries.
  • Design, code and use our automation platform and standard open-source tools to achieve efficient, safe and reliable operation of a large-scale public cloud.
  • Design and create distributed systems to manage the data center networks of multiple regions, including switches, firewalls and routers.
  • Improve monitoring and associated tools for better incident detection and noise reduction.
  • Provide support and guidance to the Cloud Service Center operations teams in using and maintaining the deployed monitoring tools.
  • Identify opportunities for new or enhanced automation and monitoring approaches to support continuous improvement of SLIs and efficiency, and produce designs and plans to address those opportunities.

Experience Required:

  • Linux administration and troubleshooting.
  • Scripting in any popular Linux scripting language: Perl, Python, or Ruby.
  • Shell scripting basics.
  • Over 2 years' experience in operating large computer systems.

In addition, 2 years of experience in one of the following is required:

  • Use and operation of SQL databases, MySQL or PostgreSQL preferred. A good level of SQL is required; this isn't just about running the database engine. Familiarity with SQL engine tuning and operation (backups, optimizations, redundancy, monitoring).
  • Networking, load balancing, and redundancy technologies: TCP/IP, DNS, load balancers (LVS and nginx preferred), reverse proxies, etc.
  • Using and operating at least 5 tools widely used in modern distributed systems, for example OpenStack (preferred), Kafka, RabbitMQ, etcd, ZooKeeper, Kubernetes, Docker, containerd, Redis, Hadoop, Prometheus, OpenCensus, Zipkin, Zabbix, Elasticsearch, Logstash, Kibana, Spinnaker.
  • Software development in Go, Java or C++.

Morgan McKinley is acting as an Employment Agency and references to pay rates are indicative.

BY APPLYING FOR THIS ROLE YOU ARE AGREEING TO OUR TERMS OF SERVICE WHICH TOGETHER WITH OUR PRIVACY STATEMENT GOVERN YOUR USE OF MORGAN MCKINLEY SERVICES.

Consultant Details

Fiona Durkin
  • Talent Acquisition Specialist
  • 01 6324650
  • fdurkin@morganmckinley.com