Principal Site Reliability Engineer

  • Location
    Galway, Galway
  • Category
    IT and Telecoms - Other IT & Telecoms
  • Job type
    Permanent
  • External Reference
    SRE

Our client is seeking a passionate and experienced Lead Site Reliability Engineer to join their dynamic Site Reliability Engineering group within Enterprise Infrastructure! If you thrive in an environment that combines Operations Excellence with Development Experience, this is the opportunity for you!

This is a permanent role in Galway with amazing benefits and career progression opportunities!

In this role, you will:

Lead efforts to define and execute a comprehensive reliability and observability strategy, ensuring our systems are always available for our customers.
Troubleshoot stack-wide engineering issues across hardware, software, network, applications, and cloud service providers.
Coach and mentor peer SREs and development teams on building highly available systems.
Be an escalation point during major incidents, taking hands-on responsibility to lead production bridges across teams.
Conduct thorough post-mortem reviews, focusing on deep technical root cause analysis, observability, and automation enhancements.

Skills required:

A bachelor's degree (or higher) in a technology-related field (e.g., Engineering, Computer Science); a master's degree is a plus.
Extensive hands-on experience deploying and supporting highly distributed multi-tiered systems at scale.
Practical experience with public cloud platforms, preferably AWS or Azure.
Proficiency with EKS, AKS, or Rancher Kubernetes Service for container orchestration.
Experience with distributed architectures, including microservices, containerized services, and serverless architectures.
Strong hands-on Kubernetes skills.
Programming experience in compiled/OOP languages (e.g., C#, Java) and scripting languages (e.g., JavaScript/TypeScript, Python).
Proven ability to maintain scalability and resiliency in complex environments.
Familiarity with modern monitoring tools (e.g., Datadog, Prometheus, Splunk).
Technical and operational leadership with the ability to handle production incidents effectively.

Be part of a vibrant team that values collaboration and continuous improvement.
Work in an environment where your contributions directly impact the reliability of critical systems.
Enjoy opportunities for professional growth and development in a supportive atmosphere.
If you're excited about driving reliability and resilience in high-scale environments while working alongside talented professionals, we want to hear from you!

Apply today and embark on an exciting journey with us!

Adecco is a disability-confident employer. It is important to us that we run an inclusive and accessible recruitment process to support candidates of all backgrounds and all abilities to apply. Adecco is committed to building a supportive environment for you to explore the next steps in your career. If you require reasonable adjustments at any stage, please let us know and we will be happy to support you.

Adecco Ireland is acting as an Employment Agency in relation to this vacancy.

Please apply with your CV to: Natalia Merritt