Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Senior Site Reliability Engineer

AT IBM
IBM

Senior Site Reliability Engineer

Krakow, Poland

Introduction
At IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to lead in this new era of technology and solve some of the world's most challenging problems? If so, lets talk.
Curiosity and courageous thinking are both vital when working in IBM, as we continue our dedication in guaranteeing that we are at the forefront of cloud technology. Our renowned legacy means we are leading the way in everything from analytics and security through to unmatched hardware & software designs. We provide our clients with the full end-to-end transformation as we build IBM's next generation cloud platform which is focused around delivering performance and predictability at a global scale.

Want more jobs like this?

Get Software Engineering jobs in Krakow, Poland delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.


Your Role and Responsibilities
As a Site Reliability Engineer, you will play a crucial role in supporting, maintaining, and operationally improving the cloud infrastructure. Working closely with various teams, your focus will be on ensuring the health and reliability of production and test systems. Your proactive approach will be essential in responding promptly to issues and alerts, contributing to the development of new capabilities, and collaborating with other SRE teams and program managers to deliver mission-critical services to the market.

Key Duties:
  • Platform Engineering: Participate in development and maintenance of large-scale Internal Developer Platform (IDP) based on Kubernetes
  • Collaborative Partnership: Partner with development teams and program managers, contributing to the seamless delivery of mission-critical services to the market.
  • Automation Execution: Execute changes in the production environment through automation, ensuring efficiency and minimizing downtime.
  • Cross-Functional Troubleshooting: Collaborate with engineering teams to provide initial assessments and possible workarounds for production issues. Troubleshoot and resolve production issues effectively.
  • Integration Planning: Work with support and development teams to identify and resolve issues. Discuss and plan integration tasks to enhance overall system performance.
  • Rapid Issue Response: Respond promptly to production issues and alerts, providing swift resolution and maintaining system availability.
  • Integration Planning: Work with support and development teams to identify and resolve issues. Discuss and plan integration tasks to enhance overall system performance.

Required Technical and Professional Expertise

  • Proven Experience: Expertise in large-scale, distributed Linux/Unix environments and container orchestration technologies like Docker, Kubernetes, and Helm.
  • System Monitoring and Troubleshooting: Strong experience with Observability tools (e.g., Prometheus, Grafana, DataDog) to ensure optimal system performance and uptime.
  • Automation Proficiency: Proficiency in using declarative infrastructure tools like Terraform and CloudFormation to automate production workflows efficiently.
  • Collaborative Mindset: Ability to work collaboratively across teams while adopting GitOps and CI/CD principles for seamless operations.
  • Effective Communication Skills: Excellent communication and mentoring skills, fostering knowledge sharing and team development in English.

Preferred Technical and Professional Expertise

  • Collaborative Mindset: Proven ability to work effectively with team members in a primarily remote environment, fostering strong connections and shared success.
  • Proven Experience: Demonstrated expertise in developing Kubernetes operators, ensuring seamless integration and functionality in containerized environments.

Client-provided location(s): Kraków, Poland
Job ID: IBM-20666914
Employment Type: Full Time

Company Videos

Hear directly from employees about what it is like to work at IBM.