Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Site Reliability Engineering Manager - Shift

AT IBM
IBM

Site Reliability Engineering Manager - Shift

Dublin, Ireland

Introduction
At IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to lead in this new era of technology and solve some of the world's most challenging problems? If so, lets talk.

Your Role and Responsibilities
The shift toward the consumption of IT as a service, i.e., the cloud, is one of the most important changes to happen to our industry in decades. At IBM, we are driven to shift our technology to an as-a-service model and to help our clients transform themselves to take full advantage of the cloud. With industry leadership in analytics, security, commerce, and cognitive computing and with unmatched hardware and software design and industrial research capabilities, no other company is as well positioned to address the full opportunity of cloud computing.

Want more jobs like this?

Get jobs in Dublin, Ireland delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.


We are looking for a dynamic, curious, high-potential, deep technical leader who desires a broadening assignment with the opportunity to deliver substantial business value to IBM. A leader, who innovates & shares our passion for winning in the cloud marketplace. The IaaS Operations is a team dedicated to ensuring that the IBM Cloud is at the forefront of cloud technology, from data center design to network architecture to storage and compute clusters to flexible infrastructure services.
The ideal candidate should be strong in 'getting things done', have an entrepreneurial spirit, communicates well, have a great deal of energy, and enjoy working as part of a global collaborative team. Candidates for this position will need strong Delivery and Execution skills, to help projects at all phases overcome technical obstacles.
In this role, you will be responsible for setting the direction for operations, to deliver value to our clients in a fast-changing cloud landscape. The SRE team is dedicated to ensuring that the IBM Cloud is at the forefront of cloud technology, from Storage & Network architecture and compute clusters to flexible infrastructure services. We are building IBM's next generation cloud platform to deliver performance and predictability for our customers' most demanding workloads. This role will be a shift role - Monday - Friday 4pm to 12.30 am.

  • Hire and develop high performing technical talent with a particular focus on delivering operations and SRE solutions.
  • During Incidents to lead multiple service teams to a fast resolution and return to BAU for customers.
  • Manage Site Reliability Engineers including team's day to day operation, all quarterly reviews, evaluations, and career development.
  • Drive the team to establish comprehensive development plans and innovative solutions to problems and challenges that meet desired outcomes.
  • Allocate and balance resources across multiple platforms to meet needs of business priorities
  • Provide management oversight to several activities running in parallel, address issues/concerns with speed, and enable coarse corrective actions.
  • Report on operating status to program stakeholders as needed
  • Performs other duties as required
  • Develop, implement, and monitor day-to-day operational systems and processes
  • Enhance all existing monitoring solutions and implement robust monitoring for all platforms globally
  • Analyze current operational processes and performance, recommending solutions for improvement where necessary
  • Managing critical customer issues, this requires on going communication with Service SRE, development, and customer support teams
  • Lead the team with integrity and to establish and maintain a trusting, inclusive, and productive environment

Required Technical and Professional Expertise
.
  • Exposure to team leadership and operational excellence
  • Technical Skills - Good hold on Cloud technologies in Networking, Storage and Compute.
  • Experience with managing team of team size 15-20 with varied skills like SREs and developers,
  • Expert in Agile and Scrum Methodology.
  • Excellent leadership and management skills with emphasis on mentoring, motivating, and driving a large team to success.
  • Managing critical customer issues, this requires on going communication with Service SRE, development, and customer support teams
  • Experience using Splunk and or other dashboards
  • Understanding of web technologies and technology stack
  • Working knowledge with Network and Storage technologies
  • Working knowledge with ServiceNow, JIRA, Confluence, and GitHub

Preferred Technical and Professional Expertise

  • You love collaborative environments that use agile methodologies to encourage creative design thinking and find innovative ways to develop with cutting edge technologies
  • Ambitious individual who can work under their own direction towards agreed targets/goals and with creative approach to work
  • Intuitive individual with an ability to manage change and proven time management
  • Proven interpersonal skills while contributing to team effort by accomplishing related results as needed
  • Up-to-date technical knowledge by attending educational workshops, reviewing publications
  • Working knowledge & experience with Networking /Storage/ Databases in the Cloud

Client-provided location(s): Coolmine, Mulhuddart, Co. Dublin, Ireland
Job ID: IBM-20872868
Employment Type: Full Time

Company Videos

Hear directly from employees about what it is like to work at IBM.