Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Intern Conversion - Site Reliability Engineering

AT IBM
IBM

Intern Conversion - Site Reliability Engineering

Markham, Canada

Introduction
A career in means you'll be part of a team that transforms our customer's challenges into solutions.
Seeking new possibilities and always staying curious, we are a team dedicated to creating the world's leading AI-powered, -native solutions for our customers. Our renowned legacy creates endless global opportunities for our IBMers, so the door is always open for those who want to grow their career.
IBM's product and technology landscape includes Research, , and Infrastructure. Entering this domain positions you at the heart of , where growth and innovation thrive.

Your Role and Responsibilities
Start dates for this position are in 2025

As a Site Reliability Engineer, you will work in an agile, collaborative environment to build, deploy, configure, and maintain systems for the IBM client business. In this role, you will lead the problem resolution process for our clients, from analysis and troubleshooting, to deploying workarounds or fixes.

Want more jobs like this?

Get jobs in Markham, Canada delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.

Your primary responsibilities include:
  • Deployment and Configuration: The process of installing Couchbase Enterprise software and configuring buckets for the offerings.
  • Cluster Management and Troubleshooting: Activities involved in troubleshooting issues with clusters and determining appropriate hardware configuration for them.
  • Security and Compliance Implementation: Implementing security measures such as setting up certificates and ensuring compliance with ITCS-104, ISO, SOC standards, and associated regulations.
  • Maintenance and Support: Tasks related to applying Couchbase security patches and upgrades, supporting Cassandra and Mongo for pager duty rotation, and collaborating with Couchbase Product support for issue resolution.

Required Technical and Professional Expertise

  • Availability and Flexibility: Willingness to work in shifts or support 24 x 7 coverage as per the business needs.
  • Couchbase, Mongo, and Cassandra: Excellent knowledge of Couchbase, with a solid foundation in Mongo and Cassandra.
  • Linux Proficiency: Exceptional knowledge of Linux operating systems.
  • Operation and Support Experience: Demonstrated experience in handling day-to-day operations, alert management, incident support, migration tasks, and break-fix support.
  • Shell Scripting and Ansible Skills: Good knowledge of shell scripting and the Ansible configuration management tool.

Preferred Technical and Professional Expertise

  • Kubernetes/OpenShift: Strongly preferred experience in working with production Kubernetes/OpenShift environments.
  • Change Management Expertise: Experience with change management workflows.
  • ELK/EFK Stack Familiarity: Experience with the ELK/EFK stack, which includes ElasticSearch, Logstash/Fluentd, and Kibana.
  • Distributed Event Streaming Platform Experience: Experience with platforms such as Kafka.
  • SQL and NoSQL Datastore Experience: Experience with SQL and/or NoSQL datastores, including DB2 and Oracle data services.
  • Application Load Balancing Concepts: including F5 and ELB.

Client-provided location(s): Markham, ON, Canada
Job ID: IBM-20941292
Employment Type: Intern

Company Videos

Hear directly from employees about what it is like to work at IBM.