Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Site Reliability Engineer - PowerVS Network

AT IBM
IBM

Site Reliability Engineer - PowerVS Network

Alajuela, Costa Rica

Introduction
At IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to lead in this new era of technology and solve some of the world's most challenging problems? If so, lets talk.

Your Role and Responsibilities

Site Reliability engineers apply Software Engineering principles to perform infrastructure management tasks more efficiently.
We're seeking skilled, automation-focused Network Engineers to maintain and administer the PowerVS Cloud Infrastructure-as-a-Service environment and provide reliable and secure network operations.

Want more jobs like this?

Get jobs in Alajuela, Costa Rica delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.

The Network Infrastructure operations Site Reliability Engineer works with clients to ensure their specific networking requirements are provided, and handles issues reported by monitoring/automation. Adhering to strict change control, the SRE will make required configuration changes in the environment and perform various updates/upgrades to the Cisco ACI-based software-defined networking environment. Constant attention to automating manual toil is a core focus of this role.
PowerVS is a fast-paced environment, our engineers provide technical support and resolve client networking issues within the PowerVS IaaS offering. They identify repetitive tasks and develop automation to reduce manual toil and seek proactive avoidance of client-impacting events.
Responsibilities:
• identify automation opportunities and develop quality reliable automation scripting (Python, BASH/shell scripting, Ansible, and related technologies) to reduce toil and increase accuracy/reliability in areas such as:
• Maintain and firmware upgrades on Cisco ASR/ACI/Leaf switches and Juniper as per security compliance.
• Create customer network configurations with their unique requirements in the IBM Cloud environment.
• Perform code updates and troubleshoot issues with network automation (PowerNS).
• Troubleshoot network issues - raise support cases with network vendors as required to resolve hardware/software issues.
• investigate and resolve port down/flap/bandwidth issues over ACI/ASR infrastructure, NNI links, and other parts of the network.
• Day to day follow-ups on incidents and collaborating with our Compute, Storage, and Monitoring SRE teams.
• Guide customers to create a DirectLink connections, dedicated links, and configure Megaport VXC Connectivity from Cloud to the client environment.
• Configure, troubleshoot, and resolve datacenter inter-Connectivity using GRE over
NNI links and from the IBM Cloud environment to other cloud and infrastructure. • Configure and troubleshoot PUB-VLAN expansion, ACI/VSRX, and VPC expansion on ACI.
• Enable VPNaaS Connectivity from Power Cloud to Customer servers.
• Coordinating with Architects, Leads and Cloud support teams on new custom configuration setup for customers.

Required Technical and Professional Expertise

Scripting/automation experience
• Scripting and Automation including: Python, Perl, shell scripting (bash, etc), Go, Ansible relating to Cisco enterprise networking.
• Understanding of Code Management and Updates (GitHub, Jira, etc).
• Self-Starter who is a fast learner of technology and willing to experiment to find the best solutions.
• Experience creating basic network configurations and working with a variety of network devices, troubleshooting and programming.
• Expert-level hands on experience configuring and troubleshooting network technologies including: Overall Cisco HW and SW, VLANs, Cisco ACI, Cisco ASRs,
GRE, BGP, DirectLink, IPSEC VPN with Vyatta and vSRX, VPNaaS.
• Good understanding of Juniper Junos, Juniper vSRX.
• Knowledge of restricting traffic onto specific tcp/udp ports and modify static routes when needed upon customer requests.
• troubleshooting on MTU related issues.
• Strong written and verbal communication skills
- Fluent in English

Preferred Technical and Professional Expertise

• 3+ years' experience supporting customers using ServiceNow or Salesforce.
• Bachelor's degree in computer science or related IT field.
• IBM Cloud experience (networking, provisioning, etc)
• Knowledge of IBM Power Systems hardware and virtualization
• Knowledge of Brocade SAN fabrics.
• Knowledge of IBM FlashSystem Storage devices.

Client-provided location(s): Heredia Province, Heredia, Costa Rica
Job ID: IBM-21044570
Employment Type: Full Time

Company Videos

Hear directly from employees about what it is like to work at IBM.