Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Middle Site Reliability Engineer

AT EPAM Systems
EPAM Systems

Middle Site Reliability Engineer

Bahía Blanca, Argentina

We are seeking a Middle Site Reliability Engineer with a focus on cost savings and maintenance of systems to join our team.
In this role, you'll be crucial in building and supporting robust, high-capacity systems that are efficient and cost-effective. You'll work within our AWS infrastructure, collaborating with product development teams to enhance automation, improve performance, and ensure the reliability of our systems while optimizing costs.
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.

Want more jobs like this?

Get jobs in Bahía Blanca, Argentina delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.


#LI-DNI

Responsibilities
  • Implement and refine cloud cost optimization strategies through analysis and resizing recommendations
  • Collaborate with engineering and product teams to create cost-aware architectural solutions
  • Develop, maintain, and optimize dashboards for monitoring cloud expenditures
  • Identify and leverage AWS cost-saving opportunities such as Reserved Instances and Savings Plans
  • Educate and promote a culture of financial responsibility regarding cloud resource usage
  • Design, analyze, and troubleshoot highly distributed large-scale production systems and cloud-based services
  • Support continuity planning including failure injections and validating monitoring configurations
  • Enhance infrastructure scalability plans to handle double the expected load
  • Manage middleware, network, storage, database, and server coordination
  • Conduct performance testing and tuning for optimized system responsiveness
  • Develop and maintain telemetry processes to monitor key operational metrics
Requirements
  • 2+ years of experience as a software engineer developing, debugging, and deploying enterprise applications
  • Proven background reporting on cloud infrastructure costs utilizing tools like AWS Cost Explorer
  • Proficiency in infrastructure automation technologies such as Terraform
  • Capability to manage container orchestration using ECS or Kubernetes
  • Versatile troubleshooting skills across hosting technologies including web servers, operating systems, and network components
  • Skills in continuous deployment frameworks and lifecycle management (e.g., CI/CD)
  • Competency in database operations and deployment with cloud databases like RDS MySQL, Postgres, and Aurora
  • Knowledge of caching strategies for high concurrency workloads
  • Understanding of Lean/Agile deployment processes such as Blue/Green, ZDT, and Canary
  • Familiarity with telemetry SaaS systems including New Relic products like APM and Synthetics
  • Strong problem-solving and root cause analysis capabilities
  • Excellent communication skills and ability to manage culturally aligned escalation response plans
  • English level B2+ for effective communication
Nice to have
  • Bachelor's Degree in Computer Science
  • Ability to communicate across a broad range of technical and non-technical stakeholders
  • Fluency in multiple programming languages including JavaScript, Python, and PHP, among others
We offer
  • Connectivity Bonus (15,000 ARS are paid with a salary receipt at the end of each month as a non-wages concept)
  • Medicina Prepaga (It covers the collaborator and direct family group)
  • Paternity Leave (Two additional days are added to what is established by law, total of 4 days)
  • Discounts card
  • English Training (English lessons, twice per week)
  • Training Program (Access to multiple customized training plans according to the needs of each role within the company)
  • Marriage bonus (The company doubles the allowance established by law that ANSES offers)
  • Referral Program (Referral bonus is paid when the referral of a collaborator joins the Company)
  • External Agreements and Discounts
  • Vacations: 14 calendar days a year
By applying to our role, you are agreeing that your personal data may be used as in set out in EPAM's Privacy Notice and Policy.

Client-provided location(s): Argentina
Job ID: EPAM-epamgdo_blt7121cea226a9ef64_en-us_Other_Argentina
Employment Type: Other