Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Chief AWS Site Reliability Engineer (SRE)

AT EPAM Systems
EPAM Systems

Chief AWS Site Reliability Engineer (SRE)

Buenos Aires, Argentina

EPAM Systems is looking for a Chief AWS SRE Engineer who fully understands and practices SRE activities and philosophy to join the global engineering team that ensures fleet services reliability and availability under the SRE model.

If you're passionate about innovation, we invite you to apply and become part of our team!
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.

Want more jobs like this?

Get jobs in Buenos Aires, Argentina delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.


#LI-DNI

Responsibilities
  • Collaborate with service teams to improve the reliability and efficiency of workloads and services using SRE practices
  • Develop and improve CI/CD processes to enhance release cadence and success
  • Build, consume toil backlog, automating toilsome tasks
  • Document knowledge and processes
  • Practice and promote sustainable incident response and blameless postmortems
  • Write code that improves scalability, performance, maintainability, and security
  • Implement distributed monitoring practices
  • Refine monitoring processes, configurations, and thresholds
  • Contribute towards the identification and implementation of service level indicators and objectives for workloads and services
Requirements
  • 7+ years of cloud engineering experience, with a good track record of highly scalable, distributed systems projects in the past 5 years
  • Previous experience working as an SRE engaged with active development teams is a must, and the candidate should have a good understanding of SRE methodologies and philosophies
  • AWS cloud expertise
  • Ideally, has experience running multi-region workloads and has in-depth knowledge of most of the commonly used AWS services
  • Observability experience with distributed services, for example, experience of distributed tracing and similar concepts
  • Independent and self-directed people to work alongside client engineering teams under minimal supervision
  • Strong programming and automation experience: Python, Golang
  • Understanding of the software development lifecycle
  • Fluent English communication skills at a B2+ level
We offer
  • Connectivity Bonus (15,000 ARS are paid with a salary receipt at the end of each month as a non-wages concept)
  • Medicina Prepaga (It covers the collaborator and direct family group)
  • Paternity Leave (Two additional days are added to what is established by law, total of 4 days)
  • Discounts card
  • English Training (English lessons, twice per week)
  • Training Program (Access to multiple customized training plans according to the needs of each role within the company)
  • Marriage bonus (The company doubles the allowance established by law that ANSES offers)
  • Referral Program (Referral bonus is paid when the referral of a collaborator joins the Company)
  • External Agreements and Discounts
  • Vacations: 14 calendar days a year
By applying to our role, you are agreeing that your personal data may be used as in set out in EPAM's Privacy Notice and Policy.

Client-provided location(s): Buenos Aires, Argentina
Job ID: EPAM-epamgdo_bltb176f8df92f28c7a_en-us_BuenosAires_Argentina
Employment Type: Other