We are looking for a skilled and motivated Mid-Level Site Reliability Engineer (SRE) to join our team. In this role, you will focus on the reliability, performance, and scalability of our infrastructure through automation, monitoring, and continuous improvement. You will work closely with development teams to maintain cloud-based environments and keep operations running smoothly and efficiently. This is an excellent opportunity to grow your skills in a collaborative, fast-paced environment.
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
Responsibilities
- Assist in automating and managing infrastructure using Terraform and other configuration management tools
- Help ensure the reliability and availability of cloud-based services and applications
- Work with teams to improve system performance and optimize infrastructure components
- Monitor system health and performance with tools like CloudWatch and New Relic
- Troubleshoot and resolve issues in infrastructure and systems
- Support and contribute to CI/CD pipeline development using Jenkins, CircleCI, and GitHub Actions
- Participate in on-call rotations to provide operational support
- Collaborate with cross-functional teams to deliver infrastructure improvements and best practices
Requirements
- Experience with Terraform and configuration management tools
- Familiarity with cloud platforms (AWS) and container technologies (ECS, EKS, Docker)
- Strong knowledge of RDBMS (MySQL, PostgreSQL) and basic understanding of NoSQL databases
- Experience with monitoring and alerting tools such as New Relic and CloudWatch
- Understanding of CI/CD tools like Jenkins, CircleCI, and GitHub Actions
- Proficiency in scripting or programming with Python
- B2+ English level (effective written and verbal communication)
- Willingness to participate in on-call rotations and provide operational support
- Strong problem-solving skills and attention to detail
- Self-motivated with the ability to work independently and as part of a team
Nice to have
- Exposure to software development (Java, PHP, Node, GoLang)
- Experience with infrastructure-as-code tools like CloudFormation
- Familiarity with NoSQL databases such as Couchbase and DynamoDB
- Basic knowledge of performance tuning and system optimization
- Experience with tools like PagerDuty and Exigence for operational monitoring and incident response
Benefits
- Connectivity Bonus (15,000 ARS paid at the end of each month, itemized on the payslip as a non-salary benefit)
- Private health insurance (Medicina Prepaga), covering the employee and their immediate family
- Paternity Leave (two days in addition to the legal entitlement, for a total of 4 days)
- Discount card
- English Training (English lessons, twice per week)
- Training Program (Access to multiple customized training plans according to the needs of each role within the company)
- Marriage Bonus (the company doubles the statutory allowance paid by ANSES)
- Referral Program (a bonus is paid when a candidate referred by an employee joins the company)
- External Agreements and Discounts
- Vacation: 14 calendar days a year