We are seeking a Middle Site Reliability Engineer with a focus on cost savings and maintenance of systems to join our team.
In this role, you'll be crucial in building and supporting robust, high-capacity systems that are efficient and cost-effective. You'll work within our AWS infrastructure, collaborating with product development teams to enhance automation, improve performance, and ensure the reliability of our systems while optimizing costs.
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
Want more jobs like this?
Get jobs in Bahía Blanca, Argentina delivered to your inbox every week.
#LI-DNI
Responsibilities
- Implement and refine cloud cost optimization strategies through analysis and resizing recommendations
- Collaborate with engineering and product teams to create cost-aware architectural solutions
- Develop, maintain, and optimize dashboards for monitoring cloud expenditures
- Identify and leverage AWS cost-saving opportunities such as Reserved Instances and Savings Plans
- Educate and promote a culture of financial responsibility regarding cloud resource usage
- Design, analyze, and troubleshoot highly distributed large-scale production systems and cloud-based services
- Support continuity planning including failure injections and validating monitoring configurations
- Enhance infrastructure scalability plans to handle double the expected load
- Manage middleware, network, storage, database, and server coordination
- Conduct performance testing and tuning for optimized system responsiveness
- Develop and maintain telemetry processes to monitor key operational metrics
- 2+ years of experience as a software engineer developing, debugging, and deploying enterprise applications
- Proven background reporting on cloud infrastructure costs utilizing tools like AWS Cost Explorer
- Proficiency in infrastructure automation technologies such as Terraform
- Capability to manage container orchestration using ECS or Kubernetes
- Versatile troubleshooting skills across hosting technologies including web servers, operating systems, and network components
- Skills in continuous deployment frameworks and lifecycle management (e.g., CI/CD)
- Competency in database operations and deployment with cloud databases like RDS MySQL, Postgres, and Aurora
- Knowledge of caching strategies for high concurrency workloads
- Understanding of Lean/Agile deployment processes such as Blue/Green, ZDT, and Canary
- Familiarity with telemetry SaaS systems including New Relic products like APM and Synthetics
- Strong problem-solving and root cause analysis capabilities
- Excellent communication skills and ability to manage culturally aligned escalation response plans
- English level B2+ for effective communication
- Bachelor's Degree in Computer Science
- Ability to communicate across a broad range of technical and non-technical stakeholders
- Fluency in multiple programming languages including JavaScript, Python, and PHP, among others
- Connectivity Bonus (15,000 ARS are paid with a salary receipt at the end of each month as a non-wages concept)
- Medicina Prepaga (It covers the collaborator and direct family group)
- Paternity Leave (Two additional days are added to what is established by law, total of 4 days)
- Discounts card
- English Training (English lessons, twice per week)
- Training Program (Access to multiple customized training plans according to the needs of each role within the company)
- Marriage bonus (The company doubles the allowance established by law that ANSES offers)
- Referral Program (Referral bonus is paid when the referral of a collaborator joins the Company)
- External Agreements and Discounts
- Vacations: 14 calendar days a year