We are seeking a skilled Senior DevOps Engineer who can oversee our large-scale infrastructure for high-stakes, public-facing products. Candidates should bring a wealth of hands-on experience and strategic insight to our evolving DevOps operations, driving efficiency and reliability across our systems.
Responsibilities
- Maintain and improve the stability of our infrastructure at scale through site reliability engineering practices
- Develop, implement, and manage CI/CD pipelines, with a focus on automation and higher deployment frequency
- Design and maintain infrastructure on cloud computing platforms such as AWS, GCP, or Azure
- Utilize infrastructure-as-code tools such as Terraform and Ansible for configuration and deployment activities
- Deploy and manage containerized applications using Docker and Kubernetes
- Monitor system health and performance with tools such as Prometheus and Grafana, diagnosing and resolving issues promptly
- Scale and optimize WebSocket-based infrastructure to support substantial traffic loads
- Collaborate cross-functionally to keep project requirements, deadlines, and schedules on track
- Provide detailed documentation and system diagrams to effectively communicate system design and architecture
Requirements
- Background in Site Reliability Engineering with at least 3 years of experience, especially in production environments
- Familiarity with Python or similar OOP languages
- Proficiency in cloud computing platforms including AWS, GCP or Azure
- Expertise in implementing CI/CD processes
- Competency in containerization technologies and orchestration with Docker and Kubernetes
- Skills in monitoring tools like Prometheus and Grafana
- Outstanding problem-solving skills and attention to detail
- Proven track record of delivering reliable, efficient, and scalable infrastructure
Nice to have
- Experience with Azure
- Experience or involvement with ML/AI projects
We offer
- Career plan and real growth opportunities
- Unlimited access to LinkedIn Learning solutions
- International Mobility Plan within 25 countries
- Constant training, mentoring, online corporate courses, eLearning and more
- English classes with a certified teacher
- Support for employee initiatives (algorithms club, Toastmasters, agile club, and more)
- Enjoyable working environment (gaming room, napping area, amenities, events, sports teams, and more)
- Flexible work schedule and dress code
- Collaborate in a multicultural environment and share best practices from around the globe
- Hired directly by EPAM & 100% under payroll
- Benefits required by law (IMSS, INFONAVIT, 25% vacation bonus)
- Insurance: life and major medical expenses, with dental and vision coverage (for the employee and direct family members)
- 13% employee savings fund, capped at the legal limit
- Grocery coupons
- 30-day December bonus
- Employee Stock Purchase Plan
- 12 vacation days plus 4 floating days
- Official Mexican holidays, plus 5 extra holidays (Maundy Thursday, Good Friday, November 2nd, December 24th and 31st)
- Monthly non-taxable amount for the electricity and internet bills
By applying to our role, you are agreeing that your personal data may be used as set out in EPAM's Privacy Notice and Policy.